blocks|key|2119909|text|如果你想解析响应，JsDom可以很好地实现这样的功能。|type|unstyled|depth|inlineStyleRanges|entityRanges|offset|length|data|2119910|++++var+request+=+require('request'),
++++jsdom+=+require('jsdom');

request({+uri:'http://www.myawesomepage.com/'+},+function+(error,+response,+body)+{
++if+(error+&&+response.statusCode+!==+200)+{
++++console.log('Error+when+contacting+myawesomepage.com')
++}

++jsdom.env({
++++html:+body,
++++scripts:+[
++++++'http://code.jquery.com/jquery-1.5.min.js'
++++]
++},+function+(err,+window)+{
++++var+$+=+window.jQuery;

++++//+jQuery+is+now+loaded+on+the+jsdom+window+created+from+'agent.body'
++++console.log($('body').html());
++});
});|code-block|syntax|javascript|2119911|另外，如果您的页面加载了大量javascript/ajax内容，则可能需要考虑使用phantomjs源http://blog.nodejitsu.com/jsdom-jquery-in-5-lines-on-nodejs/|2119912|entityMap|0|LINK|mutability|MUTABLE|url|https://github.com/tmpvar/jsdom|1|http://phantomjs.org/|2|http://blog.nodejitsu.com/jsdom-jquery-in-5-lines-on-nodejs/^0|9|5|0|0|0|15|9|1|1F|1O|2|0^^$0|@$1|2|3|4|5|6|7|W|8|@]|9|@$A|X|B|Y|1|Z]]|C|$]]|$1|D|3|E|5|F|7|10|8|@]|9|@]|C|$G|H]]|$1|I|3|J|5|6|7|11|8|@]|9|@$A|12|B|13|1|14]|$A|15|B|16|1|17]]|C|$]]|$1|K|3|-4|5|6|7|18|8|@]|9|@]|C|$]]]|L|$M|$5|N|O|P|C|$Q|R]]|S|$5|N|O|P|C|$Q|T]]|U|$5|N|O|P|C|$Q|V]]]]

<a href="https://github.com/tmpvar/jsdom" rel="nofollow">JsDom</a> is pretty good to achieve things like this if you want to parse the response.

<pre><code> var request = require('request'),
 jsdom = require('jsdom');

request({ uri:'http://www.myawesomepage.com/' }, function (error, response, body) {
 if (error &amp;&amp; response.statusCode !== 200) {
 console.log('Error when contacting myawesomepage.com')
 }

 jsdom.env({
 html: body,
 scripts: [
 'http://code.jquery.com/jquery-1.5.min.js'
 ]
 }, function (err, window) {
 var $ = window.jQuery;

 // jQuery is now loaded on the jsdom window created from 'agent.body'
 console.log($('body').html());
 });
});
</code></pre>

also if your page has lot of javascript/ajax content being loaded you might want to consider using <a href="http://phantomjs.org/" rel="nofollow">phantomjs</a>
Source <a href="http://blog.nodejitsu.com/jsdom-jquery-in-5-lines-on-nodejs/" rel="nofollow">http://blog.nodejitsu.com/jsdom-jquery-in-5-lines-on-nodejs/</a>

blocks|key|776717|text|var+request+=+require("request");

var+parseMyAwesomeHtml+=+function(html)+{
++++//Have+at+it
};

request("http://www.myawesomepage.com/",+function+(error,+response,+body)+{
++++if+(!error)+{
++++++++parseMyAwesomeHtml(body);
++++}+else+{
++++++++console.log(error);
++++}
});|type|code-block|depth|inlineStyleRanges|entityRanges|data|syntax|javascript|776718|编辑:正如Kishore所说，有很好的解析选项可用。如果你在windows上遇到了jsdom的python/gyp问题，也可以看看cheerio。Cheerio+on+github|unstyled|offset|length|776719|entityMap|0|LINK|mutability|MUTABLE|url|https://github.com/MatthewMueller/cheerio^0|0|21|H|0|0^^$0|@$1|2|3|4|5|6|7|Q|8|@]|9|@]|A|$B|C]]|$1|D|3|E|5|F|7|R|8|@]|9|@$G|S|H|T|1|U]]|A|$]]|$1|I|3|-4|5|F|7|V|8|@]|9|@]|A|$]]]|J|$K|$5|L|M|N|A|$O|P]]]]

<pre><code>var request = require("request");

var parseMyAwesomeHtml = function(html) {
 //Have at it
};

request("http://www.myawesomepage.com/", function (error, response, body) {
 if (!error) {
 parseMyAwesomeHtml(body);
 } else {
 console.log(error);
 }
});
</code></pre>

Edit: As Kishore noted, there are nice options for parsing available. Also see cheerio if you have python/gyp issues with jsdom on windows. <a href="https://github.com/MatthewMueller/cheerio" rel="noreferrer">Cheerio on github</a>

blocks|key|2114748|text|该request()调用是异步的，因此该响应仅在回调中可用。你必须从它调用你的解析函数：|type|unstyled|depth|inlineStyleRanges|offset|length|style|CODE|entityRanges|data|2114749|function+parse_my_awesome_html(text){
++++...
}

request("http://www.myawesomepage.com/",+function+(error,+response,+body)+{
++++parse_my_awesome_html(body)
})|code-block|syntax|javascript|2114750|习惯于链接回调，这基本上就是javascript中任何I/O发生的方式:)|2114751|entityMap^0|1|9|0|0|0^^$0|@$1|2|3|4|5|6|7|O|8|@$9|P|A|Q|B|C]]|D|@]|E|$]]|$1|F|3|G|5|H|7|R|8|@]|D|@]|E|$I|J]]|$1|K|3|L|5|6|7|S|8|@]|D|@]|E|$]]|$1|M|3|-4|5|6|7|T|8|@]|D|@]|E|$]]]|N|$]]

That <code>request()</code> call is asynchronous, so the response is only available inside the callback. You have to call your parse function from it:

<pre><code>function parse_my_awesome_html(text){
 ...
}

request("http://www.myawesomepage.com/", function (error, response, body) {
 parse_my_awesome_html(body)
})
</code></pre>

Get used to chaining callbacks, that's essentially how any I/O will happen in javascript :)

I'm apparently a little newer to Javascript than I'd care to admit. I'm trying to pull a webpage using Node.js and save the contents as a variable, so I can parse it however I feel like.

In Python, I would do this:

<pre><code>from bs4 import BeautifulSoup # for parsing
import urllib

text = urllib.urlopen("http://www.myawesomepage.com/").read()

parse_my_awesome_html(text)
</code></pre>

How would I do this in Node?
I've gotten as far as:

<pre><code>var request = require("request");
request("http://www.myawesomepage.com/", function (error, response, body) {
 /*
 Something here that lets me access the text
 outside of the closure

 This doesn't work:
 this.text = body;
 */ 
})
</code></pre>

Node.js Saving a GET request's HTML response

翻译质量差，导致语言生硬或混乱。

没有提供实际的解决方法或示例。

解答不清晰，无法理解或解决问题。

页面排版不美观，阅读体验差。

文章

问答

视频

学习中心

腾讯云实验室

直播

竞赛

腾讯云代码分析专区

腾讯iOA零信任安全管理系统专区

腾讯云架构师技术同盟交流圈

腾讯云数据库专区

腾讯云顾问专区

腾讯云原生专区

腾讯混元专区

腾讯云TCE专区

腾讯云Lighthouse专区

腾讯云HAI专区

腾讯云Edgeone专区

腾讯云存储专区

腾讯云智能专区

腾讯轻联专区 

腾讯云开发专区

TAPD专区

腾讯轻量云游戏服专区

腾讯云最具价值专家

腾讯云架构师技术同盟

腾讯云创作之星

腾讯云开发者先锋

腾讯云代码助手

云原生构建

TAPD 敏捷项目管理

Cloud Studio

SDK中心

API中心

命令行工具

涵盖代码开发、场景应用、自动测试全流程，助你从零构建专属AI助手

一站式MCP教程库，解锁AI应用新玩法

我显然是Javascript的新手，我不愿意承认这一点。我正在尝试使用Node.js拉取一个网页，并将其内容保存为一个变量，这样我就可以随心所欲地解析它。在Python中，我会这样做：from bs4 import BeautifulSoup # for parsingimport urllibtext = urlli...

问Node.js保存GET请求的超文本标记语言响应
EN

Stack Overflow用户

社区

活动

圈层

关于

腾讯云开发者

热门产品

热门推荐

更多推荐

问Node.js保存GET请求的超文本标记语言响应EN