我正在尝试抓取一个本地HTML文件,以便获得页面标题。我使用的是playwright节点Js库,但我无法将其通过,请给我帮助这里是代码
import playwright from "playwright";
async function main() {
const browser = await playwright.chromium.launch({
headless: true// setting this to true will not run the UI
});
const page = await browser.newPage();
await page.goto('C:\Users\weske\Desktop\WebScraping\index.html');
let title = await page.title();
console.log(title);
await page.waitForTimeout(5000); // wait for 5 seconds
await browser.close();
}
main();
// error messages
(node:15800) UnhandledPromiseRejectionWarning: page.goto: net::ERR_FILE_NOT_FOUND at
c:UsersweskeDesktopWebScrapingindex.html
=========================== logs ===========================
navigating to "c:UsersweskeDesktopWebScrapingindex.html", waiting until "load"
============================================================
at main (file:///C:/Users/weske/Desktop/WebScraping/node.js:11:13)
(Use `node --trace-warnings ...` to show where the warning was created)
(node:15800) UnhandledPromiseRejectionWarning: Unhandled promise rejection. This error
originated either by throwing inside of an
async function without a catch block, or by rejecting a promise which was not handled with
.catch(). To terminate the node process on unhandled promise rejection, use the CLI flag `--
unhandled-rejections=strict` (see
https://nodejs.org/api/cli.html#cli_unhandled_rejections_mode). (rejection id: 1)
(node:15800) [DEP0018] DeprecationWarning: Unhandled promise rejections are deprecated. In
the future, promise rejections that are not handled will terminate the Node.js process with a
non-zero exit code.
发布于 2021-11-17 19:04:08
请始终确保将文件路径放入如下格式
`file://C:/Users/weske/Desktop/Webscraping/index.html`
https://stackoverflow.com/questions/70009973
复制相似问题