Q: Extracting an untagged element with the Symfony DomCrawler

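Stack Overflow user

Asked 2015-07-02 11:32:14

Answers: 2, Views: 407, Followers: 0, Votes: 2

How can I extract an untagged element with the Symfony crawler? For example, in the sample HTML below, I want to extract Hello World.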

<strong>title</strong>Hello World<strong>Sub-Title</strong><div>This is just stuff</div>

php

symfony

domcrawler

2 Answers

Stack Overflow user

Accepted answer

Posted 2015-07-02 12:46:25

You can do this easily with PHP DOM ;)

$dom = new DOMDocument();
$dom->loadHTML('<strong>title</strong>Hello World<strong>Sub-Title</strong><div>This is just stuff</div>');
$xpath = new DOMXPath($dom);
// use the fact that PHP DOM wraps everything into the body and get the text()
$entries = $xpath->query('//body/text()');
foreach ($entries as $entry) {
    echo $entry->nodeValue;
}
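
The query above collects every text node that sits directly under the body, so any other untagged text in the markup would be echoed as well. As a minimal sketch of a narrower selection, the XPath below anchors on the first strong element and takes only the first text node that follows it as a sibling. The helper name textAfterFirstStrong is hypothetical and not part of the original answer; it assumes the wanted text always comes right after the first strong tag.

```php
<?php
// Hypothetical helper (not from the original answer): returns the text node
// that directly follows the first <strong> element, or null if there is none.
function textAfterFirstStrong(string $html): ?string
{
    $dom = new DOMDocument();

    // Collect libxml parse warnings internally instead of emitting them,
    // then restore the previous error-handling setting.
    $previous = libxml_use_internal_errors(true);
    $dom->loadHTML($html);
    libxml_clear_errors();
    libxml_use_internal_errors($previous);

    $xpath = new DOMXPath($dom);

    // (//strong)[1] is the first <strong> in document order;
    // following-sibling::text()[1] is the nearest text node after it.
    $nodes = $xpath->query('(//strong)[1]/following-sibling::text()[1]');

    if ($nodes === false || $nodes->length === 0) {
        return null;
    }

    return trim($nodes->item(0)->nodeValue);
}

$html = '<strong>title</strong>Hello World<strong>Sub-Title</strong><div>This is just stuff</div>';
echo textAfterFirstStrong($html), PHP_EOL; // prints "Hello World"
```

Because the expression selects a sibling of the strong element, it does not depend on which parent element the parser wraps the fragment in.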

Votes: 0

Stack Overflow user

Posted 2019-09-26 07:43:46

I have a better way for you:

$ExtractText = $crawler->filter('strong')->eq(1)->text();

This gets the text of the strong tag at index 1 (Sub-Title), since the title is at index 0.

Votes: 1

Original page content provided by Stack Overflow. Translation support provided by the Tencent Cloud Xiaowei IT-domain engine.

Original link:

https://stackoverflow.com/questions/31183435
