
The values are generated by scripts that run on the page. Your current method does not execute scripts, hence the empty result. You are likely better off using a method that allows scripts to run, such as RSelenium.
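One way to see why a plain HTTP fetch cannot carry your inputs: in the question's URL, everything after `#` is a fragment, which stays in the browser and is never sent to the server, so it is presumably read by the page's scripts. A minimal sketch in JavaScript (the language of the page's own scripts) using the standard `URL` and `URLSearchParams` classes; the shortened URL and the variable names are illustrative only:

```javascript
// Illustrative URL in the same shape as the one in the question (shortened).
const pageUrl = new URL(
  "https://www.kff.org/interactive/subsidy-calculator/#state=ca&zip=94704&income=20000"
);

// What an HTTP request is built from: origin + path + query string.
const requested = pageUrl.origin + pageUrl.pathname + pageUrl.search;

// The fragment stays on the client; only code running in the page can read it.
const inputs = Object.fromEntries(new URLSearchParams(pageUrl.hash.slice(1)));

console.log(requested); // the same static page, whatever the inputs are
console.log(inputs);    // the form values carried in the fragment
```

Whatever values you put after the `#`, a non-browser client requests the same static document, which is consistent with the results box coming back empty.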

The form you complete, #subsidy-form, feeds values into a template in a script tag, #results-template. The associated calculations are in this script: <a href="https://www.kff.org/wp-content/themes/kaiser-foundation-2016/interactives/subsidy-calculator/2019/calculator.js?ver=1.7.7" rel="nofollow noreferrer">https://www.kff.org/wp-content/themes/kaiser-foundation-2016/interactives/subsidy-calculator/2019/calculator.js?ver=1.7.7</a>, where you will find the logic and the preset values, such as the poverty lines for each year.

The simplest quick check is probably to inspect the JavaScript variables when the new <code>SubsidyCalculator</code> object is created to process the form, i.e. the JS starting with <code>var sc = new SubsidyCalculator</code>. You could 'reverse engineer' those variables from your own values plus the values returned from the JSON below. I think, but haven't confirmed, that the JSON supplies, according to zipcode, the 6 variables beginning with <code>kff_sc</code> that are passed into the calculator, e.g. <code>silver: kff_sc.silver</code>. The default values given at the top of the script give you an idea of the ballpark figures.

Figures relating to the zipcode are retrieved from this: <a href="https://www.kff.org/wp-content/themes/kaiser-foundation-2016/interactives/subsidy-calculator/2019/json/zips/94.json" rel="nofollow noreferrer">https://www.kff.org/wp-content/themes/kaiser-foundation-2016/interactives/subsidy-calculator/2019/json/zips/94.json</a>, where the two digits before .json are the first two digits of the zipcode. You can determine this from the input validation script: <a href="https://www.kff.org/wp-content/themes/kaiser-foundation-2016/interactives/subsidy-calculator/2019/shared.js?ver=1.7.7" rel="nofollow noreferrer">https://www.kff.org/wp-content/themes/kaiser-foundation-2016/interactives/subsidy-calculator/2019/shared.js?ver=1.7.7</a>

<div class="snippet" data-lang="js" data-hide="false" data-console="true" data-babel="false">
<div class="snippet-code">
<pre class="snippet-code-js lang-js prettyprint-override"><code>var bucket = $( this ).val().substring( 0, 2 );
		if ( kff_sc.buckets[bucket] ) return;
		$.ajax( '/wp-content/themes/vip/kaiser-foundation-2016/interactives/subsidy-calculator/2019/json/zips/' + bucket + '.json', </code></pre>
</div>
</div>


The first two digits determine the bucket.
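As a rough sketch of that rule, the bucket URL for a given zipcode could be built as follows. `zipBucketUrl` and `ZIPS_BASE` are hypothetical names, not part of the KFF scripts; the base path is taken from the JSON link above, and padding to five digits is an assumption made here so that zipcodes passed as numbers keep their leading zeros:

```javascript
// Hypothetical helper (not part of the KFF scripts): builds the bucket URL for a zipcode.
// Base path copied from the JSON link quoted in this answer.
const ZIPS_BASE =
  "https://www.kff.org/wp-content/themes/kaiser-foundation-2016/interactives/subsidy-calculator/2019/json/zips/";

function zipBucketUrl(zip) {
  // Assumption: zipcodes are 5 digits; pad so a numeric 2139 is treated as "02139".
  const digits = String(zip).padStart(5, "0");
  // Same rule as the site's snippet: the first two characters select the bucket.
  const bucket = digits.substring(0, 2);
  return ZIPS_BASE + bucket + ".json";
}

console.log(zipBucketUrl(94704));
```

The JSON for a bucket can then be downloaded with any HTTP client, since it is a static file rather than something rendered by script.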

All in all, you could likely implement your own calculator, but you would be reinventing the wheel. It seems easier to just automate the browser and then extract the resulting values.

I'm trying to scrape information from <a href="https://www.kff.org/interactive/subsidy-calculator" rel="nofollow noreferrer">https://www.kff.org/interactive/subsidy-calculator</a>. For instance, enter state=California, zip=94704, income=20000, no coverage, 1 person, 1 adult, no children, age=21, no tobacco.

We get the following:
<a href="https://www.kff.org/interactive/subsidy-calculator/#state=ca&amp;zip=94704&amp;income-type=dollars&amp;income=20000&amp;employer-coverage=0&amp;people=1&amp;alternate-plan-family=individual&amp;adult-count=1&amp;adults%5B0%5D%5Bage%5D=21&amp;adults%5B0%5D%5Btobacco%5D=0&amp;child-count=0" rel="nofollow noreferrer">https://www.kff.org/interactive/subsidy-calculator/#state=ca&amp;zip=94704&amp;income-type=dollars&amp;income=20000&amp;employer-coverage=0&amp;people=1&amp;alternate-plan-family=individual&amp;adult-count=1&amp;adults%5B0%5D%5Bage%5D=21&amp;adults%5B0%5D%5Btobacco%5D=0&amp;child-count=0</a>

I would like to get the numbers for "estimated financial help" and "your cost for a silver plan" (they are shown in bold blue in the grey "Results" box; for some reason I can't upload the screenshot). When I use the XPath for these numbers, I get back an empty string. This is not the case if I retrieve some other text that is not in the grey box. I wonder what could be wrong here. I have attached my code below. Please forgive me if this is a stupid question, since I'm very new to web scraping. Thank you!

<pre class="lang-r prettyprint-override"><code>library(rvest)  # provides read_html(), html_nodes(), html_text() and the %&gt;% pipe

state = tolower('CA')
zip = 94704
income = 20000
people = 1
adult = 1
children = 0

url = paste0("https://www.kff.org/interactive/subsidy-calculator/#state=", state, "&amp;zip=", zip, "&amp;income-type=dollars&amp;income=", income, "&amp;employer-coverage=0&amp;people=", people, "&amp;alternate-plan-family=individual&amp;adult-count=", adult, "&amp;adults%5B0%5D%5Bage%5D=21&amp;adults%5B0%5D%5Btobacco%5D=0&amp;child-count=", children)

# This returns empty string
r = read_html(url) %&gt;%
 html_nodes(xpath ='//*[@id="subsidy-calculator-new"]/div[5]/div/div/dl/dd[1]/span') %&gt;% html_text()

# This returns "Number of children (20 and younger) enrolling in Marketplace coverage", a line that's not in the grey box.
r = read_html(url) %&gt;%
 html_nodes(xpath = '//*[@id="subsidy-form"]/div[2]/div[3]/div[3]/p') %&gt;%
 html_text()
</code></pre>

xpath returning empty text when web-scraping in r
