我下载了apache-nutch-2.1-src.zip并提取了。gora.sqlstore.jdbc.driver=com.mysql.jdbc.Driver gora.sqlstore.jdbc.url=jdbc:mysql://localhost:3306/nutch在/runtime/local/urls目录中添加了带有seeds.txt值的www.apache.nutch.org文件。(NutchJob.java:50)
at org.apache.nut
我第一次安装了纳奇。安装和安装似乎相当顺利。我让它在Windows 7上运行,我为nutch安装设置了类路径。在看到下面显示的错误(缺少主类)后,我麻烦地拍摄了一段时间的设置。C:\Users\Public\PublicApps\apache-nutch-1.12>nutch.bat crawl urls -dir crawl -depth 1 > crawl.log Error: Could not find or load main class org.
首先,我是一个Nutch/Hadoop新手。我已经安装了Cassandra。我已经在EMR集群的主节点上安装了Nutch。org.apache.hadoop.mapred.InvalidInputException: Input path does not exist: file:/home/hadoop/apache-nutch(Injector.java:279)
at org.apache.nutch.crawl.Injector.r