我有两个datanode服务器的nutch/hadoop。我尝试抓取一些urls,但nutch失败,并显示以下错误:
Fetcher: segment: crawl/segments
Fetcher: org.apache.hadoop.mapred.InvalidInputException: Input path does not exist: hdfs://devcluster01:9000/user/nutch/crawl/segments/crawl_generate
at org.apache.hadoop.mapred.FileInputFormat.listStatus
我有一个mapreduce应用程序,它接受一个HBase源数据,并将其映射为另一个HBase表,所有这些都是用Java.When编写的,我使用
hadoop jar myhbase.jar
它以NullpointerException结尾,如下所示:
14/01/31 11:07:02 INFO zookeeper.ClientCnxn: Socket connection established to 127.0.0.1/127.0.0.1:2181, initiating session
14/01/31 11:07:02 INFO zookeeper.ClientCnxn: Sessi