我正在使用Spark编写Java类的代码。我有一个错误:"DataFrame不能解析为类型“,而有关导入的错误:”导入org.apache.spark.sql.DataFrame“不能被解析。这是类导入:
import org.apache.spark.api.java.*;
import org.apache.spark.api.java.function.Function;
import org.apache.spark.sql.DataFrameReader;
import org.apache.spark.sql.Dataset;
import org.apache.spark.sql.Row;
import org.apache.spark.sql.SQLContext;
import org.apache.spark.sql.DataFrame;
这是文件pom.xml:
<project xmlns="http://maven.apache.org/POM/4.0.0" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
xsi:schemaLocation="http://maven.apache.org/POM/4.0.0 http://maven.apache.org/xsd/maven-4.0.0.xsd">
<modelVersion>4.0.0</modelVersion>
<groupId>SparkBD</groupId>
<artifactId>SparkProject</artifactId>
<version>0.0.1-SNAPSHOT</version>
<dependencies>
<dependency> <!-- Spark dependency -->
<groupId>org.apache.spark</groupId>
<artifactId>spark-core_2.11</artifactId>
<version>2.3.0</version>
</dependency>
<dependency>
<groupId>org.apache.spark</groupId>
<artifactId>spark-streaming_2.11</artifactId>
<version>2.3.0</version>
</dependency>
<dependency>
<groupId>org.apache.spark</groupId>
<artifactId>spark-sql_2.11</artifactId>
<version>2.3.0</version>
</dependency>
</dependencies>
</project>
发布于 2018-05-14 08:31:33
DataFrame
已经在Java中被删除(在Scala中,它只是一个别名),在Spark2.0中。您应该用Dataset<Row>
替换它。
import org.apache.spark.sql.Dataset
DataFrame
的地方使用Dataset<Row>
https://stackoverflow.com/questions/50335017
复制