I have two CSV files, each about 3 GB in size, that I need to compare, storing the differences in a third file.
Python code:
with open('JUN-01.csv', 'r') as f1:
    file1 = f1.readlines()
with open('JUN-02.csv', 'r') as f2:
    file2 = f2.readlines()
with open('JUN_Updates.csv', 'w') as outFile:
    outFile.write(file1[0])
    for line in file2:
        if line not in file1:
            outFile.write(line)
Execution time: 45 minutes and still running...
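The likely bottleneck is `line not in file1`: `file1` is a list, so every membership test scans it from the start, and the loop does that once per line of the second file. A minimal sketch of one common alternative follows, assuming the lines of the first file fit in memory as a set. The function name `write_new_lines` and its parameters are hypothetical, introduced only for illustration. It is meant to keep the same output as the code above while replacing the list scan with a hash lookup and streaming the second file instead of loading it.

```python
def write_new_lines(old_path, new_path, out_path):
    """Write the header of old_path, then every line of new_path
    that does not appear anywhere in old_path.

    Assumption: all distinct lines of old_path fit in memory as a set.
    Returns the number of data lines written (excluding the header).
    """
    with open(old_path, 'r') as old_file:
        header = old_file.readline()
        # A set gives average constant-time membership tests,
        # unlike a list, which is scanned linearly.
        seen = {header}
        seen.update(old_file)  # iterating a file yields one line at a time

    written = 0
    with open(new_path, 'r') as new_file, open(out_path, 'w') as out_file:
        out_file.write(header)
        # Stream the second file instead of calling readlines() on it.
        for line in new_file:
            if line not in seen:
                out_file.write(line)
                written += 1
    return written
```

With the file names from the question, the call would be `write_new_lines('JUN-01.csv', 'JUN-02.csv', 'JUN_Updates.csv')`. If holding every line of a 3 GB file in a set is too much memory, storing a fixed-size hash of each line instead of the line itself is one possible variation, at the cost of a small chance of collisions.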
https://stackoverflow.com/questions/50678710
Possible duplicate of a similar question