我主要想把一个大文件分解成更小的文件。
我使用流是因为我不想将大文件保存在我的磁盘中。
我所看到的是类似于:
sed -n 'a,bp,' #this uses lines in file while i want bytes
或者:
cat filename|head -c a| tail -c (a-b) # this way takes too long with big files
Q: What is the largest possible size of an ext3 filesystem and of files on ext3?
Ext3 can support files up to 1TB. With a 2.4 kernel the filesystem size is limited by the maximal block device size, which is 2TB. In 2.6 the maximum (32-bit CPU) limit is of block devices is 16TB, but ext3 supports on
以下代码在r中工作:
pdf <- pdf_text("xyz.pdf")
text <- c(pdf)
text_df <- tibble(line = 1:2, text = text)
words <- text_df %>%
unnest_tokens(word, text)
x <- words
y <- gsub("apple","fruit", x)
y
我需要帮助的是为潜艇添加多个条件:
我还想用“香蕉”、“水果”、“南瓜”、“蔬菜”代替。
我能为一份大文件列一张清单吗?
谢谢