我有一个日志文件:
19-3-2020 01:37:31.995 INFO 18 188 mailbox allocated for rsvp
19-3-2020 01:37:32.039 INFO 14 194 creating mailslot for dump
19-3-2020 01:37:32.082 INFO 18 194 out of INFO allcations
19-3-2020 01:37:32.119 INFO 18 188 creating mailslot for RSVP client API
19-3-2020 01:37
我想要做的基本上是从日志文件的处理文件中提取关键字,并创建这些关键字的向量化数据。但是,当我将该数据写入CSV时,单词在列中,它们各自的值在第二行中。而I want the words to be in rows and their value in second column.
trial.py:
import re
import pandas as pd
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.feature_extraction.text import ENGLISH_STOP_WO