我试图研究句子的结构相似性,特别是动词、动词、名词的位置。例如,我有三个(或更多)句子,如下所示:
I ate an apple pie, yesterday.
I ate an orange, yesterday.
I eat a lemon, today.
所有这些词都以代词(I)开头,然后是动词(吃/吃)和名词(苹果饼、橙、柠檬),最后是副词(昨天/明天)。
我想知道是否有一种方法来识别结构,即句子中的代名词动词名词副词。
如果我把它看作是一只熊猫
SENTENCE
I ate an apple pie, yesterday.
I ate an orange, yesterday.
I eat a lemon, today.
我需要做如下的事情:
SENTENCE STRUCTURE
I ate an apple pie, yesterday. PRONOUN VERB NOUN ADJECTIVE
I ate an orange, yesterday. PRONOUN VERB NOUN ADJECTIVE
I eat a lemon, today. PRONOUN VERB NOUN ADJECTIVE
你知道我怎样才能得到这个(或类似的)结果吗?
发布于 2020-09-06 12:28:50
下面是一个使用spacy的简单示例:
import spacy
import pandas as pd
# load english language model
nlp = spacy.load('en_core_web_sm',disable=['ner','textcat'])
text = "I ate an apple pie, yesterday."
# create spacy
doc = nlp(text)
pos = ""
for token in doc:
pos += token.pos_ + " "
# create dataframe
df = pd.DataFrame([[text, pos]], columns=['Sentence', 'Structure'])
print(df)
产出如下:
Sentence Structure
0 I ate an apple pie, yesterday. PRON VERB DET NOUN NOUN PUNCT NOUN PUNCT
https://stackoverflow.com/questions/63763542
复制相似问题