Classifying keywords by root word in Python

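# -*- coding: UTF-8 -*-
import jieba

# Load the jieba dictionary up front instead of lazily on the first call.
jieba.initialize()

# Category name -> root words. A keyword belongs to a category
# if any of that category's root words appears among its tokens.
fenlei = {
    '器械': ["二类", "三类"],
    '医美': ["医疗", "备案"],
    '保安': ["保安", "服务"],
}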

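# Read one keyword per line, skipping blank lines; the file is closed automatically.
with open('keyword.txt', 'r', encoding='utf-8') as keyword:
    keywords = [x.strip() for x in keyword if x.strip()]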
# Result buckets: one per category, plus '其他' ("other") for keywords that match nothing.
new_k = {
    '器械': [],
    '医美': [],
    '保安': [],
    '其他': [],
}
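for i in keywords:
    res = jieba.lcut(i)  # segment the keyword into a list of tokens
    for k, v in fenlei.items():
        if set(v) & set(res):  # at least one root word is among the tokens
            new_k[k].append(i)
            break  # first matching category wins
    else:
        # for/else: runs only if the loop finished without break (no category matched)
        new_k['其他'].append(i)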

# Write each group to a file named after its category, one keyword per line.
for k, v in new_k.items():
    with open(k, 'w', encoding='utf-8') as fd:
        fd.writelines(f'{x}\n' for x in v)
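
The grouping logic above can also be wrapped in a reusable function. The following is only a minimal sketch, not part of the original script: the names `classify_keywords`, `whitespace_tokenize`, and the `fallback` parameter are hypothetical, and the tokenizer is passed in as an argument so the demo runs without jieba installed. The sample keywords are made up and pre-split with spaces for illustration.

```python
def classify_keywords(keywords, categories, tokenize, fallback="其他"):
    """Group keywords under the first category whose root words appear among their tokens."""
    groups = {name: [] for name in categories}
    groups[fallback] = []
    for kw in keywords:
        tokens = set(tokenize(kw))
        for name, roots in categories.items():
            if tokens & set(roots):
                groups[name].append(kw)
                break  # first matching category wins
        else:
            # No category matched this keyword.
            groups[fallback].append(kw)
    return groups


def whitespace_tokenize(text):
    """Stand-in tokenizer (an assumption for this demo): split on whitespace."""
    return text.split()


categories = {
    "器械": ["二类", "三类"],
    "医美": ["医疗", "备案"],
    "保安": ["保安", "服务"],
}
# Made-up sample keywords, already separated by spaces.
sample = ["二类 器械 备案", "医疗 美容", "保安 公司", "注册 商标"]
result = classify_keywords(sample, categories, whitespace_tokenize)
```

With real, unsegmented keywords, `jieba.lcut` could be passed as `tokenize` in place of the stand-in. Note that the first sample matches root words of both '器械' and '医美' but lands only in '器械', because categories are checked in dictionary order and the loop stops at the first hit.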

 


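Copyright notice: This is an original article by haohaomax1, licensed under CC 4.0 BY-SA. When reposting, please include a link to the original source and this notice.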