程序師世界是廣大編程愛好者互助、分享、學習的平台,程序師世界有你更精彩!
首頁
編程語言
C語言|JAVA編程
Python編程
網頁編程
ASP編程|PHP編程
JSP編程
數據庫知識
MYSQL數據庫|SqlServer數據庫
Oracle數據庫|DB2數據庫
您现在的位置: 程式師世界 >> 編程語言 >  >> 更多編程語言 >> Python

Python naturallanguageprocessing (I) word frequency statistics

編輯:Python
import jieba
txt = open("lg.txt", "r", encoding="gb18030").read()
import collections
txt1 = txt
txt1 = txt1.replace('\n', '') # Delete line breaks
txt1 = txt1.replace(',', '') # Delete comma
txt1 = txt1.replace('.', '') # Delete the full stop
mylist = list(txt1)
mycount = collections.Counter(mylist)
for key, val in mycount.most_common(10): # Orderly ( Return to the former 10 individual )
print(key, val)
 38618
了 21157
. 20313
Of 15604
No 14958
One 12107
: 11710
Come on 11405
Avenue 11029
“ 10983

  1. 上一篇文章:
  2. 下一篇文章:
Copyright © 程式師世界 All Rights Reserved