Alternative NLP libraries and tools for Python
Updated: April 23, 2024
textacy
GitHub stargazers: 2173
GitHub forks: 246
Commits: 1816
Code contributors: 30
NLP, before and after spaCy
Created: Feb. 3, 2016
Updated: April 3, 2023
License: other
Type: Module/library
Primary language (based on GitHub data): Python
Issues: 35
lazynlp
GitHub stargazers: 2150
GitHub forks: 311
Commits: 14
Code contributors: 4
Library to scrape and clean web pages to create massive datasets.
Created: Feb. 27, 2019
Updated: Oct. 7, 2019
Type: Module/library
Primary language (based on GitHub data): Python
Issues: 10
CodeSearchNet
Archived (read-only repository, archived by owner)
GitHub stargazers: 2113
GitHub forks: 377
Commits: 286
Code contributors: 52
Datasets, tools, and benchmarks for representation learning of code.
Created: Feb. 28, 2019
Updated: Jan. 31, 2022
License: mit
Type: Tool/utility
Primary language (based on GitHub data): Jupyter
Issues: 14
pytextrank
GitHub stargazers: 2096
GitHub forks: 335
Commits: 468
Code contributors: 17
Python implementation of TextRank algorithms ("textgraphs") for phrase extraction
Created: Oct. 2, 2016
Updated: Feb. 21, 2024
License: mit
Type: Module/library
Primary language (based on GitHub data): Python
Issues: 7
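Algorithm_Interview_Notes-Chinese
GitHub stargazers: 2058
GitHub forks: 512
Commits: 505
Code contributors: 7
2018/2019 campus, spring, and autumn recruitment interview notes covering natural language processing (NLP), deep learning, machine learning, C, C++, and Python. The notes also include every question from the machine learning and deep learning interview write-ups the creator has seen. Besides the DL/ML material, other computer science knowledge relevant to algorithm positions is recorded as well. Questions for positions such as front-end, testing, Java, or Android are not included.
Created: Dec. 4, 2018
Updated: Nov. 7, 2018
Primary language (based on GitHub data): Python
Issues: 3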
news-please
GitHub stargazers: 1927
GitHub forks: 403
Commits: 711
Code contributors: 32
news-please - an integrated web crawler and information extractor for news that just works
Created: Dec. 18, 2016
Updated: Dec. 27, 2023
License: apache-2.0
Type: Module/library
Primary language (based on GitHub data): Python
Issues: 16
DeepLearn
GitHub stargazers: 1817
GitHub forks: 360
Commits: 220
Code contributors: 2
Implementations of research papers on deep learning, NLP, and CV in Python using Keras, TensorFlow, and scikit-learn.
Created: May 20, 2017
Updated: Dec. 4, 2022
License: mit
Type: Module/library
Primary language (based on GitHub data): Python
Issues: 13
china-dictatorship
GitHub stargazers: 1787
GitHub forks: 205
Commits: 1121
Code contributors: 5
Anti Chinese government propaganda. Friends who live in China and have real name on account, please don't star this repo, or else the police might pay you a visit. Home to the mega-FAQ, news compilation, restaurant and music recommendations. 卐 Long live Xi 卐. Coronavirus censorship, Hao Haidong, Xinjiang re-education camps, June Fourth incident, Falun Gong, 996.ICU, 709 crackdown, Panama Papers, Deng Jiagui, "low-end population", Tibet unrest.
Created: April 2, 2015
Updated: Aug. 2, 2022
License: cc-by-sa-4.0
Primary language (based on GitHub data): HTML
Issues: 632
Awesome-pytorch-list-CNVersion
GitHub stargazers: 1701
GitHub forks: 397
Commits: 54
Code contributors: 1
Awesome-pytorch-list: translation work in progress...
Created: Sept. 4, 2019
Updated: July 26, 2021
Type: Module/library
Primary language (based on GitHub data): Jupyter
TextInfoExp
GitHub stargazers: 1664
GitHub forks: 772
Commits: 68
Code contributors: 1
Natural language processing experiments (sougou dataset): TF-IDF, text classification, clustering, word vectors, sentiment recognition, relation extraction, and more.
Created: Feb. 27, 2017
Updated: Dec. 16, 2018
Primary language (based on GitHub data): Python
Issues: 8
bootcamp
GitHub stargazers: 1615
GitHub forks: 537
Commits: 2858
Code contributors: 56
Dealing with all unstructured data, such as reverse image search, audio search, molecular search, video analysis, question and answer systems, NLP, etc.
Created: Aug. 9, 2019
Updated: April 23, 2024
License: apache-2.0
Primary language (based on GitHub data): HTML
Issues: 14
magnitude
GitHub stargazers: 1612
GitHub forks: 118
Commits: 350
Code contributors: 4
A fast, efficient universal vector embedding utility package.
Created: Feb. 24, 2018
Updated: July 17, 2020
License: mit
Type: Module/library
Primary language (based on GitHub data): Python
Issues: 39
pet
GitHub stargazers: 1610
GitHub forks: 286
Commits: 40
Code contributors: 3
This repository contains the code for "Exploiting Cloze Questions for Few-Shot Text Classification and Natural Language Inference"
Created: April 7, 2020
Updated: March 30, 2022
License: apache-2.0
Primary language (based on GitHub data): Python
Issues: 30
sense2vec
GitHub stargazers: 1594
GitHub forks: 237
Commits: 460
Code contributors: 15
🦆 Contextually-keyed word vectors
Created: Jan. 23, 2016
Updated: April 20, 2023
License: mit
Primary language (based on GitHub data): Python
Issues: 24
usaddress
GitHub stargazers: 1487
GitHub forks: 292
Commits: 433
Code contributors: 14
🇺🇸 A Python library for parsing unstructured United States address strings into address components
Created: July 17, 2014
Updated: March 17, 2022
License: mit
Type: Module/library
Primary language (based on GitHub data): Python
Issues: 160
tika-python
GitHub stargazers: 1412
GitHub forks: 233
Commits: 475
Code contributors: 55
Tika-Python is a Python binding to the Apache Tika™ REST services allowing Tika to be called natively in the Python community.
Created: June 26, 2014
Updated: Aug. 11, 2023
License: apache-2.0
Primary language (based on GitHub data): Python
Issues: 8
konlpy
GitHub stargazers: 1390
GitHub forks: 330
Commits: 611
Code contributors: 33
Python package for Korean natural language processing.
Created: May 1, 2014
Updated: Nov. 10, 2022
License: other
Type: Module/library
Primary language (based on GitHub data): Python
Issues: 139
budoux
GitHub stargazers: 1376
GitHub forks: 30
Commits: 388
Code contributors: 12
--
Created: Nov. 18, 2021
Updated: April 19, 2024
License: apache-2.0
Primary language (based on GitHub data): Python
Issues: 9
DataProfiler
GitHub stargazers: 1360
GitHub forks: 154
Commits: 596
Code contributors: 48
What's in your data? Extract schema, statistics and entities from datasets
Created: Nov. 9, 2020
Updated: March 6, 2024
License: apache-2.0
Primary language (based on GitHub data): Python
Issues: 64
gnes
Archived (read-only repository, archived by owner)
GitHub stargazers: 1300
GitHub forks: 212
Commits: 3190
Code contributors: 9
GNES is Generic Neural Elastic Search, a cloud-native semantic search system based on deep neural networks.
Created: July 8, 2019
Updated: Oct. 24, 2019
License: other
Type: Module/library
Primary language (based on GitHub data): Python
Issues: 16