Alternative Nlp libraries and tools for Python
textacy
Github stargazers
2214
Github forks
250
Commits
1816
Code contributors Contributors
30
NLP, before and after spaCy
Created
Feb. 3, 2016
Updated
April 3, 2023
License
other
Github repo
Type
Module/library
Primary Language, based on Github DataLanguage
Python
Issues
35
CodeSearchNet
Read-only repository, archived by owner Archived
Github stargazers
2204
Github forks
385
Commits
286
Code contributors Contributors
52
Datasets, tools, and benchmarks for representation learning of code.
Created
Feb. 28, 2019
Updated
Jan. 31, 2022
License
mit
Github repo
Type
Tool/utility
Primary Language, based on Github DataLanguage
Jupyter
Issues
14
Homepage
Algorithm_Interview_Notes-Chinese
Github stargazers
2174
Github forks
509
Commits
505
Code contributors Contributors
7
2018/2019/校招/春招/秋招/自然语言处理(NLP)/深度学习(Deep Learning)/机器学习(Machine Learning)/C/C++/Python/面试笔记,此外,还包括创建者看到的所有机器学习/深度学习面经中的问题。 除了其中 DL/ML 相关的,其他与算法岗相关的计算机知识也会记录。 但是不会包括如前端/测试/JAVA/Android等岗位中有关的问题。
Created
Dec. 4, 2018
Updated
Nov. 7, 2018
Github repo
Primary Language, based on Github DataLanguage
Python
Issues
3
lazynlp
Github stargazers
2159
Github forks
311
Commits
14
Code contributors Contributors
4
Library to scrape and clean web pages to create massive datasets.
Created
Feb. 27, 2019
Updated
Oct. 7, 2019
Github repo
Type
Module/library
Primary Language, based on Github DataLanguage
Python
Issues
10
pytextrank
Github stargazers
2137
Github forks
334
Commits
468
Code contributors Contributors
17
Python implementation of TextRank algorithms ("textgraphs") for phrase extraction
Created
Oct. 2, 2016
Updated
May 21, 2024
License
mit
Github repo
Type
Module/library
Primary Language, based on Github DataLanguage
Python
Issues
15
Homepage
news-please
Github stargazers
2071
Github forks
426
Commits
711
Code contributors Contributors
38
news-please - an integrated web crawler and information extractor for news that just works
Created
Dec. 18, 2016
Updated
Sept. 5, 2024
License
apache-2.0
Github repo
Type
Module/library
Primary Language, based on Github DataLanguage
Python
Issues
7
china-dictatorship
Github stargazers
2050
Github forks
229
Commits
1121
Code contributors Contributors
5
反中共政治宣传库。Anti Chinese government propaganda. 住在中国真名用户的网友请别给星星,不然你要被警察请喝茶。常见问答集,新闻集和饭店和音乐建议。卐习万岁卐。冠状病毒审查郝海东新疆改造中心六四事件法轮功 996.ICU709大抓捕巴拿马文件邓家贵低端人口西藏骚乱。Friends who live in China and have real name on account, please don't star this repo, or else the police might pay you a visit. Home to the mega-FAQ, news compilation, restaurant and music recommendations.Heil
Created
April 2, 2015
Updated
Aug. 3, 2022
License
cc-by-sa-4.0
Github repo
Primary Language, based on Github DataLanguage
HTML
Issues
689
DeepLearn
Github stargazers
1821
Github forks
353
Commits
220
Code contributors Contributors
2
Implementation of research papers on Deep Learning+ NLP+ CV in Python using Keras, Tensorflow and Scikit Learn.
Created
May 20, 2017
Updated
Dec. 4, 2022
License
mit
Github repo
Type
Module/library
Primary Language, based on Github DataLanguage
Python
Issues
13
bootcamp
Github stargazers
1861
Github forks
574
Commits
2858
Code contributors Contributors
63
Dealing with all unstructured data, such as reverse image search, audio search, molecular search, video analysis, question and answer systems, NLP, etc.
Created
Aug. 9, 2019
Updated
Sept. 29, 2024
License
apache-2.0
Github repo
Primary Language, based on Github DataLanguage
HTML
Issues
17
Homepage
Awesome-pytorch-list-CNVersion
Github stargazers
1733
Github forks
394
Commits
54
Code contributors Contributors
1
Awesome-pytorch-list 翻译工作进行中......
Created
Sept. 4, 2019
Updated
July 26, 2021
Github repo
Type
Module/library
Primary Language, based on Github DataLanguage
Jupyter
Homepage
TextInfoExp
Github stargazers
1694
Github forks
772
Commits
68
Code contributors Contributors
1
自然语言处理实验(sougou数据集),TF-IDF,文本分类、聚类、词向量、情感识别、关系抽取等
Created
Feb. 27, 2017
Updated
Dec. 16, 2018
Github repo
Primary Language, based on Github DataLanguage
Python
Issues
8
pet
Github stargazers
1623
Github forks
282
Commits
40
Code contributors Contributors
3
This repository contains the code for "Exploiting Cloze Questions for Few-Shot Text Classification and Natural Language Inference"
Created
April 7, 2020
Updated
March 30, 2022
License
apache-2.0
Github repo
Primary Language, based on Github DataLanguage
Python
Issues
30
Homepage
magnitude
Github stargazers
1627
Github forks
119
Commits
350
Code contributors Contributors
4
A fast, efficient universal vector embedding utility package.
Created
Feb. 24, 2018
Updated
July 17, 2020
License
mit
Github repo
Type
Module/library
Primary Language, based on Github DataLanguage
Python
Issues
39
sense2vec
Github stargazers
1621
Github forks
239
Commits
460
Code contributors Contributors
15
🦆 Contextually-keyed word vectors
Created
Jan. 23, 2016
Updated
April 20, 2023
License
mit
Github repo
Primary Language, based on Github DataLanguage
Python
Issues
23
Homepage
usaddress
Github stargazers
1528
Github forks
304
Commits
433
Code contributors Contributors
12
:us: a python library for parsing unstructured United States address strings into address components
Created
July 17, 2014
Updated
Sept. 27, 2024
License
mit
Github repo
Type
Module/library
Primary Language, based on Github DataLanguage
Python
Issues
165
tika-python
Github stargazers
1505
Github forks
234
Commits
475
Code contributors Contributors
55
Tika-Python is a Python binding to the Apache Tika™ REST services allowing Tika to be called natively in the Python community.
Created
June 26, 2014
Updated
Aug. 11, 2023
License
apache-2.0
Github repo
Primary Language, based on Github DataLanguage
Python
Issues
14
budoux
Github stargazers
1436
Github forks
32
Commits
388
Code contributors Contributors
13
--
Created
Nov. 18, 2021
Updated
Sept. 23, 2024
License
apache-2.0
Github repo
Primary Language, based on Github DataLanguage
Python
Issues
8
DataProfiler
Github stargazers
1428
Github forks
160
Commits
596
Code contributors Contributors
51
What's in your data? Extract schema, statistics and entities from datasets
Created
Nov. 9, 2020
Updated
June 14, 2024
License
apache-2.0
Github repo
Primary Language, based on Github DataLanguage
Python
Issues
72
konlpy
Github stargazers
1416
Github forks
333
Commits
611
Code contributors Contributors
33
Python package for Korean natural language processing.
Created
May 1, 2014
Updated
Nov. 10, 2022
License
other
Github repo
Type
Module/library
Primary Language, based on Github DataLanguage
Python
Issues
140
Homepage
gnes
Read-only repository, archived by owner Archived
Github stargazers
1300
Github forks
209
Commits
3190
Code contributors Contributors
9
GNES is Generic Neural Elastic Search, a cloud-native semantic search system based on deep neural network.
Created
July 8, 2019
Updated
Oct. 24, 2019
License
other
Github repo
Type
Module/library
Primary Language, based on Github DataLanguage
Python
Issues
16
Homepage