Alternative Data analysis libraries for Python
Updated :
Dec. 9, 2022
funNLP
Github stargazers
45413
Github forks
11915
Commits
143
Code contributors Contributors
10
中英文敏感词、语言检测、中外手机/电话归属地/运营商查询、名字推断性别、手机号抽取、身份证抽取、邮箱抽取、中日文人名库、中文缩写库、拆字词典、词汇情感值、停用词、反动词表、暴恐词表、繁简体转换、英文模拟中文发音、汪峰歌词生成器、职业名称词库、同义词库、反义词库、否定词库、汽车品牌词库、汽车零件词库、连续英文切割、各种中文词向量、公司名字大全、古诗词库、IT词库、财经词库、成语词库、地名词库、历史名人词库、诗词词库、医学词库、饮食词库、法律词库、汽车词库、动物词库、中文聊天语料、中文谣言数据、百度中文问答数据集、句子相似度匹配算法集合、bert资源、文本生成&摘要相关工具、cocoNLP信息抽取工具、国内电话号码正则匹配、清华大学XLORE:中英文跨语言百科知识图谱、清华大学人工智能技术系列报告、自然语言生成、N
Created
Aug. 21, 2018
Updated
Nov. 17, 2022
Github repo
Type
Module/library
Primary Language, based on Github DataLanguage
Python
Issues
12
pandas
Github stargazers
36182
Github forks
15
Commits
30879
Code contributors Contributors
2
Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
Created
Aug. 24, 2010
Updated
Dec. 9, 2022
License
bsd-3-clause
Github repo
Type
Module/library
Primary Language, based on Github DataLanguage
Python
Issues
3685
dlib
Github stargazers
11576
Github forks
3
Commits
8133
Code contributors Contributors
165
A toolkit for making real world machine learning and data analysis applications in C++
Created
Jan. 29, 2014
Updated
Nov. 29, 2022
License
bsl-1.0
Github repo
Type
Module/library
Primary Language, based on Github DataLanguage
C++
Issues
42
Homepage
dev-setup
Github stargazers
5848
Github forks
1147
Commits
356
Code contributors Contributors
16
macOS development environment setup: Easy-to-understand instructions with automated setup scripts for developer tools like Vim, Sublime Text, Bash, iTerm, Python data analysis, Spark, Hadoop MapReduce, AWS, Heroku, JavaScript web development, Android development, common data stores, and dev-based OS X defaults.
Created
July 8, 2015
Updated
April 13, 2019
License
other
Github repo
Type
Tool/utility
Primary Language, based on Github DataLanguage
Python
Issues
35
machine-learning-mindmap
Github stargazers
5661
Github forks
987
Commits
41
Code contributors Contributors
3
A mindmap summarising Machine Learning concepts, from Data Analysis to Deep Learning.
Created
Aug. 10, 2017
Updated
May 17, 2019
License
apache-2.0
Github repo
Issues
4
Ai-Learn
Github stargazers
5606
Github forks
1437
Commits
80
Code contributors Contributors
1
人工智能学习路线图,整理近200个实战案例与项目,免费提供配套教材,零基础入门,就业实战!包括:Python,数学,机器学习,数据分析,深度学习,计算机视觉,自然语言处理,PyTorch tensorflow machine-learning,deep-learning data-analysis data-mining mathematics data-science artificial-intelligence python tensorflow tensorflow2 caffe keras pytorch algorithm numpy pandas matplotlib seaborn nlp cv等热门领域
Created
Jan. 28, 2020
Updated
June 23, 2022
Github repo
Issues
19
Data-Analysis-and-Machine-Learning-Projects
Github stargazers
5408
Github forks
1963
Commits
113
Code contributors Contributors
10
Repository of teaching materials, code, and data for my data analysis and machine learning projects.
Created
Feb. 12, 2015
Updated
Sept. 17, 2020
Github repo
Primary Language, based on Github DataLanguage
Jupyter
Issues
11
DataSciencePython
Github stargazers
4681
Github forks
1450
Commits
112
Code contributors Contributors
7
common data analysis and machine learning tasks using python
Created
Oct. 6, 2015
Updated
Jan. 1, 2019
License
mit
Github repo
Type
Module/library
Primary Language, based on Github DataLanguage
Python
Issues
10
Data-Analysis
Github stargazers
4518
Github forks
3514
Commits
374
Code contributors Contributors
6
Data Science Using Python
Created
March 15, 2017
Updated
June 26, 2022
License
mit
Github repo
Primary Language, based on Github DataLanguage
Jupyter
Issues
32
Homepage
interesting-python
Github stargazers
4239
Github forks
1603
Commits
206
Code contributors Contributors
1
有趣的Python爬虫和Python数据分析小项目(Some interesting Python crawlers and data analysis projects)
Created
March 16, 2018
Updated
Aug. 11, 2020
Github repo
Primary Language, based on Github DataLanguage
Jupyter
Issues
26
mlxtend
Github stargazers
4171
Github forks
795
Commits
1548
Code contributors Contributors
86
A library of extension and helper modules for Python's data analysis and machine learning libraries.
Created
Aug. 14, 2014
Updated
Dec. 3, 2022
License
other
Github repo
Type
Module/library
Primary Language, based on Github DataLanguage
Python
Issues
130
orange3
Github stargazers
3850
Github forks
888
Commits
14481
Code contributors Contributors
92
🍊 :bar_chart: :bulb: Orange: Interactive data analysis
Created
Feb. 22, 2013
Updated
Dec. 5, 2022
License
other
Github repo
Type
Tool/utility
Primary Language, based on Github DataLanguage
Python
Issues
114
awesome-single-cell
Github stargazers
2344
Github forks
822
Commits
718
Code contributors Contributors
153
Community-curated list of software packages and data resources for single-cell, including RNA-seq, ATAC-seq, etc.
Created
June 29, 2016
Updated
Dec. 2, 2022
License
mit
Github repo
Type
App
Issues
11
pypika
Github stargazers
1856
Github forks
229
Commits
1020
Code contributors Contributors
73
PyPika is a python SQL query builder that exposes the full richness of the SQL language using a syntax that reflects the resulting query. PyPika excels at all sorts of SQL queries but is especially useful for data analysis.
Created
July 6, 2016
Updated
March 15, 2022
License
apache-2.0
Github repo
Primary Language, based on Github DataLanguage
Python
Issues
156
cubes
Github stargazers
1475
Github forks
316
Commits
4675
Code contributors Contributors
43
Light-weight Python OLAP framework for multi-dimensional data analysis
Created
Jan. 10, 2011
Updated
Feb. 2, 2019
License
other
Github repo
Primary Language, based on Github DataLanguage
Python
Issues
140
python_data_analysis_and_mining_action
Github stargazers
1415
Github forks
653
Commits
57
Code contributors Contributors
2
《python数据分析与挖掘实战》的代码笔记
Created
Oct. 29, 2017
Updated
June 22, 2019
Github repo
Primary Language, based on Github DataLanguage
Python
Issues
5
agate
Github stargazers
1115
Github forks
147
Commits
1466
Code contributors Contributors
44
A Python data analysis library that is optimized for humans instead of machines.
Created
April 25, 2014
Updated
July 15, 2021
License
mit
Github repo
Type
Module/library
Primary Language, based on Github DataLanguage
Python
Issues
18
datacleaner
Github stargazers
1019
Github forks
205
Commits
49
Code contributors Contributors
4
A Python tool that automatically cleans data sets and readies them for analysis.
Created
Feb. 27, 2016
Updated
Jan. 18, 2017
License
mit
Github repo
Type
Tool/utility
Primary Language, based on Github DataLanguage
Python
Issues
12
bootcamp
Github stargazers
1014
Github forks
428
Commits
2643
Code contributors Contributors
43
Dealing with all unstructured data, such as reverse image search, audio search, molecular search, video analysis, question and answer systems, NLP, etc.
Created
Aug. 9, 2019
Updated
Dec. 2, 2022
License
apache-2.0
Github repo
Primary Language, based on Github DataLanguage
Python
Issues
30
Homepage
nfstream
Github stargazers
882
Github forks
102
Commits
1815
Code contributors Contributors
10
NFStream: a Flexible Network Data Analysis Framework.
Created
Oct. 18, 2019
Updated
Nov. 19, 2022
License
lgpl-3.0
Github repo
Primary Language, based on Github DataLanguage
Python
Issues
8
Homepage