Alternative Data analysis libraries for Python
Updated :
March 29, 2024
prince
Github stargazers
1179
Github forks
178
Commits
394
Code contributors Contributors
15
:crown: Multivariate exploratory data analysis in Python — PCA, CA, MCA, MFA, FAMD, GPA
Created
Oct. 22, 2016
Updated
Dec. 21, 2023
License
mit
Github repo
Primary Language, based on Github DataLanguage
Python
Issues
6
agate
Github stargazers
1160
Github forks
152
Commits
1524
Code contributors Contributors
47
A Python data analysis library that is optimized for humans instead of machines.
Created
April 25, 2014
Updated
Feb. 23, 2024
License
mit
Github repo
Type
Module/library
Primary Language, based on Github DataLanguage
Python
Issues
8
data-structures-algorithms-python
Github stargazers
1094
Github forks
1
Commits
86
Code contributors Contributors
6
This tutorial playlist covers data structures and algorithms in python. Every tutorial has theory behind data structure or an algorithm, BIG O Complexity analysis and exercises that you can practice on.
Created
Sept. 29, 2020
Updated
Nov. 14, 2022
Github repo
Primary Language, based on Github DataLanguage
Jupyter
Issues
59
veles
Read-only repository, archived by owner Archived
Github stargazers
1050
Github forks
118
Commits
637
Code contributors Contributors
14
Binary data analysis and visualization tool
Created
Jan. 12, 2017
Updated
May 18, 2018
License
apache-2.0
Github repo
Primary Language, based on Github DataLanguage
C++
Issues
56
Homepage
datacleaner
Github stargazers
1042
Github forks
211
Commits
49
Code contributors Contributors
4
A Python tool that automatically cleans data sets and readies them for analysis.
Created
Feb. 27, 2016
Updated
Jan. 18, 2017
License
mit
Github repo
Type
Tool/utility
Primary Language, based on Github DataLanguage
Python
Issues
12
SQL-Data-Analysis-and-Visualization-Projects
Github stargazers
1041
Github forks
454
Commits
119
Code contributors Contributors
1
SQL data analysis & visualization projects using MySQL, PostgreSQL, SQLite, Tableau, Apache Spark and pySpark.
Created
Feb. 29, 2020
Updated
April 14, 2022
License
mit
Github repo
Primary Language, based on Github DataLanguage
Jupyter
Issues
1
nfstream
Github stargazers
1036
Github forks
116
Commits
1841
Code contributors Contributors
11
NFStream: a Flexible Network Data Analysis Framework.
Created
Oct. 18, 2019
Updated
March 15, 2023
License
lgpl-3.0
Github repo
Primary Language, based on Github DataLanguage
Python
Issues
26
Homepage
Deep-Learning-For-Hackers
Github stargazers
980
Github forks
429
Commits
45
Code contributors Contributors
1
Machine Learning tutorials with TensorFlow 2 and Keras in Python (Jupyter notebooks included) - (LSTMs, Hyperameter tuning, Data preprocessing, Bias-variance tradeoff, Anomaly Detection, Autoencoders, Time Series Forecasting, Object Detection, Sentiment Analysis, Intent Recognition with BERT)
Created
April 24, 2019
Updated
April 23, 2020
License
mit
Github repo
Type
Tool/utility
Primary Language, based on Github DataLanguage
Jupyter
Issues
6
Homepage
hail
Github stargazers
930
Github forks
236
Commits
11258
Code contributors Contributors
79
Cloud-native genomic dataframes and batch computing
Created
Oct. 27, 2015
Updated
March 28, 2024
License
mit
Github repo
Type
Tool/utility
Primary Language, based on Github DataLanguage
Python
Issues
224
Homepage
Doing_bayesian_data_analysis
Github stargazers
886
Github forks
283
Commits
128
Code contributors Contributors
7
Python/PyMC3 versions of the programs described in Doing bayesian data analysis by John K. Kruschke
Created
July 4, 2014
Updated
July 16, 2021
Github repo
Type
Resource
Primary Language, based on Github DataLanguage
Jupyter
Issues
1
kaggle-titanic
Github stargazers
857
Github forks
668
Commits
143
Code contributors Contributors
8
A tutorial for Kaggle's Titanic: Machine Learning from Disaster competition. Demonstrates basic data munging, analysis, and visualization techniques. Shows examples of supervised machine learning techniques.
Created
May 1, 2013
Updated
Dec. 18, 2017
License
apache-2.0
Github repo
Type
Script
Primary Language, based on Github DataLanguage
Jupyter
Issues
8
python-for-data-analysis
Github stargazers
822
Github forks
327
Commits
220
Code contributors Contributors
1
An introduction to data science using Python and Pandas with Jupyter notebooks
Created
Nov. 23, 2016
Updated
Oct. 8, 2019
License
mit
Github repo
Type
Module/library
Primary Language, based on Github DataLanguage
Jupyter
Issues
3
retentioneering-tools
Github stargazers
753
Github forks
118
Commits
6
Code contributors Contributors
21
Retentioneering: product analytics, data-driven CJM optimization, marketing analytics, web analytics, transaction analytics, graph visualization, process mining, and behavioral segmentation in Python. Predictive analytics over clickstream, AB tests, machine learning, and Markov Chain simulations.
Created
July 2, 2019
Updated
Dec. 1, 2023
License
other
Github repo
Type
Tool/utility
Primary Language, based on Github DataLanguage
Python
Issues
1
DataAnalysisInAction
Github stargazers
689
Github forks
277
Commits
127
Code contributors Contributors
2
(Finished) Geek Time Data Analysis Practical 45 Lecture - Detailed notes containing markdown images mind map code data can be read directly code test
Created
Dec. 19, 2018
Updated
Jan. 21, 2024
License
other
Github repo
Primary Language, based on Github DataLanguage
Python
Issues
9
deeptime
Github stargazers
685
Github forks
75
Commits
1249
Code contributors Contributors
16
Python library for analysis of time series data including dimensionality reduction, clustering, and Markov model estimation
Created
March 27, 2018
Updated
Sept. 10, 2023
License
lgpl-3.0
Github repo
Primary Language, based on Github DataLanguage
Python
Issues
13
DBDA-python
Github stargazers
664
Github forks
261
Commits
210
Code contributors Contributors
1
Doing Bayesian Data Analysis, 2nd Edition (Kruschke, 2015): Python/PyMC3 code
Created
July 13, 2016
Updated
Aug. 13, 2021
License
mit
Github repo
Primary Language, based on Github DataLanguage
Jupyter
Issues
3
Dora
Github stargazers
637
Github forks
70
Commits
38
Code contributors Contributors
2
Tools for exploratory data analysis in Python
Created
Feb. 16, 2016
Updated
Jan. 18, 2024
Github repo
Type
Module/library
Primary Language, based on Github DataLanguage
Python
Issues
1
Hands-on-Exploratory-Data-Analysis-with-Python
Github stargazers
635
Github forks
298
Commits
58
Code contributors Contributors
5
Hands-on Exploratory Data Analysis with Python, published by Packt
Created
Oct. 7, 2019
Updated
Jan. 30, 2023
License
mit
Github repo
Primary Language, based on Github DataLanguage
Jupyter
Issues
1
sparkMeasure
Github stargazers
633
Github forks
135
Commits
303
Code contributors Contributors
10
This is the development repository for sparkMeasure, a tool and library designed for efficient analysis and troubleshooting of Apache Spark jobs. It focuses on easing the collection and examination of Spark metrics, making it a practical choice for both developers and data engineers.
Created
March 16, 2017
Updated
March 25, 2024
License
apache-2.0
Github repo
Primary Language, based on Github DataLanguage
Scala
Issues
1
DaPy
Github stargazers
584
Github forks
47
Commits
382
Code contributors Contributors
1
Easy-to-use data analysis / manipulation framework for humans
Created
March 8, 2018
Updated
Feb. 15, 2020
Github repo
Type
Module/library
Primary Language, based on Github DataLanguage
Python