Alternative data analysis libraries for Python
SQL-Data-Analysis-and-Visualization-Projects
GitHub stargazers
1282
GitHub forks
516
Commits
119
Code contributors
1
SQL data analysis and visualization projects using MySQL, PostgreSQL, SQLite, Tableau, Apache Spark, and PySpark.
Created
Feb. 29, 2020
Updated
April 14, 2022
License
mit
GitHub repo
Primary language (based on GitHub data)
Jupyter
Issues
1
prince
GitHub stargazers
1263
GitHub forks
183
Commits
394
Code contributors
15
Multivariate exploratory data analysis in Python: PCA, CA, MCA, MFA, FAMD, GPA
Created
Oct. 22, 2016
Updated
Sept. 7, 2024
License
mit
GitHub repo
Primary language (based on GitHub data)
Python
Issues
4
data-structures-algorithms-python
GitHub stargazers
1221
GitHub forks
1506
Commits
86
Code contributors
6
This tutorial playlist covers data structures and algorithms in Python. Every tutorial covers the theory behind a data structure or an algorithm, Big O complexity analysis, and exercises that you can practice on.
Created
Sept. 29, 2020
Updated
Nov. 14, 2022
GitHub repo
Primary language (based on GitHub data)
Jupyter
Issues
61
agate
GitHub stargazers
1172
GitHub forks
154
Commits
1524
Code contributors
48
A Python data analysis library that is optimized for humans instead of machines.
Created
April 25, 2014
Updated
July 30, 2024
License
mit
GitHub repo
Type
Module/library
Primary language (based on GitHub data)
Python
Issues
4
veles
Archived (read-only repository, archived by owner)
GitHub stargazers
1149
GitHub forks
118
Commits
637
Code contributors
14
Binary data analysis and visualization tool
Created
Jan. 12, 2017
Updated
May 18, 2018
License
apache-2.0
GitHub repo
Primary language (based on GitHub data)
C++
Issues
56
nfstream
GitHub stargazers
1082
GitHub forks
120
Commits
1841
Code contributors
12
NFStream: a Flexible Network Data Analysis Framework.
Created
Oct. 18, 2019
Updated
May 10, 2024
License
lgpl-3.0
GitHub repo
Primary language (based on GitHub data)
Python
Issues
25
datacleaner
GitHub stargazers
1054
GitHub forks
204
Commits
49
Code contributors
4
A Python tool that automatically cleans data sets and readies them for analysis.
Created
Feb. 27, 2016
Updated
Jan. 18, 2017
License
mit
GitHub repo
Type
Tool/utility
Primary language (based on GitHub data)
Python
Issues
12
Deep-Learning-For-Hackers
GitHub stargazers
1024
GitHub forks
441
Commits
45
Code contributors
1
Machine Learning tutorials with TensorFlow 2 and Keras in Python (Jupyter notebooks included) - (LSTMs, Hyperparameter tuning, Data preprocessing, Bias-variance tradeoff, Anomaly Detection, Autoencoders, Time Series Forecasting, Object Detection, Sentiment Analysis, Intent Recognition with BERT)
Created
April 24, 2019
Updated
April 23, 2020
License
mit
GitHub repo
Type
Tool/utility
Primary language (based on GitHub data)
Jupyter
Issues
6
hail
GitHub stargazers
978
GitHub forks
246
Commits
11258
Code contributors
82
Cloud-native genomic dataframes and batch computing
Created
Oct. 27, 2015
Updated
Sept. 26, 2024
License
mit
GitHub repo
Type
Tool/utility
Primary language (based on GitHub data)
Python
Issues
254
Doing_bayesian_data_analysis
GitHub stargazers
894
GitHub forks
285
Commits
128
Code contributors
7
Python/PyMC3 versions of the programs described in Doing Bayesian Data Analysis by John K. Kruschke
Created
July 4, 2014
Updated
July 16, 2021
GitHub repo
Type
Resource
Primary language (based on GitHub data)
Jupyter
Issues
1
kaggle-titanic
GitHub stargazers
871
GitHub forks
676
Commits
143
Code contributors
8
A tutorial for Kaggle's Titanic: Machine Learning from Disaster competition. Demonstrates basic data munging, analysis, and visualization techniques. Shows examples of supervised machine learning techniques.
Created
May 1, 2013
Updated
Dec. 18, 2017
License
apache-2.0
GitHub repo
Type
Script
Primary language (based on GitHub data)
Jupyter
Issues
1
python-for-data-analysis
GitHub stargazers
845
GitHub forks
330
Commits
220
Code contributors
1
An introduction to data science using Python and Pandas with Jupyter notebooks
Created
Nov. 23, 2016
Updated
Oct. 8, 2019
License
mit
GitHub repo
Type
Module/library
Primary language (based on GitHub data)
Jupyter
Issues
3
retentioneering-tools
GitHub stargazers
798
GitHub forks
122
Commits
6
Code contributors
21
Retentioneering: product analytics, data-driven customer journey map (CJM) optimization, marketing analytics, web analytics, transaction analytics, graph visualization, process mining, and behavioral segmentation in Python. Predictive analytics over clickstream data, A/B tests, machine learning, and Markov chain simulations.
Created
July 2, 2019
Updated
Dec. 1, 2023
License
other
GitHub repo
Type
Tool/utility
Primary language (based on GitHub data)
Python
Issues
2
deeptime
GitHub stargazers
754
GitHub forks
84
Commits
1249
Code contributors
17
Python library for analysis of time series data including dimensionality reduction, clustering, and Markov model estimation
Created
March 27, 2018
Updated
July 16, 2024
License
lgpl-3.0
GitHub repo
Primary language (based on GitHub data)
Python
Issues
15
Hands-on-Exploratory-Data-Analysis-with-Python
GitHub stargazers
718
GitHub forks
328
Commits
58
Code contributors
5
Hands-on Exploratory Data Analysis with Python, published by Packt
Created
Oct. 7, 2019
Updated
Jan. 30, 2023
License
mit
GitHub repo
Primary language (based on GitHub data)
Jupyter
Issues
1
DataAnalysisInAction
GitHub stargazers
700
GitHub forks
275
Commits
127
Code contributors
2
(Finished) Geek Time course "Data Analysis in Practice: 45 Lectures": detailed notes including Markdown text, images, mind maps, code, and data. The notes can be read directly and the code can be tested.
Created
Dec. 19, 2018
Updated
Jan. 21, 2024
License
other
GitHub repo
Primary language (based on GitHub data)
Python
Issues
9
sparkMeasure
GitHub stargazers
704
GitHub forks
145
Commits
303
Code contributors
12
This is the development repository for sparkMeasure, a tool and library designed for efficient analysis and troubleshooting of Apache Spark jobs. It focuses on easing the collection and examination of Spark metrics, making it a practical choice for both developers and data engineers.
Created
March 16, 2017
Updated
Aug. 13, 2024
License
apache-2.0
GitHub repo
Primary language (based on GitHub data)
Scala
Issues
2
DBDA-python
GitHub stargazers
673
GitHub forks
264
Commits
210
Code contributors
1
Doing Bayesian Data Analysis, 2nd Edition (Kruschke, 2015): Python/PyMC3 code
Created
July 13, 2016
Updated
Aug. 13, 2021
License
mit
GitHub repo
Primary language (based on GitHub data)
Jupyter
Issues
3
Dora
GitHub stargazers
644
GitHub forks
73
Commits
38
Code contributors
2
Tools for exploratory data analysis in Python
Created
Feb. 16, 2016
Updated
Jan. 18, 2024
GitHub repo
Type
Module/library
Primary language (based on GitHub data)
Python
pyRiemann
GitHub stargazers
635
GitHub forks
164
Commits
557
Code contributors
35
Machine learning for multivariate data through the Riemannian geometry of positive definite matrices in Python
Created
April 19, 2015
Updated
Sept. 27, 2024
License
bsd-3-clause
GitHub repo
Primary language (based on GitHub data)
Python
Issues
5