Alternative Big Data libraries for Python
python-bigdata
GitHub stargazers
132
GitHub forks
166
Commits
31
Code contributors
2
Data science and Big Data with Python
Created
July 14, 2016
Updated
Aug. 27, 2023
Primary language (based on GitHub data)
Jupyter
Issues
1
kubernetes-bigquery-python
Archived (read-only repository, archived by owner)
GitHub stargazers
130
GitHub forks
88
Commits
41
Code contributors
7
Example Kubernetes app that shows how to build a 'pipeline' to stream data into BigQuery. Uses Redis or Google Cloud Pub/Sub.
Created
Dec. 17, 2014
Updated
Oct. 20, 2020
License
apache-2.0
Type
App
Primary language (based on GitHub data)
Python
Issues
5
Python-big-data
GitHub stargazers
126
GitHub forks
67
Commits
2826
Code contributors
110
Python and Pandas are known to have issues around scalability and efficiency. You will learn how to use libraries such as Modin, Dask, Ray, and Vaex to overcome the problems faced by Pandas.
Created
Dec. 28, 2022
Updated
Feb. 20, 2024
Primary language (based on GitHub data)
Jupyter
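One idea commonly used to work around the in-memory limits mentioned in the Python-big-data description is to split the data into partitions and combine small per-partition results. The sketch below illustrates only that general idea with the standard library; it does not use or reproduce the API of Modin, Dask, Ray, or Vaex. The function name `chunked_mean`, the `chunk_size` parameter, and the sample data are hypothetical.

```python
import csv
import io
from itertools import islice


def chunked_mean(lines, column, chunk_size=1000):
    """Mean of a numeric CSV column, reading chunk_size rows at a time."""
    reader = csv.DictReader(lines)
    total = 0.0
    count = 0
    while True:
        # Pull at most chunk_size rows; the rest of the input stays unread.
        chunk = list(islice(reader, chunk_size))
        if not chunk:
            break
        # Reduce each partition to a small partial result (sum, count).
        total += sum(float(row[column]) for row in chunk)
        count += len(chunk)
    if count == 0:
        raise ValueError("no rows to aggregate")
    return total / count


# Hypothetical sample standing in for a file too large to load at once.
sample = io.StringIO("id,value\n1,10\n2,20\n3,30\n4,40\n")
result = chunked_mean(sample, "value", chunk_size=2)
print(result)
```

Only one chunk of rows is held in memory at a time, so peak memory depends on `chunk_size` and not on the total number of rows.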
Frank-Kanes-Taming-Big-Data-with-Apache-Spark-and-Python
GitHub stargazers
118
GitHub forks
199
Commits
14
Code contributors
5
Frank Kane's Taming Big Data with Apache Spark and Python, published by Packt
Created
June 30, 2017
Updated
Jan. 30, 2023
License
mit
Type
Resource
Primary language (based on GitHub data)
Python
Issues
1
bigflow
GitHub stargazers
117
GitHub forks
22
Commits
981
Code contributors
20
A Python framework for data processing on GCP.
Created
July 25, 2019
Updated
July 30, 2024
License
other
Type
CLI
Primary language (based on GitHub data)
Python
Issues
47
Big-Data-Engineering-Coursera-Yandex
GitHub stargazers
102
GitHub forks
75
Commits
33
Code contributors
1
Big Data for Data Engineers Coursera Specialization from Yandex
Created
March 29, 2018
Updated
March 15, 2023
License
mit
Primary language (based on GitHub data)
Jupyter
Issues
4
spark-and-python-for-big-data-with-pyspark
GitHub stargazers
97
GitHub forks
121
Commits
2
Code contributors
1
Course on Udemy by Jose Portilla
Created
Jan. 17, 2018
Updated
Jan. 17, 2018
Primary language (based on GitHub data)
Jupyter
A-Deep-Learning-Based-Illegal-Insider-Trading-Detection-and-Prediction-Technique-in-Stock-Market
GitHub stargazers
86
GitHub forks
21
Commits
4
Code contributors
1
Illegal insider trading of stocks is based on releasing non-public information (e.g., new product launch, quarterly financial report, acquisition or merger plan) before the information is made public. Detecting illegal insider trading is difficult due to the complex, nonlinear, and non-stationary nature of the stock market. In this work, we present an approa
Created
Nov. 27, 2017
Updated
Jan. 8, 2019
Primary language (based on GitHub data)
Python
python-bigquery-datatransfer
Archived (read-only repository, archived by owner)
GitHub stargazers
85
GitHub forks
29
Commits
427
Code contributors
36
This library has moved to https://github.com/googleapis/google-cloud-python/tree/main/packages/google-cloud-bigquery-datatransfer
Created
Dec. 10, 2019
Updated
Sept. 29, 2023
License
apache-2.0
Type
Module/library
Primary language (based on GitHub data)
Python
Coursera-Bioinformatics
GitHub stargazers
82
GitHub forks
46
Commits
6
Code contributors
1
My solutions to the Bioinformatics Specialization (Finding Hidden Messages in DNA; Genome Sequencing; Comparing Genes, Proteins, and Genomes; Molecular Evolution; Genomic Data Science and Clustering; Finding Mutations in DNA and Proteins; Bioinformatics Capstone: Big Data in Biology)
Created
April 5, 2018
Updated
Nov. 1, 2018
License
gpl-3.0
Primary language (based on GitHub data)
Python
BigDataPython
GitHub stargazers
77
GitHub forks
127
Commits
6
Code contributors
3
Supporting material for the book BIG DATA CON PYTHON. Recolección, almacenamiento y procesamiento de datos (Big Data with Python: data collection, storage, and processing), by Enrique Martín Martín, Adrián Riesco, and Rafael Caballero, published by RC Libros
Created
July 3, 2018
Updated
March 2, 2024
Primary language (based on GitHub data)
Jupyter
Issues
1
pypar
GitHub stargazers
69
GitHub forks
15
Commits
198
Code contributors
4
Efficient and scalable parallelism using the Message Passing Interface (MPI) to handle big data and computationally intensive problems.
Created
May 21, 2013
Updated
Nov. 11, 2016
License
gpl-3.0
Type
Module/library
Primary language (based on GitHub data)
Python
Issues
5
Spark-and-Kafka_IoT-Data-Processing-and-Analytics
GitHub stargazers
65
GitHub forks
26
Commits
3
Code contributors
1
Final project for the IoT: Big Data Processing and Analytics class. Analyzes nationwide U.S. temperature data from IoT sensors in real time.
Created
Nov. 21, 2016
Updated
Nov. 21, 2016
Primary language (based on GitHub data)
Python
xcast
GitHub stargazers
65
GitHub forks
5
Commits
196
Code contributors
3
A High-Performance Data Science Toolkit for the Earth Sciences
Created
July 15, 2021
Updated
June 8, 2024
License
mit
Primary language (based on GitHub data)
Jupyter
Issues
1
Location-based-Restaurants-Recommendation-System
GitHub stargazers
63
GitHub forks
24
Commits
9
Code contributors
1
Big Data Management and Analysis Final Project
Created
July 21, 2017
Updated
March 21, 2018
Primary language (based on GitHub data)
Python
Data-Visualizations
GitHub stargazers
62
GitHub forks
29
Commits
10
Code contributors
1
Data visualization is emerging as one of the most essential skills in almost all IT and non-IT sectors and jobs. It supports wiser decisions that can increase business profits, and it helps reveal the root causes and behavior of the people and customers involved. In this repository I have dee
Created
April 9, 2020
Updated
April 9, 2020
License
gpl-3.0
Type
Module/library
Primary language (based on GitHub data)
Jupyter
Issues
1
Python-Basic-programs
GitHub stargazers
60
GitHub forks
15
Commits
217
Code contributors
1
What is Python? Executive Summary: Python is an interpreted, object-oriented, high-level programming language with dynamic semantics. Its high-level built-in data structures, combined with dynamic typing and dynamic binding, make it very attractive for Rapid Application Development, as well as for use as a scripting or glue language to connect existing compon
Created
Feb. 11, 2021
Updated
March 2, 2021
Primary language (based on GitHub data)
Python
Issues
2
torrents
GitHub stargazers
53
GitHub forks
0
Commits
None
Code contributors
1
Gist giansalex/torrent-courses-download-list.md, forked from M-Younus/torrent courses download-list
Created
Feb. 23, 2020
Updated
Feb. 23, 2020
Issues
5
pykylin
GitHub stargazers
51
GitHub forks
76
Commits
2
Code contributors
1
Python DB-API driver and SQLAlchemy dialect for Apache Kylin, the "Extreme OLAP Engine for Big Data"
Created
Nov. 16, 2015
Updated
Nov. 16, 2015
Primary language (based on GitHub data)
Python
Issues
11
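A DB-API driver, as named in the pykylin description, follows the connect / cursor / execute / fetch pattern defined by PEP 249. The sketch below shows that pattern with the standard library's `sqlite3` module as a stand-in. It makes no assumption about pykylin's own connection parameters, which this listing does not show; the table and column names are hypothetical.

```python
import sqlite3

# sqlite3 is used here only as a stand-in DB-API 2.0 driver.
conn = sqlite3.connect(":memory:")
try:
    cur = conn.cursor()
    # Hypothetical table, created just for this illustration.
    cur.execute("CREATE TABLE sales (region TEXT, amount INTEGER)")
    cur.executemany(
        "INSERT INTO sales VALUES (?, ?)",
        [("east", 10), ("west", 5), ("east", 7)],
    )
    # An aggregate query of the kind an OLAP engine is typically asked.
    cur.execute(
        "SELECT region, SUM(amount) FROM sales GROUP BY region ORDER BY region"
    )
    rows = cur.fetchall()
finally:
    conn.close()

print(rows)
```

The placeholder style (`?` here) is driver-specific under PEP 249, so code written against one driver may need its parameter markers adjusted for another.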
big-data
GitHub stargazers
51
GitHub forks
30
Commits
116
Code contributors
1
Python tools for big data
Created
Sept. 9, 2019
Updated
Oct. 9, 2023
Primary language (based on GitHub data)
Jupyter