Alternative Big Data libraries for Python
bdbag
GitHub stargazers
49
GitHub forks
23
Commits
373
Code contributors
13
Big Data Bag Utilities
Created
March 28, 2016
Updated
June 28, 2024
License
apache-2.0
Type
Tool/utility
Primary language (based on GitHub data)
Python
Issues
6
architect_big_data_solutions_with_spark
GitHub stargazers
44
GitHub forks
37
Commits
48
Code contributors
4
code, labs and lectures for the course
Created
April 30, 2018
Updated
April 16, 2023
License
mit
Primary language (based on GitHub data)
Jupyter
big-data-exploration
Archived (read-only repository)
GitHub stargazers
43
GitHub forks
27
Commits
77
Code contributors
5
[Archive] Intern project - Big Data Exploration using MongoDB - This Repository is NOT a supported MongoDB product
Created
May 28, 2013
Updated
Dec. 3, 2018
Primary language (based on GitHub data)
JavaScript
omniture-data-tools
GitHub stargazers
41
GitHub forks
32
Commits
41
Code contributors
5
A set of tools for working with Omniture daily data files (hit_data.tsv) in big or small tools like Spark, Hadoop or just Python.
Created
Oct. 8, 2011
Updated
May 14, 2019
License
mit
Type
Tool/utility
Primary language (based on GitHub data)
Java
Issues
2
BigDataRiver
GitHub stargazers
38
GitHub forks
18
Commits
20
Code contributors
1
Simple demo implementation of Lambda and Kappa architectures using Python, Docker, Kafka, Spark and Cassandra
Created
Aug. 30, 2017
Updated
March 15, 2018
License
mit
Primary language (based on GitHub data)
Jupyter
trough
GitHub stargazers
38
GitHub forks
7
Commits
696
Code contributors
7
Trough: Big data, small databases.
Created
Dec. 16, 2016
Updated
July 25, 2024
License
bsd-2-clause
Primary language (based on GitHub data)
Python
Issues
9
google-analytics-big-query-importer
GitHub stargazers
36
GitHub forks
11
Commits
13
Code contributors
2
A Python script that extracts data from Google Analytics and imports it into a Google Big Query table.
Created
Oct. 8, 2019
Updated
Oct. 16, 2019
License
apache-2.0
Type
Script
Primary language (based on GitHub data)
Python
Issues
2
ie-mbd-advanced-python
Archived (read-only repository)
GitHub stargazers
31
GitHub forks
70
Commits
185
Code contributors
37
"Advanced Python" subject from the Master in Big Data @ IE
Created
March 25, 2019
Updated
Nov. 26, 2021
License
other
Primary language (based on GitHub data)
Jupyter
Issues
2
Image-Classifier
GitHub stargazers
30
GitHub forks
17
Commits
25
Code contributors
1
Image Classifier Going forward, AI algorithms will be incorporated into more and more everyday applications. For example, you might want to include an image classifier in a smartphone app. To do this, you'd use a deep learning model trained on hundreds of thousands of images as part of the overall application architecture. A large part of software developme
Created
Jan. 24, 2019
Updated
Feb. 1, 2019
License
mit
Type
App
Primary language (based on GitHub data)
Jupyter
Issues
4
python-bigquery-connection
Archived (read-only repository)
GitHub stargazers
30
GitHub forks
15
Commits
253
Code contributors
19
This library has moved to https://github.com/googleapis/google-cloud-python/tree/main/packages/google-cloud-bigquery-connection
Created
May 19, 2020
Updated
Sept. 29, 2023
License
apache-2.0
Type
Module/library
Primary language (based on GitHub data)
Python
big-data-madison-dagster
GitHub stargazers
30
GitHub forks
7
Commits
15
Code contributors
1
--
Created
Dec. 27, 2021
Updated
March 6, 2022
Primary language (based on GitHub data)
Python
LibraryBigData
GitHub stargazers
29
GitHub forks
6
Commits
26
Code contributors
1
A Python and R application case study: provides one year of library borrowing data and performs big data analysis on it.
Created
April 25, 2018
Updated
June 24, 2018
Primary language (based on GitHub data)
R
Big-Data-Analysis-with-Python
GitHub stargazers
29
GitHub forks
42
Commits
69
Code contributors
2
Combine Spark and Python to process large datasets and unlock the power of parallel computing and machine learning
Created
Nov. 6, 2018
Updated
May 10, 2019
License
mit
Type
Resource
Primary language (based on GitHub data)
Jupyter
Flight_delay_prediction_web_app
GitHub stargazers
28
GitHub forks
10
Commits
4
Code contributors
1
A big data web application to predict USA airline traffic delay with Python, Flask, Apache Spark, Kafka, MongoDB, ElasticSearch, d3.js, scikit-learn, MLlib and Apache Airflow.
Created
May 17, 2018
Updated
April 1, 2022
License
mit
Primary language (based on GitHub data)
Jupyter
Issues
1
Spark_Python_Do_Big_Data_Analytics
GitHub stargazers
26
GitHub forks
59
Commits
9
Code contributors
1
Course materials in Udemy Apache Spark 2.0 + Python: Do Big Data Analytics & ML
Created
March 9, 2017
Updated
March 14, 2017
Primary language (based on GitHub data)
Python
django-bigbuild
Archived (read-only repository)
GitHub stargazers
26
GitHub forks
0
Commits
308
Code contributors
1
The open-source engine that powers bigbuilder, the Los Angeles Times Data Desk's system for publishing standalone pages
Created
July 18, 2016
Updated
May 31, 2019
License
mit
Primary language (based on GitHub data)
JavaScript
Issues
14
Introduction-to-Python-Programming
GitHub stargazers
24
GitHub forks
9
Commits
23
Code contributors
1
Lectures at the Big Data Institute, Seoul National University
Created
Sept. 10, 2017
Updated
Sept. 25, 2017
Primary language (based on GitHub data)
Jupyter
QuakeLabeler
GitHub stargazers
22
GitHub forks
4
Commits
73
Code contributors
2
QuakeLabeler is a Python package to create and manage your seismic training data, processes, and visualization in a single place, so you can focus on building the next big thing.
Created
May 28, 2021
Updated
Oct. 5, 2022
License
mit
Primary language (based on GitHub data)
Python
Issues
2
rastercube
GitHub stargazers
19
GitHub forks
4
Commits
91
Code contributors
1
rastercube is a python library for big data analysis of georeferenced time series data (e.g. MODIS NDVI)
Created
June 11, 2017
Updated
July 27, 2017
License
mit
Type
Module/library
Primary language (based on GitHub data)
Python