Alternative Big Data libraries for Python
Updated
March 25, 2024
architect_big_data_solutions_with_spark
Github stargazers
43
Github forks
36
Commits
48
Code contributors Contributors
4
code, labs and lectures for the course
Created
April 30, 2018
Updated
April 16, 2023
License
mit
Github repo
Primary Language, based on Github DataLanguage
Jupyter
big-data-exploration
Read-only repository, archived by owner Archived
Github stargazers
43
Github forks
30
Commits
77
Code contributors Contributors
5
[Archive] Intern project - Big Data Exploration using MongoDB - This Repository is NOT a supported MongoDB product
Created
May 28, 2013
Updated
Dec. 3, 2018
Github repo
Primary Language, based on Github DataLanguage
JavaScript
omniture-data-tools
Github stargazers
41
Github forks
32
Commits
41
Code contributors Contributors
5
A set of tools for working with Omniture daily data files (hit_data.tsv) in big or small tools like Spark, Hadoop or just Python.
Created
Oct. 8, 2011
Updated
May 14, 2019
License
mit
Github repo
Type
Tool/utility
Primary Language, based on Github DataLanguage
Java
Issues
2
torrents
Github stargazers
40
Github forks
0
Commits
None
Code contributors Contributors
1
Gist giansalex/torrent-courses-download-list.md, forked from M-Younus/torrent courses download-list.
Created
Feb. 23, 2020
Updated
Feb. 23, 2020
Github repo
Issues
5
BigDataRiver
Github stargazers
38
Github forks
18
Commits
20
Code contributors Contributors
1
Simple demo implementation of Lambda and Kappa architectures using Python, Docker, Kafka, Spark and Cassandra
Created
Aug. 30, 2017
Updated
March 15, 2018
License
mit
Github repo
Primary Language, based on Github DataLanguage
Jupyter
Homepage
trough
Github stargazers
36
Github forks
7
Commits
696
Code contributors Contributors
6
Trough: Big data, small databases.
Created
Dec. 16, 2016
Updated
Aug. 30, 2022
License
bsd-2-clause
Github repo
Primary Language, based on Github DataLanguage
Python
Issues
9
google-analytics-big-query-importer
Github stargazers
33
Github forks
11
Commits
13
Code contributors Contributors
2
A Python script that extracts data from Google Analytics and imports it into a Google Big Query table.
Created
Oct. 8, 2019
Updated
Oct. 16, 2019
License
apache-2.0
Github repo
Type
Script
Primary Language, based on Github DataLanguage
Python
Issues
2
ie-mbd-advanced-python
Read-only repository, archived by owner Archived
Github stargazers
31
Github forks
70
Commits
185
Code contributors Contributors
37
"Advanced Python" subject from the Master in Big Data @ IE
Created
March 25, 2019
Updated
Nov. 26, 2021
License
other
Github repo
Primary Language, based on Github DataLanguage
Jupyter
Issues
2
python-bigquery-connection
Read-only repository, archived by owner Archived
Github stargazers
30
Github forks
15
Commits
253
Code contributors Contributors
19
This library has moved to https://github.com/googleapis/google-cloud-python/tree/main/packages/google-cloud-bigquery-connection
Created
May 19, 2020
Updated
Sept. 29, 2023
License
apache-2.0
Github repo
Type
Module/library
Primary Language, based on Github DataLanguage
Python
Image-Classifier
Github stargazers
29
Github forks
17
Commits
25
Code contributors Contributors
1
Image Classifier: Going forward, AI algorithms will be incorporated into more and more everyday applications. For example, you might want to include an image classifier in a smartphone app. To do this, you'd use a deep learning model trained on hundreds of thousands of images as part of the overall application architecture. A large part of software developme
Created
Jan. 24, 2019
Updated
Feb. 1, 2019
License
mit
Github repo
Type
App
Primary Language, based on Github DataLanguage
Jupyter
Issues
3
big-data-madison-dagster
Github stargazers
29
Github forks
7
Commits
15
Code contributors Contributors
1
--
Created
Dec. 27, 2021
Updated
March 6, 2022
Github repo
Primary Language, based on Github DataLanguage
Python
Big-Data-Analysis-with-Python
Github stargazers
27
Github forks
41
Commits
69
Code contributors Contributors
2
Combine Spark and Python to process large datasets and unlock the power of parallel computing and machine learning
Created
Nov. 6, 2018
Updated
May 10, 2019
License
mit
Github repo
Type
Resource
Primary Language, based on Github DataLanguage
Jupyter
Spark_Python_Do_Big_Data_Analytics
Github stargazers
26
Github forks
58
Commits
9
Code contributors Contributors
1
Course materials for the Udemy course "Apache Spark 2.0 + Python: Do Big Data Analytics & ML"
Created
March 9, 2017
Updated
March 14, 2017
Github repo
Primary Language, based on Github DataLanguage
Python
django-bigbuild
Read-only repository, archived by owner Archived
Github stargazers
26
Github forks
0
Commits
308
Code contributors Contributors
1
The open-source engine that powers bigbuilder, the Los Angeles Times Data Desk's system for publishing standalone pages
Created
July 18, 2016
Updated
May 31, 2019
License
mit
Github repo
Primary Language, based on Github DataLanguage
JavaScript
Issues
14
Homepage
Flight_delay_prediction_web_app
Github stargazers
25
Github forks
10
Commits
4
Code contributors Contributors
1
A big data web application to predict USA airline traffic delay with Python, Flask, Apache Spark, Kafka, MongoDB, ElasticSearch, d3.js, scikit-learn, MLlib and Apache Airflow.
Created
May 17, 2018
Updated
April 1, 2022
License
mit
Github repo
Primary Language, based on Github DataLanguage
Jupyter
Issues
1
LibraryBigData
Github stargazers
24
Github forks
6
Commits
26
Code contributors Contributors
1
Python and R application case study: provides one year of library lending data and performs big data analysis on it.
Created
April 25, 2018
Updated
June 24, 2018
Github repo
Primary Language, based on Github DataLanguage
R
Introduction-to-Python-Programming
Github stargazers
24
Github forks
9
Commits
23
Code contributors Contributors
1
Lectures at the Big Data Institute, Seoul National University
Created
Sept. 10, 2017
Updated
Sept. 25, 2017
Github repo
Primary Language, based on Github DataLanguage
Jupyter
rastercube
Github stargazers
19
Github forks
6
Commits
91
Code contributors Contributors
1
rastercube is a python library for big data analysis of georeferenced time series data (e.g. MODIS NDVI)
Created
June 11, 2017
Updated
July 27, 2017
License
mit
Github repo
Type
Module/library
Primary Language, based on Github DataLanguage
Python
Learning-Scipy
Github stargazers
19
Github forks
15
Commits
25
Code contributors Contributors
1
This repository contains source code and notes that complement the book on the scientific Python module SciPy, entitled [Learning SciPy for Numerical and Scientific Computing - Second Edition (2015)](https://www.packtpub.com/big-data-and-business-intelligence/learning-scipy-numerical-and-scientific-computing-second-edition)
Created
June 28, 2015
Updated
Nov. 2, 2017
Github repo
Primary Language, based on Github DataLanguage
Jupyter
Issues
1