Petastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet format. It supports ML frameworks such as Tensorflow, Pytorch, and PySpark and can be used from pure Python code.
Created
June 15, 2018
Updated
Dec. 2, 2023
License
apache-2.0
Github repo
Type
Module/library
Primary Language, based on Github DataLanguage
Python
Issues
176
auto_ml
Github stargazers
1637
Github forks
314
Commits
1149
Code contributors Contributors
13
[UNMAINTAINED] Automated machine learning for analytics & production
MLRun is an open source MLOps platform for quickly building and managing continuous ML applications across their lifecycle. MLRun integrates into your development and CI/CD environment and automates the delivery of production data, ML pipelines, and online applications.
A comprehensive list of 180+ YouTube Channels for Data Science, Data Engineering, Machine Learning, Deep learning, Computer Science, programming, software engineering, etc.
A Tensorflow model for text recognition (CNN + seq2seq with visual attention) available as a Python package and compatible with Google Cloud ML Engine.
Created
July 21, 2017
Updated
Oct. 20, 2023
License
mit
Github repo
Type
Tool/utility
Primary Language, based on Github DataLanguage
Python
Issues
27
sematic
Github stargazers
941
Github forks
55
Commits
1066
Code contributors Contributors
19
An open-source ML pipeline development platform
Created
April 19, 2022
Updated
April 15, 2024
License
other
Github repo
Primary Language, based on Github DataLanguage
Python
Issues
136
depthai
Github stargazers
864
Github forks
218
Commits
3150
Code contributors Contributors
45
DepthAI Python API utilities, examples, and tutorials.