Alternative file utilities and packages for Python
Updated: May 31, 2023

yapf
GitHub stargazers: 13248
GitHub forks: 900
Commits: 1264
Code contributors: 138
A formatter for Python files
Created: March 18, 2015
Updated: May 27, 2023
License: apache-2.0
Type: Module/library
Primary language (based on GitHub data): Python
Issues: 391

airbyte
GitHub stargazers: 10746
GitHub forks: 2
Commits: 11288
Code contributors: 736
Data integration platform for ELT pipelines from APIs, databases & files to warehouses & lakes.
Created: July 27, 2020
Updated: May 31, 2023
License: other
Primary language (based on GitHub data): Python
Issues: 4644

q
GitHub stargazers: 9870
GitHub forks: 408
Commits: 404
Code contributors: 24
q - Run SQL directly on delimited files and multi-file sqlite databases
Created: Jan. 30, 2012
Updated: Aug. 6, 2022
License: gpl-3.0
Primary language (based on GitHub data): Python
Issues: 102

OCRmyPDF
GitHub stargazers: 8975
GitHub forks: 714
Commits: 3414
Code contributors: 74
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
Created: Dec. 20, 2013
Updated: May 23, 2023
License: mpl-2.0
Primary language (based on GitHub data): Python
Issues: 108

boltons
GitHub stargazers: 6206
GitHub forks: 342
Commits: 1519
Code contributors: 76
🔩 Like builtins, but boltons. 250+ constructs, recipes, and snippets which extend (and rely on nothing but) the Python standard library. Nothing like Michael Bolton.
Created: Feb. 20, 2013
Updated: May 6, 2023
License: other
Primary language (based on GitHub data): Python
Issues: 65

python-dotenv
GitHub stargazers: 6047
GitHub forks: 366
Commits: 354
Code contributors: 81
Reads key-value pairs from a .env file and can set them as environment variables. It helps in developing applications following the 12-factor principles.
Created: Sept. 6, 2014
Updated: April 17, 2023
License: bsd-3-clause
Type: Cli
Primary language (based on GitHub data): Python
Issues: 34

pyWhat
GitHub stargazers: 5966
GitHub forks: 321
Commits: 643
Code contributors: 35
🐸 Identify anything. pyWhat easily lets you identify emails, IP addresses, and more. Feed it a .pcap file or some text and it'll tell you what it is! 🧙‍♀️
Created: March 19, 2021
Updated: May 16, 2023
License: mit
Primary language (based on GitHub data): Python
Issues: 24

pypdf
GitHub stargazers: 5718
GitHub forks: 1
Commits: 1245
Code contributors: 179
A pure-python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files
Created: Jan. 6, 2012
Updated: May 21, 2023
License: other
Primary language (based on GitHub data): Python
Issues: 73

onionshare
GitHub stargazers: 5617
GitHub forks: 627
Commits: 4795
Code contributors: 214
Securely and anonymously share files, host websites, and chat with friends using the Tor network
Created: May 20, 2014
Updated: May 26, 2023
License: other
Primary language (based on GitHub data): Python
Issues: 100

XlsxWriter
GitHub stargazers: 3257
GitHub forks: 605
Commits: 1328
Code contributors: 41
A Python module for creating Excel XLSX files.
Created: Jan. 4, 2013
Updated: May 29, 2023
License: bsd-2-clause
Type: Module/library
Primary language (based on GitHub data): Python
Issues: 15

picard
GitHub stargazers: 3181
GitHub forks: 370
Commits: 9505
Code contributors: 97
MusicBrainz Picard audio file tagger
Created: Aug. 16, 2011
Updated: May 31, 2023
License: gpl-2.0
Primary language (based on GitHub data): Python
Issues: 9

gdown
GitHub stargazers: 3059
GitHub forks: 283
Commits: 441
Code contributors: 17
Download a large file from Google Drive (curl/wget fails because of the security notice).
Created: Oct. 17, 2015
Updated: April 22, 2023
License: mit
Primary language (based on GitHub data): Python
Issues: 9

borb
GitHub stargazers: 3021
GitHub forks: 121
Commits: 78
Code contributors: 2
borb is a library for reading, creating and manipulating PDF files in python.
Created: Nov. 7, 2020
Updated: May 27, 2023
License: other
Primary language (based on GitHub data): Python
Issues: 10

smart_open
GitHub stargazers: 2876
GitHub forks: 354
Commits: 1011
Code contributors: 91
Utils for streaming large files (S3, HDFS, gzip, bz2...)
Created: Jan. 2, 2015
Updated: Jan. 19, 2023
License: mit
Type: Tool/utility
Primary language (based on GitHub data): Python
Issues: 82

cleanrl
GitHub stargazers: 2742
GitHub forks: 361
Commits: 805
Code contributors: 35
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
Created: June 7, 2019
Updated: May 20, 2023
License: other
Primary language (based on GitHub data): Python
Issues: 61

pex
GitHub stargazers: 2327
GitHub forks: 241
Commits: 1333
Code contributors: 99
A tool for generating .pex (Python EXecutable) files, lock files and venvs.
Created: July 21, 2014
Updated: May 13, 2023
License: apache-2.0
Type: Module/library
Primary language (based on GitHub data): Python
Issues: 156

whitenoise
GitHub stargazers: 2236
GitHub forks: 139
Commits: 726
Code contributors: 60
Radically simplified static file serving for Python web apps
Created: Aug. 8, 2013
Updated: May 20, 2023
License: mit
Type: App
Primary language (based on GitHub data): Python
Issues: 32

graphtage
GitHub stargazers: 2212
GitHub forks: 43
Commits: 519
Code contributors: 10
A semantic diff utility and library for tree-like files such as JSON, JSON5, XML, HTML, YAML, and CSV.
Created: April 21, 2020
Updated: Feb. 17, 2023
License: lgpl-3.0
Primary language (based on GitHub data): Python
Issues: 23

PyLaTeX
GitHub stargazers: 2080
GitHub forks: 282
Commits: 755
Code contributors: 54
A Python library for creating LaTeX files
Created: Jan. 15, 2014
Updated: July 10, 2021
License: mit
Type: Module/library
Primary language (based on GitHub data): Python
Issues: 115

pdftabextract
GitHub stargazers: 2063
GitHub forks: 357
Commits: 171
Code contributors: 3
A set of tools for extracting tables from PDF files helping to do data mining on (OCR-processed) scanned documents.
Created: July 8, 2016
Updated: June 24, 2022
License: apache-2.0
Type: Module/library
Primary language (based on GitHub data): Python
Issues: 4