Python File utilities | 2020 | NewbyCoder.com

Alternative File utilities and packages for Python

airbyte

Github stargazers

15999

Github forks

Commits

13958

Code contributors

987

The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.

Created

July 27, 2020

Updated

Sept. 29, 2024

License

other

Github repo

Primary Language, based on Github Data

Python

Issues

2082

Homepage

airbyte.com

yapf

Github stargazers

13768

Github forks

891

Commits

1336

Code contributors

146

A formatter for Python files

Created

March 18, 2015

Updated

April 1, 2024

License

apache-2.0

Github repo

Type

Module/library

Primary Language, based on Github Data

Python

Issues

399

OCRmyPDF

Github stargazers

13987

Github forks

Commits

3737

Code contributors

OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched

Created

Dec. 20, 2013

Updated

Sept. 15, 2024

License

mpl-2.0

Github repo

Primary Language, based on Github Data

Python

Issues

109

Homepage

ocrmypdf.readthedocs.io

q

Github stargazers

10198

Github forks

423

Commits

405

Code contributors

q - Run SQL directly on delimited files and multi-file sqlite databases

Created

Jan. 30, 2012

Updated

Dec. 21, 2023

License

gpl-3.0

Github repo

Primary Language, based on Github Data

Python

Issues

118

Homepage

harelba.github.io
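
The idea in the description above, running SQL over a delimited file, can be sketched with only the Python standard library. This is not q's implementation or its command-line interface; the helper name `query_csv`, the table name `data`, and the sample rows are all assumptions made for illustration.

```python
import csv
import io
import sqlite3


def query_csv(text, sql, table="data"):
    """Load CSV text into an in-memory SQLite table and run one query.

    Hypothetical helper for illustration only. The first row is used as
    the column names, and every value is inserted as a string.
    """
    rows = list(csv.reader(io.StringIO(text)))
    header, body = rows[0], rows[1:]
    conn = sqlite3.connect(":memory:")
    try:
        # Quote identifiers so headers with spaces or quotes still work.
        cols = ", ".join('"{}"'.format(name.replace('"', '""')) for name in header)
        conn.execute('CREATE TABLE "{}" ({})'.format(table, cols))
        placeholders = ", ".join("?" for _ in header)
        conn.executemany(
            'INSERT INTO "{}" VALUES ({})'.format(table, placeholders), body
        )
        return conn.execute(sql).fetchall()
    finally:
        conn.close()


# Made-up sample data: file names and sizes.
sample = "name,size\na.txt,10\nb.txt,25\nc.txt,7\n"
big_files = query_csv(
    sample,
    "SELECT name FROM data WHERE CAST(size AS INTEGER) > 8 ORDER BY name",
)
```

Because the values arrive as strings, the query casts `size` before comparing. A real tool would likely need type detection and streaming for large inputs, which this sketch leaves out.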

pypdf

Github stargazers

8255

Github forks

Commits

1538

Code contributors

241

A pure-python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files

Created

Jan. 6, 2012

Updated

Sept. 28, 2024

License

other

Github repo

Primary Language, based on Github Data

Python

Issues

Homepage

pypdf.readthedocs.io

python-dotenv

Github stargazers

7617

Github forks

430

Commits

356

Code contributors

Reads key-value pairs from a .env file and can set them as environment variables. It helps in developing applications following the 12-factor principles.

Created

Sept. 6, 2014

Updated

July 23, 2024

License

bsd-3-clause

Github repo

Type

CLI

Primary Language, based on Github Data

Python

Issues

Homepage

saurabh-kumar.com
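
The behaviour described above, reading key-value pairs from a .env file and optionally setting them as environment variables, can be approximated with the standard library alone. This is a rough sketch and not python-dotenv's parser or API: the helper names `parse_env_text` and `apply_env` and the `DEMO_APP_*` keys are hypothetical, and only plain `KEY=VALUE` lines, `#` comments, and matching surrounding quotes are handled.

```python
import os


def parse_env_text(text):
    """Parse simple KEY=VALUE lines into a dict (hypothetical helper)."""
    pairs = {}
    for raw in text.splitlines():
        line = raw.strip()
        # Skip blank lines, comments, and lines without an assignment.
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, value = line.split("=", 1)
        key, value = key.strip(), value.strip()
        # Drop one pair of matching surrounding quotes, if present.
        if len(value) >= 2 and value[0] == value[-1] and value[0] in "'\"":
            value = value[1:-1]
        pairs[key] = value
    return pairs


def apply_env(pairs, override=False):
    """Copy pairs into os.environ, keeping existing values unless override is set."""
    for key, value in pairs.items():
        if override or key not in os.environ:
            os.environ[key] = value


# Made-up .env content for illustration.
settings = parse_env_text(
    '# local settings\nDEMO_APP_MODE=dev\nDEMO_APP_NAME="file tools"\n'
)
apply_env(settings)
```

Keeping parsing separate from applying mirrors the "can set them" wording in the description: the pairs can be inspected as a plain dict without touching the process environment. With the library itself installed, the usual entry point is `load_dotenv()` from the `dotenv` package.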

pyWhat

Github stargazers

6580

Github forks

349

Commits

643

Code contributors

🐸 Identify anything. pyWhat easily lets you identify emails, IP addresses, and more. Feed it a .pcap file or some text and it'll tell you what it is! 🧙‍♀️

Created

March 19, 2021

Updated

May 16, 2023

License

mit

Github repo

Primary Language, based on Github Data

Python

Issues

boltons

Github stargazers

6527

Github forks

353

Commits

1534

Code contributors

🔩 Like builtins, but boltons. 250+ constructs, recipes, and snippets which extend (and rely on nothing but) the Python standard library. Nothing like Michael Bolton.

Created

Feb. 20, 2013

Updated

July 9, 2024

License

other

Github repo

Primary Language, based on Github Data

Python

Issues

Homepage

boltons.readthedocs.org

onionshare

Github stargazers

6278

Github forks

647

Commits

4963

Code contributors

235

Securely and anonymously share files, host websites, and chat with friends using the Tor network

Created

May 20, 2014

Updated

Sept. 26, 2024

License

other

Github repo

Primary Language, based on Github Data

Python

Issues

134

Homepage

onionshare.org

cleanrl

Github stargazers

5568

Github forks

635

Commits

825

Code contributors

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Created

June 7, 2019

Updated

Sept. 24, 2024

License

other

Github repo

Primary Language, based on Github Data

Python

Issues

Homepage

docs.cleanrl.dev

fonttools

Github stargazers

4332

Github forks

457

Commits

11465

Code contributors

121

A library to manipulate font files from Python.

Created

July 24, 2013

Updated

Sept. 25, 2024

License

mit

Github repo

Type

Module/library

Primary Language, based on Github Data

Python

Issues

398

gdown

Github stargazers

4276

Github forks

350

Commits

441

Code contributors

Google Drive Public File Downloader when Curl/Wget Fails

Created

Oct. 17, 2015

Updated

May 12, 2024

License

mit

Github repo

Primary Language, based on Github Data

Python

Issues

picard

Github stargazers

3762

Github forks

384

Commits

10232

Code contributors

115

MusicBrainz Picard audio file tagger

Created

Aug. 16, 2011

Updated

Sept. 14, 2024

License

gpl-2.0

Github repo

Primary Language, based on Github Data

Python

Issues

Homepage

picard.musicbrainz.org

ElegantRL

Github stargazers

3711

Github forks

847

Commits

2329

Code contributors

Massively Parallel Deep Reinforcement Learning. 🔥

Created

July 12, 2019

Updated

Sept. 27, 2024

License

other

Github repo

Primary Language, based on Github Data

Python

Issues

139

Homepage

ai4finance.org

XlsxWriter

Github stargazers

3646

Github forks

633

Commits

1363

Code contributors

A Python module for creating Excel XLSX files.

Created

Jan. 4, 2013

Updated

Aug. 30, 2024

License

bsd-2-clause

Github repo

Type

Module/library

Primary Language, based on Github Data

Python

Issues

Homepage

xlsxwriter.readthedocs.io

borb

Github stargazers

3388

Github forks

147

Commits

Code contributors

borb is a library for reading, creating and manipulating PDF files in python.

Created

Nov. 7, 2020

Updated

Aug. 26, 2024

License

other

Github repo

Primary Language, based on Github Data

Python

Issues

Homepage

borbpdf.com

smart_open

Github stargazers

3198

Github forks

384

Commits

1023

Code contributors

113

Utils for streaming large files (S3, HDFS, gzip, bz2...)

Created

Jan. 2, 2015

Updated

Sept. 17, 2024

License

mit

Github repo

Type

Tool/utility

Primary Language, based on Github Data

Python

Issues

caj2pdf

Github stargazers

2953

Github forks

621

Commits

187

Code contributors

Convert CAJ (China Academic Journals) files from CNKI to PDF. Conversion is best-effort, and success is not guaranteed.

Created

Aug. 20, 2017

Updated

Jan. 9, 2024

License

other

Github repo

Primary Language, based on Github Data

Python

Issues

datamodel-code-generator

Github stargazers

2727

Github forks

298

Commits

1034

Code contributors

141

Pydantic model and dataclasses.dataclass generator for easy conversion of JSON, OpenAPI, JSON Schema, and YAML data sources.

Created

May 29, 2019

Updated

Sept. 27, 2024

License

mit

Github repo

Primary Language, based on Github Data

Python

Issues

221

Homepage

koxudaxi.github.io

pex

Github stargazers

2575

Github forks

258

Commits

1420

Code contributors

105

A tool for generating .pex (Python EXecutable) files, lock files and venvs.

Created

July 21, 2014

Updated

Sept. 29, 2024

License

apache-2.0

Github repo

Type

Module/library

Primary Language, based on Github Data

Python

Issues

Homepage

docs.pex-tool.org