newspaper by codelucas

newspaper3k is a news, full-text, and article metadata extraction library for Python 3. Advanced docs:

created at Nov. 25, 2013, 9:50 a.m.

Python

386 +0

13,771 +28

2,092 +3

GitHub
micawber by coleifer

a small library for extracting rich content from urls

created at March 27, 2012, 9:42 p.m.

Python

17 +0

621 -1

90 +0

GitHub
lassie by michaelhelmick

Web Content Retrieval for Humans™

created at July 30, 2013, 8:41 p.m.

HTML

22 +0

600 +0

49 +0

GitHub
html2text by Alir3z4

Convert HTML to Markdown-formatted text.

created at Feb. 19, 2014, 10:41 p.m.

Python

26 +0

1,686 +16

264 +3

GitHub
MechanicalSoup by MechanicalSoup

A Python library for automating interaction with websites.

created at May 26, 2014, 9:06 a.m.

Python

108 +0

4,555 -1

375 +0

GitHub
scikit-video by aizvorski

Video processing routines for SciPy

created at May 29, 2014, 11:04 a.m.

Python

9 +0

136 +0

22 +0

GitHub
pyshorteners by ellisonleao

🔌 Generating short URLs with Python has never been easier

created at Sept. 30, 2013, 6:33 p.m.

Python

16 +0

377 -1

65 +0

GitHub
purl by codeinthehole

A simple, immutable URL class with a clean API for interrogation and manipulation.

created at March 27, 2012, 9:18 p.m.

Python

12 +0

294 -1

39 +0

GitHub
furl by gruns

🌐 URL parsing and manipulation made easy.

created at Nov. 17, 2011, 1:08 a.m.

Python

37 +0

2,578 +3

151 +0

GitHub
twython by ryanmcgrath

Actively maintained, pure Python wrapper for the Twitter API. Supports both normal and streaming Twitter APIs.

created at April 2, 2009, 5:01 a.m.

Python

102 +0

1,848 +0

399 +0

GitHub
gspread by burnash

Google Sheets Python API

created at Dec. 2, 2011, 10:46 a.m.

Python

158 +0

6,891 +9

928 +2

GitHub
facebook-sdk by mobolic

Python SDK for Facebook's Graph API

created at May 11, 2011, 1:38 p.m.

Python

200 +0

2,723 +1

954 +0

GitHub
sqlparse by andialbrecht

A non-validating SQL parser module for Python

created at April 18, 2012, 7:33 p.m.

Python

94 +0

3,594 +4

671 +2

GitHub
python-user-agents by selwin

A Python library that provides an easy way to identify devices like mobile phones, tablets and their capabilities by parsing (browser) user agent strings.

created at Jan. 5, 2013, 12:44 a.m.

Python

38 +0

1,418 +2

196 +0

GitHub
python-nameparser by derek73

A simple Python module for parsing human names into their individual components

created at April 2, 2014, 3:31 a.m.

Python

26 +0

637 +0

134 +0

GitHub
unicode-slugify by mozilla

A slugifier that works in unicode

created at March 24, 2011, 11:08 p.m.

Python

9 +0

320 +0

51 +0

GitHub
python-slugify by un33k

Returns unicode slugs

created at Oct. 15, 2012, 1:44 a.m.

Python

35 +0

1,453 +2

106 +0

GitHub
shortuuid by skorokithakis

A generator library for concise, unambiguous and URL-safe UUIDs.

created at Jan. 8, 2011, 1 p.m.

Python

29 +0

2,003 +5

107 +0

GitHub
python-pinyin by mozillazg

Convert Chinese characters to pinyin (pypinyin)

created at Sept. 14, 2013, 2:01 p.m.

Python

99 +0

4,696 +8

605 +0

GitHub
s3cmd by s3tools

Official s3cmd repo -- Command-line tool for managing S3-compatible storage services (including Amazon S3 and CloudFront).

created at June 7, 2011, 2:08 a.m.

Python

108 +0

4,448 +10

902 +2

GitHub