newspaper by codelucas

newspaper3k is a news, full-text, and article metadata extraction in Python 3. Advanced docs:

created at Nov. 25, 2013, 9:50 a.m.

Python

386 +0

13,828 +17

2,098 +4

GitHub
micawber by coleifer

a small library for extracting rich content from urls

created at March 27, 2012, 9:42 p.m.

Python

17 +0

624 +0

90 +0

GitHub
lassie by michaelhelmick

Web Content Retrieval for Humans™

created at July 30, 2013, 8:41 p.m.

HTML

22 +0

603 +1

49 +0

GitHub
html2text by Alir3z4

Convert HTML to Markdown-formatted text.

created at Feb. 19, 2014, 10:41 p.m.

Python

26 +0

1,718 +9

265 +0

GitHub
MechanicalSoup by MechanicalSoup

A Python library for automating interaction with websites.

created at May 26, 2014, 9:06 a.m.

Python

108 +0

4,581 +6

376 +1

GitHub
scikit-video by aizvorski

Video processing routines for SciPy

created at May 29, 2014, 11:04 a.m.

Python

9 +0

136 +0

22 +0

GitHub
pyshorteners by ellisonleao

electric plug Generating short urls with python has never been easier

created at Sept. 30, 2013, 6:33 p.m.

Python

16 +0

379 +1

65 +2

GitHub
purl by codeinthehole

A simple, immutable URL class with a clean API for interrogation and manipulation.

created at March 27, 2012, 9:18 p.m.

Python

12 +0

295 +0

39 +0

GitHub
furl by gruns

🌐 URL parsing and manipulation made easy.

created at Nov. 17, 2011, 1:08 a.m.

Python

37 +0

2,592 +5

151 +0

GitHub
twython by ryanmcgrath

Actively maintained, pure Python wrapper for the Twitter API. Supports both normal and streaming Twitter APIs.

created at April 2, 2009, 5:01 a.m.

Python

102 +0

1,847 -1

399 +0

GitHub
gspread by burnash

Google Sheets Python API

created at Dec. 2, 2011, 10:46 a.m.

Python

159 +1

6,929 +10

933 +3

GitHub
facebook-sdk by mobolic

Python SDK for Facebook's Graph API

created at May 11, 2011, 1:38 p.m.

Python

200 +0

2,722 +1

954 +0

GitHub
sqlparse by andialbrecht

A non-validating SQL parser module for Python

created at April 18, 2012, 7:33 p.m.

Python

95 +0

3,612 +5

679 +2

GitHub
python-user-agents by selwin

A Python library that provides an easy way to identify devices like mobile phones, tablets and their capabilities by parsing (browser) user agent strings.

created at Jan. 5, 2013, 12:44 a.m.

Python

38 +0

1,419 +0

196 +0

GitHub
python-nameparser by derek73

A simple Python module for parsing human names into their individual components

created at April 2, 2014, 3:31 a.m.

Python

26 +0

639 +0

104 +0

GitHub
unicode-slugify by mozilla

A slugifier that works in unicode

created at March 24, 2011, 11:08 p.m.

Python

9 +0

320 +0

51 +0

GitHub
python-slugify by un33k

Returns unicode slugs

created at Oct. 15, 2012, 1:44 a.m.

Python

35 +0

1,464 +1

107 +1

GitHub
shortuuid by skorokithakis

A generator library for concise, unambiguous and URL-safe UUIDs.

created at Jan. 8, 2011, 1 p.m.

Python

29 +0

2,010 +3

108 +0

GitHub
python-pinyin by mozillazg

汉字转拼音(pypinyin)

created at Sept. 14, 2013, 2:01 p.m.

Python

99 +0

4,713 +3

604 +0

GitHub
s3cmd by s3tools

Official s3cmd repo -- Command line tool for managing S3 compatible storage services (including Amazon S3 and CloudFront).

created at June 7, 2011, 2:08 a.m.

Python

108 +0

4,470 +2

903 +1

GitHub