Sam Hames

Oh, this is what focus is like.

weblyzard/inscriptis: A python based HTML to text conversion library, command line client and Web service.

https://github.com/weblyzard/inscriptis

Extracts text from HTML pages, but does a great job preserving formatting for fairly complex things - this means you can get a nice plaintext rendition of a HTML table out of the box.

Tags

Details

Revised
Created
Edited