textract - node

  •    HTML

A text extraction Node module. For almost all of the supported file types, what textract cares about is the MIME type. So .html and .htm, which share a MIME type, will both be extracted. Other extensions that share a MIME type with a supported one should also extract successfully. For example, application/vnd.ms-excel is the MIME type for .xls, but also for 5 other file types.
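The MIME-type idea can be illustrated with a small sketch. This is not textract's API or implementation: the table `EXTENSION_TO_MIME`, the registry `EXTRACTORS`, and the functions `extract_html` and `extract` are hypothetical names, and the extension table is a tiny illustrative subset. It only shows why two extensions that share a MIME type end up in the same extractor.

```python
import os
from html.parser import HTMLParser

# Hypothetical, deliberately tiny extension table (not textract's real data).
EXTENSION_TO_MIME = {
    ".html": "text/html",
    ".htm": "text/html",
    ".xls": "application/vnd.ms-excel",
}


class _TextCollector(HTMLParser):
    """Collects the text nodes of an HTML document and ignores the tags."""

    def __init__(self):
        super().__init__()
        self.parts = []

    def handle_data(self, data):
        self.parts.append(data)


def extract_html(raw: bytes) -> str:
    """Return the visible text of an HTML document with whitespace collapsed."""
    collector = _TextCollector()
    collector.feed(raw.decode("utf-8"))
    collector.close()
    return " ".join("".join(collector.parts).split())


# Extractors are registered per MIME type, not per file extension.
EXTRACTORS = {"text/html": extract_html}


def extract(filename: str, raw: bytes) -> str:
    """Look up the MIME type from the extension, then dispatch on the MIME type."""
    ext = os.path.splitext(filename)[1].lower()
    mime = EXTENSION_TO_MIME.get(ext)
    if mime is None or mime not in EXTRACTORS:
        raise ValueError(f"no extractor registered for {filename!r}")
    return EXTRACTORS[mime](raw)
```

Because the lookup goes through the MIME type, `extract("page.htm", data)` and `extract("page.html", data)` reach the same function without either extension being special-cased.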

code4goal-resume-parser - Solution for Code4Goal challenge

  •    JavaScript

The library is not currently actively maintained, but I still read all the issues and try to give directions for solving them. I believe that during the year I will find time to fix the existing issues and make it more library-like rather than an application for fun (as it is now).

doc2audiobook - Convert text documents to high fidelity audio(books).

  •    Python

Extract text from a document (textract) and convert it into natural-sounding synthesised speech (Cloud Text-To-Speech), which can leverage DeepMind's WaveNet models. Convert a document to an audiobook using the en-GB-Standard-C voice.
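As a rough sketch of the second step, the snippet below builds the JSON body that the Cloud Text-to-Speech REST method `text:synthesize` accepts for a given voice. It is not doc2audiobook's code: the function name `build_synthesis_request`, the MP3 encoding, and the way the language code is derived from the voice name are assumptions made for illustration, and no request is actually sent.

```python
import json


def build_synthesis_request(text: str, voice_name: str = "en-GB-Standard-C") -> dict:
    """Build a request body for the Text-to-Speech `text:synthesize` REST method.

    Assumption: the language code is the first two dash-separated parts of the
    voice name (for example "en-GB" for "en-GB-Standard-C").
    """
    language_code = "-".join(voice_name.split("-")[:2])
    return {
        "input": {"text": text},
        "voice": {"languageCode": language_code, "name": voice_name},
        # MP3 is an assumed output format; the API supports other encodings too.
        "audioConfig": {"audioEncoding": "MP3"},
    }


if __name__ == "__main__":
    # In a real pipeline the text would come from the extraction step.
    body = build_synthesis_request("Chapter one. It was a bright cold day.")
    print(json.dumps(body, indent=2))
```

Keeping the request builder separate from the network call makes it easy to check which voice and encoding are requested before any audio is synthesised.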