Displaying 1 to 2 from 2 results

textract - node

  •    HTML

A text extraction node module. In almost all cases above, what textract cares about is the mime type. So .html and .htm, both possessing the same mime type, will be extracted. Other extensions that share mime types with those above should also extract successfully. For example, application/vnd.ms-excel is the mime type for .xls, but also for 5 other file types.

tikaondotnet - Use the Java Tika text extraction library on the .NET platform

  •    CSharp

Take a look at our tests for more usage examples. Have an idea to make this project better? Great! Start out by taking a look at our Contributing Guide.