mtail is a tool for extracting metrics from application logs to be exported into a timeseries database or timeseries calculator for alerting and dashboarding.It aims to fill a niche between applications that do not export their own internal state, and existing monitoring systems, without patching those applications or rewriting the same framework for custom extraction glue code.
monitoring logs observability prometheus collector proxy metrics extraction mtail calculator vm compiler bytecodeA text extraction node module. In almost all cases above, what textract cares about is the mime type. So .html and .htm, both possessing the same mime type, will be extracted. Other extensions that share mime types with those above should also extract successfully. For example, application/vnd.ms-excel is the mime type for .xls, but also for 5 other file types.
extract-text extraction nodejs textract extract html csv text pdf docx doc xls xlsx png jpg gif rtf dxf pptx markdown xml odt ott xlsb xlsm xltx ods ots potx odg otgThe simple, customizable, tiny javascript color extractor.
color colors images extract extraction gradientsaubio is a library to label music and sounds. It listens to audio signals and attempts to detect events. For instance, when a drum is hit, at which frequency is a note, or at what tempo is a rhythmic melody. Its features include segmenting a sound file before each of its attacks, performing pitch detection, tapping the beat and producing midi streams from live audio.
audio music analysis sound extraction annotation onset pitch beat tempo-tracking mfccMeyda is a Javascript audio feature extraction library. Meyda supports both offline feature extraction as well as real-time feature extraction using the Web Audio API. We wrote a paper about it, which is available here. Please see the documentation for setup and usage instructions.
audio-features feature-extraction spectral-centroid zero-cross mir music-information-retrieval audio feature extraction music sound information retrievalThis is about extracting different XML entities and wrapping up with jumbled legal English alphabets and outputs as per File format defined in settings.
doc docgeneration document documentgeneration docx extraction generatingretext plugin to extract keywords and key-phrases. This package is ESM only: Node 12+ is needed to use it and it must be imported instead of required.
tensorflow natural-language keyword retext keyword-extraction term retext-plugin unified plugin phrase terminology extractionunrpa is a script to extract files from the RPA archive format created for the Ren'Py Visual Novel Engine. You will need Python 3.x in order to run it (either install through your package manager or directly from python.org).
rpa extraction renpy visual-novelsThis is an extracted copy of Node 0.12's keep-alive Agent implementation with some small changes intended to make it work with older versions of Node. It also has one extra feature, which I needed.The HTTP Agent is used for pooling sockets used in HTTP client requests.
keep-alive agent http https client extractionCreditcard number parsing, validation and information extraction. The source code has been commented using JSDoc and converted to documentation which can be found in the docs folder. The module is available in the NPM registry. It can be installed using the npm command line utility.
iec iso/iec-7812 bic card credit credit-card creditcard extraction iin iso mii parsing validationA simple library that facades org.apache.commons.compress, to provide an easy-to-use API for archiving and compressing into and out of File objects.
compression archiving extractionSchenkerian analysis is a method of musical analysis by interpreting the underlying structure of a tonal work and to help reading the score according to that structure. This library is that, but for HTML built on top of Natural Node which includes term frequency, string similarities, and tokenizing. Given most webpages (attempt) to use the semantics of HTML, it takes into account not only term frequency, but the weight of an HTML tag, placement in document, and other useful forms of denoting significance (like Open Graph).
html keyword analyze extractionExtract rich metadata from URLs. Scrappy uses a simple two step process to extract the metadata from any URL or file. First, it runs through plugin-able scrapeStream middleware to extract metadata about the file itself. With the result in hand, it gets passed on to a plugin-able extract pipeline to format the metadata for presentation and extract additional metadata about related entities.
scraper rdf rdfa microdata metadata html json-ld content extraction rich snippets info open-graph oembedThis is a our public version of Full-Text RSS available to download for free from http://code.fivefilters.org. For best extraction results, and to help us sustain the project, you can purchase the most up-to-date version at http://fivefilters.org/content-only/#download - so if you like this free version, please consider supporting us by purchasing the latest release.
fulltext rss rss-feed-parser extractionExtracts the filesystem from RaiderZ and GunZ: The Second Duel clients.
gunz extractor tool extraction raiderzJava library to extract links (URLs, email addresses) from plain text; fast, small and smart about recognizing where links end
java-library links extraction url autolink linkifymokolo intends to become a collection of machine learning algorithms for Node.js. The current release supports only the Non-Negative Matrix Factorization (NMF) algorithm; more are coming soon. Feedbacks and contributions are greatly appreciated. The easiest way to install mokolo is through npm, the nodejs package manager.
vector matrix geometry math features extractionAlternatively, the output can be configured as XML, Atom or RSS format with the output option. The reason redundant information is included, such as the source, is that each returned nugget is supposed to be an atomic piece of information. As such, each nugget is to contain the information that "somewhere, at some point in time, something was written (with a link to some place)".
text extraction mining statistics metadata scraping crawlingWebdext is a Javascript library for web data extraction (web scraping). Currently, it only supports data records extraction from a list page (a web page containing 2 or more data records). Intelligent extraction algorithm is heavily based on AutoRM [1] and DAG-MTM [2] (not an exact implementation though).
scraping web data extractionThis library was only created to extract video stream URLs from YouTube, not provide a video player. ExoMedia is a great library for playing the video streams to the user. See the sample app for an example. This library uses OkHttp, Moshi and Rhino under the hood, so you may need to apply their ProGuard rules.
rxjava youtube extraction stream-url android kotlin
We have large collection of open source products. Follow the tags from
Tag Cloud >>
Open source products are scattered around the web. Please provide information
about the open source projects you own / you use.
Add Projects.