streamsx.regex - (Incubation) This toolkit provides support for RE2 regular expression library

  •        3

RE2 uses automata theory to guarantee that regular expression searches run in time linear in the size of the input. Web page with SPLDoc for operators and samples: streamsx.regex SPLDoc.

https://github.com/IBMStreams/streamsx.regex

Tags
Implementation
License
Platform

   




Related Projects

Regex Batch Replacer (Multi-File)

  •    

Regex Batch Replacer uses regular expression to find and replace text in multiple files.

url-pattern - easier than regex string matching patterns for urls and other strings

  •    CoffeeScript

easier than regex string matching patterns for urls and other strings. turn strings into data or data into strings.a pattern is immutable after construction. none of its methods changes its state. that makes it easier to reason about.

winton-kafka-streams - A Python implementation of Apache Kafka Streams

  •    Python

Implementation of Apache Kafka's Streams API in Python. Apache Kafka is an open-source stream processing platform developed by the Apache Software Foundation written in Scala and Java. Kafka has Streams API added for building stream processing applications using Apache Kafka. Applications built with Kafka's Streams API do not require any setup beyond the provision of a Kafka cluster.

hyperscan - High-performance regular expression matching library

  •    C++

Hyperscan is a high-performance multiple regex matching library. It follows the regular expression syntax of the commonly-used libpcre library, but is a standalone library with its own C API. Hyperscan uses hybrid automata techniques to allow simultaneous matching of large numbers (up to tens of thousands) of regular expressions and for the matching of regular expressions across streams of data.

Regex Match Maker (RegExmm)

  •    C++

Multi-platform regular expression (regex) creation. Highlights regex matches, and regex groups for quick regular expression creation. Save/load regular expression sessions. Create robust regex patterns on any major platform. Supports PCRE and wxRegEx.


Convert Regex to XML and back

  •    

This is a small library + demo that transforms a .NET regular expression into an XML string and back. It helps understanding a long regex that was written by someone else, and allows easy editing. It's is developed in .NET 3.5 C#, sources included.

crocs - Write regex using pure python class/function syntax and test it better. (Regex for humans).

  •    Python

Write regex using pure python class/function syntax and test it better. The idea behind crocs is simplifying the construction and debugging of regex's. It is possible to implement regex's using a function syntax, the resulting structure is then compiled into a regex's string. it is as well possible to generate random inputs for the regex that would match the regex pattern.

faust - Python Stream Processing

  •    Python

Faust is a stream processing library, porting the ideas from Kafka Streams to Python. It is used at Robinhood to build high performance distributed systems and real-time data pipelines that process billions of events every day.

CRM114 Discriminator - Controllable Rege

  •    C

CRM114 is a Controllable Regex Mutilator and filter. It provides arbitrary stream filtering for antispam, adkiller proxies, syslog monitoring, etc. CRM114 has a structured, easily understood way to express regex-based filters, and a learn-by-example mod

Apache Storm - Distributed and fault-tolerant realtime computation

  •    Java

Storm is a distributed real time computation system. Storm makes it easy to reliably process unbounded streams of data, doing for real time processing what Hadoop did for batch processing. Storm has many use cases: realtime analytics, online machine learning, continuous computation, distributed RPC, ETL, and more.

Regex Lab .NET

  •    

Regex Lab .NET is a full-featured application for editing regular expressions (or regex patterns). The application features a modern, highly functional UI for writing or debugging regular expression patterns. It is written with C# and WinForms.

Silverlight Regex Tester

  •    CSharp

This Regular Expression tester allows .Net developers a very simple way to test regular expressions against a sample text. This regex checker validates the expression syntax and provides results by leveraging the .Net Framework's native System.Text.RegularExpressions.Regex cl...

Regex Builder

  •    

Regex Builder is a tool designed to help you create and debug Regular Expressions. It supports an XML file format including the source text, expression, and options to support easy debugging for non-matches from other applications. The tool also supports partial expression exe...

RegEx Tester

  •    CSharp

With RegEx Tester you can fully develop and test your regular expression against a target text. It's UI is designed to aid you in the RegEx developing. It uses and supports ALL of the features available in the .NET RegEx Class.

Regex Util

  •    Java

A eclipse plugin to test regular expressions (regex). Full highlighting of regexp syntax. Tooltip with detailed description of regex items, etc... Very usefull. Project page has java-applet to test regexp online. (Regex implementation: java.util.regex.*)

Sensorbee - Lightweight stream processing engine for IoT

  •    Go

Sensorbee is designed for low-latency processing of streaming data at the edge of the network. IoT devices frequently generate large volumes of unstructured streaming data, such as video and audio streams. Even if the data streams are structured, they may be meaningless if their temporal characteristics are not considered. Cloud-based services are generally not good at processing these kinds of data. Preprocessing data streams before they are sent to the cloud makes large scale data processing in the cloud more efficient and reduces the usage of network bandwidth.

BoomFilters - Probabilistic data structures for processing continuous, unbounded streams.

  •    Go

Boom Filters are probabilistic data structures for processing continuous, unbounded streams. This includes Stable Bloom Filters, Scalable Bloom Filters, Counting Bloom Filters, Inverse Bloom Filters, Cuckoo Filters, several variants of traditional Bloom filters, HyperLogLog, Count-Min Sketch, and MinHash.Classic Bloom filters generally require a priori knowledge of the data set in order to allocate an appropriately sized bit array. This works well for offline processing, but online processing typically involves unbounded data streams. With enough data, a traditional Bloom filter "fills up", after which it has a false-positive probability of 1.

Hazelcast Jet - A general purpose distributed data processing engine, built on top of Hazelcast.

  •    Java

Hazelcast Jet is a distributed computing platform built for high-performance stream processing and fast batch processing. It embeds Hazelcast In-Memory Data Grid (IMDG) to provide a lightweight, simple-to-deploy package that includes scalable in-memory storage. Hazelcast Jet performs parallel execution to enable data-intensive applications to operate in near real-time.

reactive-streams-jvm - Reactive Streams Specification for the JVM

  •    Java

The purpose of Reactive Streams is to provide a standard for asynchronous stream processing with non-blocking backpressure. Handling streams of data—especially “live” data whose volume is not predetermined—requires special care in an asynchronous system. The most prominent issue is that resource consumption needs to be carefully controlled such that a fast data source does not overwhelm the stream destination. Asynchrony is needed in order to enable the parallel use of computing resources, on collaborating network hosts or multiple CPU cores within a single machine.