python-webrtc-audio-processing - Python bindings of WebRTC Audio Processing

  •        517

Python binding of WebRTC Audio Processing. There are two way to build the package.

https://github.com/xiongyihui/python-webrtc-audio-processing

Tags
Implementation
License
Platform

   




Related Projects

Kurento - Media Server responsible for media transmission, processing, loading and recording

  •    C++

Kurento Media Server is Kurento's core element. It is responsible for media transmission, processing, loading and recording. It is implemented in low level technologies based on GStreamer to optimize the resource consumption. It supports Networked streaming protocols, including HTTP, RTP and WebRTC, Group communications (MCUs and SFUs functionality) supporting both media mixing and media routing/dispatching.

RecordRTC.js - WebRTC JavaScript Library for Audio+Video+Screen+Canvas (2D+3D animation) Recording

  •    Javasript

RecordRTC is WebRTC JavaScript library for audio/video as well as screen activity recording. It supports Chrome, Firefox, Opera, Android, and Microsoft Edge. Platforms: Linux, Mac and Windows.

Pion WebRTC - A pure Go implementation of the WebRTC API

  •    Go

Pion is the Modern Stack for Web Real-Time Communication. It implements the WebRTC API. Spend more time building and less time learning a new API. Its feature include PeerConnection API support for DataChannels, Send/Receive audio and video, Renegotiation. It provides API with direct RTP/RTCP access, API also allows developer to pass their own packetizer, Opus, PCM, H264, VP8 and VP9 packetizer, Sender/Receiver Reports, Easy integration with x264, libvpx, GStreamer and ffmpeg and lot more.

spreed-webrtc - WebRTC audio/video call and conferencing server.

  •    Javascript

Spreed WebRTC implements a WebRTC audio/video call and conferencing server and web client. The latest source of Spreed WebRTC can be found on GitHub. If you are a user, just wanting a secure and private alternative for online communication make sure to check out the Spreedbox, providing a ready to use hardware with Spreed WebRTC included.


LiveKit - Open source, distributed video/audio rooms over WebRTC

  •    Go

LiveKit is an open source project that provides scalable, multi-user conferencing over WebRTC. It's designed to give you everything you need to build real time video/audio capabilities in your applications. It is horizontally scalable WebRTC Selective Forwarding Unit (SFU). It supports advanced features like speaker detection, simulcast, selective subscription, and moderation APIs.

pyo - Python DSP module - ajaxsoundstudio.com

  •    C

pyo is a Python module written in C to help digital signal processing script creation. pyo is a Python module containing classes for a wide variety of audio signal processing types. With pyo, user will be able to include signal processing chains directly in Python scripts or projects, and to manipulate them in real time through the interpreter. Tools in pyo module offer primitives, like mathematical operations on audio signal, basic signal processing (filters, delays, synthesis generators, etc.), but also complex algorithms to create sound granulation and others creative audio manipulations. pyo supports OSC protocol (Open Sound Control), to ease communications between softwares, and MIDI protocol, for generating sound events and controlling process parameters. pyo allows creation of sophisticated signal processing chains with all the benefits of a mature, and widely used, general programming language.

webrtc-jingle-client - Webrtc audio + jingle protocol brought to IOS and Android.

  •    C

Webrtc audio + jingle protocol brought to IOS and Android.

amazon-kinesis-video-streams-webrtc-sdk-c - Amazon Kinesis Video Streams Webrtc SDK is for developers to install and customize realtime communication between devices and enable secure streaming of video, audio to Kinesis Video Streams

  •    C

Create a build directory in the newly checked out repository, and execute CMake from it. mkdir -p amazon-kinesis-video-streams-webrtc-sdk-c/build; cd amazon-kinesis-video-streams-webrtc-sdk-c/build; cmake ..

essentia - C++ library for audio and music analysis, description and synthesis, including Python bindings

  •    Jupyter

Essentia is an open-source C++ library for audio analysis and audio-based music information retrieval released under the Affero GPL license. It contains an extensive collection of reusable algorithms which implement audio input/output functionality, standard digital signal processing blocks, statistical characterization of data, and a large set of spectral, temporal, tonal and high-level music descriptors. The library is also wrapped in Python and includes a number of predefined executable extractors for the available music descriptors, which facilitates its use for fast prototyping and allows setting up research experiments very rapidly. Furthermore, it includes a Vamp plugin to be used with Sonic Visualiser for visualization purposes. Essentia is designed with a focus on the robustness of the provided music descriptors and is optimized in terms of the computational cost of the algorithms. The provided functionality, specifically the music descriptors included in-the-box and signal processing algorithms, is easily expandable and allows for both research experiments and development of large-scale industrial applications. If you use example extractors (located in src/examples), or your own code employing Essentia algorithms to compute descriptors, you should be aware of possible incompatibilities when using different versions of Essentia.

klio - Smarter data pipelines for audio.

  •    Python

Klio is an ecosystem that allows you to process audio files – or any binary files – easily and at scale. Klio jobs are opinionated data pipelines in Python (streaming or batch) built upon Apache Beam and tuned for audio and binary file processing.

wave-share - Serverless, peer-to-peer, local file sharing through sound

  •    C++

A proof-of-concept for WebRTC signaling using sound. Works with all devices that have microphone + speakers. Runs in the browser. Nearby devices negotiate the WebRTC connection by exchanging the necessary Session Description Protocol (SDP) data via a sequence of audio tones. Upon successful negotiation, a local WebRTC connection is established between the browsers allowing data to be exchanged via LAN.

madmom - Python audio and music signal processing library

  •    Python

Madmom is an audio signal processing library written in Python with a strong focus on music information retrieval (MIR) tasks. The library is internally used by the Department of Computational Perception, Johannes Kepler University, Linz, Austria (http://www.cp.jku.at) and the Austrian Research Institute for Artificial Intelligence (OFAI), Vienna, Austria (http://www.ofai.at).

nnAudio - Audio processing by using pytorch 1D convolution network

  •    Python

nnAudio is an audio processing toolbox using PyTorch convolutional neural network as its backend. By doing so, spectrograms can be generated from audio on-the-fly during neural network training and the Fourier kernels (e.g. or CQT kernels) can be trained. Kapre has a similar concept in which they also use 1D convolutional neural network to extract spectrograms based on Keras.

Libav - Audio and video processing tools

  •    C

Libav is a collection of libraries and tools to process multimedia content such as audio, video, subtitles and related metadata.

pedalboard - 🎛 🔊 A Python library for adding effects to audio.

  •    C++

pedalboard is a Python library for adding effects to audio. It supports a number of common audio effects out of the box, and also allows the use of VST3® and Audio Unit plugin formats for third-party effects. It was built by Spotify's Audio Intelligence Lab to enable using studio-quality audio effects from within Python and TensorFlow. Internally at Spotify, pedalboard is used for data augmentation to improve machine learning models. pedalboard also helps in the process of content creation, making it possible to add effects to audio without using a Digital Audio Workstation.

libjitsi - Advanced Java media library for secure real-time audio/video communication.

  •    Java

libjitsi is an advanced Java media library for secure real-time audio/video communication. It allows applications to capture, playback, stream, encode/decode and encrypt audio and video flows. It also allows for advanced features such as audio mixing, handling multiple streams, participation in audio and video conferences. Originally libjitsi was part of the Jitsi client source code but we decided to spin it off so that other projects can also use it. libjitsi is distributed under the terms of the Apache license.

Ant Media Server - Streaming engine software that provides adaptive, ultra low latency streaming by using WebRTC technology

  •    Java

Ant Media Server is designed to provide live video streaming technology infrastructure with ultra-low latency(WebRTC) and low-latency(HLS, CMAF available in v2.2+). It can be used to enable streaming any type of live or on demand video to any devices including mobiles, PCs or IPTV boxes.

audiomentations - A Python library for audio data augmentation

  •    Python

A Python library for audio data augmentation. Inspired by albumentations. Useful for deep learning. Runs on CPU. Supports mono audio and partially multichannel audio. Can be integrated in training pipelines in e.g. Tensorflow/Keras or Pytorch. Has helped people get world-class results in Kaggle competitions. Is used by companies making next-generation audio products. Note: ffmpeg can be installed via e.g. conda or from the official ffmpeg download page.

Beethoven - :guitar: A maestro of pitch detection.

  •    Swift

Beethoven is an audio processing Swift library that provides an easy-to-use interface to solve an age-old problem of pitch detection of musical signals. You can read more about this subject on Wikipedia. The basic workflow is to get the audio buffer from the input/output source, transform it to a format applicable for processing and apply one of the pitch estimation algorithms to find the fundamental frequency. For the end user it comes down to choosing estimation algorithm and implementation of delegate methods.






We have large collection of open source products. Follow the tags from Tag Cloud >>


Open source products are scattered around the web. Please provide information about the open source projects you own / you use. Add Projects.