Displaying 1 to 20 from 24 results

mimesis - Mimesis is a fast and easy to use library for Python, which helps generate synthetic data for a variety of purposes in a variety of languages

  •    Python

Mimesis is a fast and easy to use library for Python programming language, which helps generate synthetic data for a variety of purposes in a variety of languages. This data can be particularly useful during software development and testing. For example, it could be used to populate a testing database, create beautiful JSON and XML files, anonymize data taken from a production service, etc. You can find the complete documentation on the Read the Docs.

lxml-python

  •    Python

lxml is a Pythonic binding for the libxml2 and libxslt libraries.

untangle - Converts XML to Python objects

  •    Python

Converts XML to Python objects

yq - Command-line YAML and XML processor - jq wrapper for YAML/XML documents

  •    Python

Before using yq, you also have to install its dependency, jq. See the jq installation instructions for details and directions specific to your platform. Use the --width/-w argument to pass the line wrap width for string literals. All other command line arguments are forwarded to jq. yq forwards the exit code jq produced, unless there was an error in YAML parsing, in which case the exit code is 1. See the jq manual for more details on jq features and options.




meza - A Python toolkit for processing tabular data

  •    Python

meza is a Python library for reading and processing tabular data. It has a functional programming style API, excels at reading/writing large files, and can process 10+ file types. meza has been tested and is known to work on Python 2.7, 3.5, and 3.6; PyPy2 5.8.0, and PyPy3 5.8.0.

mkvparse - Simple Python matroska (mkv) reading library, also mkv2xml and xml2mkv

  •    Python

Supports lacing and setting global timecode scale, subtitles (BlockGroup). Does not support cues, tags, chapters, seeking and so on. Supports resyncing when something bad is encountered in matroska stream. Subtitles should remain as text, binary data gets encoded to hex.

jinja2-cli - CLI for Jinja2

  •    Python

If PyYAML is present, you can use YAML as an input data source. If toml is present, you can use TOML as an input data source.

python-sepadd - SEPA Direct Debit XML generation in python

  •    Python

This is a python implementation to generate SEPA direct debit XML files.


text-to-freemind - A simple text to Freemind conversion program

  •    Python

This program converts tab-indented UTF-8 text files into an XML format suitable for display by Freemind. It was written out of annoyance with the Freemind user interface, and the lack of 'merging' capabilities when collaborating with other people. This program is distributed under the GPL v2 (or later) license.

kajiki - Python XML-based template engine with Genshi-like syntax and Jinja-style blocks

  •    Python

Because Kajiki's input is XML, it can ensure that your HTML/XML output is well-formed. The Genshi-like syntax, based on XML attributes or tags, is simply beautiful and easily understood (or ignored) by web designers. But instead of the slow performance of Genshi, Kajiki compiles templates to Python code that renders with blazing-fast speed, so Kajiki can compete with the speed of Jinja, Mako, Chameleon and others. Also, one of Genshi's misfeatures -- py:match -- is replaced with blocks which work like Jinja's blocks. Kajiki is licensed under an MIT-style license.

ciseau - :rocket: Tokenize and clean strings in Python

  •    Python

Word and sentence tokenization in Python. sent_tokenize can keep the whitespace as-is with the flags keep_whitespace=True and normalize_ascii=False.

folia - FoLiA: Format for Linguistic Annotation - FoLiA is a rich XML-based annotation format for the representation of language resources (including corpora) with linguistic annotations

  •    Python

FoLiA is an XML-based annotation format, suitable for the representation of linguistically annotated language resources. FoLiA’s intended use is as a format for storing and/or exchanging language resources, including corpora. Our aim is to introduce a single rich format that can accommodate a wide variety of linguistic annotation types through a single generalised paradigm. We do not commit to any label set, language or linguistic theory. This is always left to the developer of the language resource, and provides maximum flexibility. XML is an inherently hierarchic format. FoLiA does justice to this by maximally utilising a hierarchic, inline, setup. We inherit from the D-Coi format, which posits to be loosely based on a minimal subset of TEI. Because of the introduction of a new and much broader paradigm, FoLiA is not backwards-compatible with D-Coi, i.e. validators for D-Coi will not accept FoLiA XML. It is however easy to convert FoLiA to less complex or verbose formats such as the D-Coi format, or plain-text. Converters are provided.

Gelatin - Transform text files to XML, JSON, or YAML

  •    Python

Gelatin is a parser generator for converting text to a structured format such as XML, JSON or YAML. Gelatin is a combined lexer, parser, and output generator. Gelatin defines a simple language for converting text into a structured formats.

cfgdiff - diff(1) all your configs

  •    Python

diff(1) is an awesome tool, you use it (or similar implementations like git diff, svn diff etc) every day when dealing with code. But configuration files aren't code. Indentation often does not matter (yeah, there is diff -w and yeah, people use YAML for configs), order of settings does not matter and comments are just beautiful noise. cfgdiff will try to parse your configuration files, fetching all the relevant keys and values from them and then pretty-printing them in the original format. These results are then diffed and the diff is shown to you.

PhiloLogic4 - PhiloLogic4

  •    Python

PhiloLogic is an XML database/search engine/web app that is designed for the particular difficulties of TEI XML. For a more theoretical description, you can refer to our research publications or our blog.

fort-pymdwizard

  •    Python

The MetadataWizard is a useful tool designed to facilitate FGDC metadata creation for spatial and non-spatial data sets. It is a cross-platform desktop application built using an open-source Python architecture. It provides a pleasant and highly efficient environment for metadata creation, editing, preview, and validation. Built in tools automate and facilitate the creation of high quality metadata records.

python-hsreplay - Python library for creating and parsing HSReplay XML files

  •    Python

A python module for HSReplay support. The library is available on PyPI. pip install hsreplay will install it.

devops-python-tools - DevOps CLI Tools for Hadoop, Spark, HBase, Log Anonymizer, Ambari Blueprints, AWS CloudFormation, Linux, Docker, Spark Data Converters & Validators (Avro/Parquet/JSON/CSV/INI/XML/YAML), Elasticsearch, Solr, Travis CI, Pig, IPython - Python / Jython Tools

  •    Python

A few of the Big Data, NoSQL & Linux tools I've written over the years. All programs have --help to list the available options. For many more tools see the DevOps Perl Tools and Advanced Nagios Plugins Collection repos which contains many Hadoop, NoSQL, Web and infrastructure tools and Nagios plugins.

ansible-config_encoder_filters - Ansible role used to deliver the Config Encoder Filters.

  •    Python

This is an Ansible role used to deliver the Config Encoder Filters as a dependency of another Ansible role. Ansible Galaxy contains a lot of useful roles. Some of them exist in many variations which differ only by their parameterization. The parameterization is often used mainly in templates which generate the configuration file. A good example such issues are roles for Nginx of which you can find almost 200 in the Ansible Galaxy.