structured-text-tools - A list of command line tools for manipulating structured text data

  •        15

The following is a list of text-based file formats and command line tools for manipulating each. Delimiter-separated values, including CSV, TSV, etc.

https://github.com/dbohdan/structured-text-tools

Tags
Implementation
License
Platform

   




Related Projects

Tikka - A content analysis toolkit

  •    Java

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

TextQL - Execute SQL against structured text like CSV or TSV

  •    Go

TextQL allows you to easily execute SQL against structured text like CSV or TSV.

RecordEditor

  •    Java

Editor for Fixed Width, Csv and Existing Xml files.

FSharp.Data - F# Data: Library for Data Access

  •    HTML

The F# Data library (FSharp.Data.dll) implements everything you need to access data in your F# applications and scripts. It implements F# type providers for working with structured file formats (CSV, HTML, JSON and XML) and for accessing the WorldBank data. It also includes helpers for parsing CSV, HTML and JSON files and for sending HTTP requests.We're open to contributions from anyone. If you want to help out but don't know where to start, you can take one of the Up-For-Grabs issues, or help to improve the documentation.


rq - Record Query - A tool for doing record analysis and transformation

  •    Javascript

This is the home of the tool called rq (record query). It's a tool that's used for performing queries on streams of records in various formats. The goal is to make ad-hoc exploration of data sets easy without having to use more heavy-weight tools like SQL/MapReduce/custom programs. rq fills a similar niche as tools like awk or sed, but works with structured (record) data instead of text.

config - Config is a lightweight configuration file loader that supports PHP, INI, XML, JSON, and YAML files

  •    PHP

Config is a file configuration loader that supports PHP, INI, XML, JSON, and YML files. Config requires PHP 5.5.9+.

Swiss File Knife

  •    C++

Multi function command line tool that belongs onto every usb stick.

HTMLtools

  •    Java

HTMLtools includes several Java HTML tools for preparing Web pages. The HTMLtools program automates batch conversion of tab-delimited spreadsheet text files to HTML Web-page files, file amp; table editing, keyword mapping, templates, and more.

PEAR Framework - reusable PHP components

  •    PHP

PEAR is a framework and distribution system for reusable PHP components. It has all categories of components from DB access, security, xml parsing, encryption etc.

TWiki - Wiki and Web 2.0 Application Platform

  •    Perl

TWiki is a flexible, powerful, and easy to use enterprise wiki, enterprise collaboration platform, and web application platform. It is a Structured Wiki, typically used to run a project development space, a document management system, a knowledge base, or any other groupware tool, on an intranet, extranet or the Internet. TWiki is a cgi-bin script written in Perl. It reads a text file, hyperlinks it and converts it to HTML on the fly.

imgmin - Lossy image optimization

  •    C

Image files constitute a majority of static web traffic.[17] Unlike text-based web file formats, binary image files do not benefit from built-in webserver-based HTTP gzip compression. imgmin offers an automated means for enforcing image quality as a standalone tool and as a webserver module. imgmin determines the optimal balance of image quality and filesize, often greatly reducing image size while retaining quality for casual use, which translates into more efficient use of storage and network bandwidth, which saves money and improves user experience. Websites are composed of several standard components. Most (HTML, CSS, Javascript, JSON, XML, etc) are text-based. They can be efficiently compressed for transfer via gzip, supported by all mainstream webservers and browsers. But image and video files are binary, non-text files, and generally are not worth auto-compressing in the webserver.

ServiceStack text - NET's fastest JSON, JSV and CSV Text Serializers

  •    CSharp

ServiceStack.Text is an independent, dependency-free serialization library that contains ServiceStack's text processing functionality, including: JsonSerializer, TypeSerializer (JSV-Format), CsvSerializer, T.Dump extension method, StringExtensions - Xml/Json/Csv/Url encoding, BaseConvert, Rot13, Hex escape, etc., Stream, Reflection, List, DateTime, etc extensions and utils.

Manticore Search - High performance full-text search engine with SQL and JSON support

  •    C++

Manticore Search is an open source high performance full-text search oriented engine. It is a fork of Sphinx Search. Manticore Search is written in C++. It means speed and low resource consumption, it means you don’t have to worry about a garbage collector that suddenly makes a trouble.

JavaCSV - Java CSV Library

  •    Java

Java CSV is a small fast open source java library for reading and writing CSV and plain delimited text files. All kinds of CSV files can be handled, text qualified, Excel formatted, etc.

TablePad

  •    C++

TablePad is a tool to edit table-structured text files. It reads CSV files and also imports tables using ODBC datasources. You can add and remove rows and columns, edit the content of the cells, seach for any text and more.

HackMyResume - Generate polished résumés and CVs in HTML, Markdown, LaTeX, MS Word, PDF, plain text, JSON, XML, YAML, smoke signal, and carrier pigeon

  •    Javascript

Create polished résumés and CVs in multiple formats from your command line or shell. Author in clean Markdown and JSON, export to Word, HTML, PDF, LaTeX, plain text, and other arbitrary formats. Fight the power, save trees. Compatible with FRESH and JRS resumes. HackMyResume is built with Node.js and runs on recent versions of OS X, Linux, or Windows. View the FAQ.

slap - Sublime-like terminal-based text editor

  •    Javascript

slap supports INI or JSON config files. You can put configuration wherever rc can find it. A mostly empty configuration file with some useful comments is created in ~/.slap/config if an existing file isn't found.Slap is fully customizeable and supports plugins written in JS. You can place single JS files, or NodeJS packages, into ~/.slap/plugins/.

sublimetext_indentxml - Plugin for Sublime Text editor for reindenting XML and JSON files

  •    Python

Want more? It can indent only selected text - including multiple selections and even mixed XML / JSON selections. It is smart enough to recognize XML or JSON even if you are editing plain text. Indent plugin won't mess up your keyboard shortcuts because it uses "chord" command Ctrl+K, Ctrl+F (this mean hold Ctrl, press K then press F, release Ctrl) and also available in "Selection" menu. Currently this plugin performs patching of xml.dom.minidom.CDataSection class in order to fix CDATA indenting in toprettyxml function. This can be disabled in indent_xml.Sublime-settings file by setting disable_patch_minidom to true.





We have large collection of open source products. Follow the tags from Tag Cloud >>


Open source products are scattered around the web. Please provide information about the open source projects you own / you use. Add Projects.