node-vcf - A not so forgiving vCard / vcf parser

  •        6

A not so forgiving vCard / vcf parser

https://github.com/jhermsmeier/node-vcf

Dependencies:

camelcase : ~3.0.0
foldline : ~1.0.0

Tags
Implementation
License
Platform

   




Related Projects

vcflib - a simple C++ library for parsing and manipulating VCF files, + many command-line utilities

  •    C++

The Variant Call Format (VCF) is a flat-file, tab-delimited textual format intended to concisely describe reference-indexed variations between individuals. VCF provides a common interchange format for the description of variation in individuals and populations of samples, and has become the defacto standard reporting format for a wide array of genomic variant detectors. The API itself provides a quick and extremely permissive method to read and write VCF files. Extensions and applications of the library provided in the included utilities (*.cpp) comprise the vast bulk of the library's utility for most users.

vCard Contact Info Parser

  •    Java

This is a simple program that allows you to get the information in a vCard file (.vcf) as plain text. vCards are used to store contact information and may be created by Apple's Address Book and other similar programs, along with mobile phones.

freebayes - Bayesian haplotype-based genetic polymorphism discovery and genotyping.

  •    C++

FreeBayes is a Bayesian genetic variant detector designed to find small polymorphisms, specifically SNPs (single-nucleotide polymorphisms), indels (insertions and deletions), MNPs (multi-nucleotide polymorphisms), and complex events (composite insertion and substitution events) smaller than the length of a short-read sequencing alignment. FreeBayes uses short-read alignments (BAM files with Phred+33 encoded quality scores, now standard) for any number of individuals from a population and a reference genome (in FASTA format) to determine the most-likely combination of genotypes for the population at each position in the reference. It reports positions which it finds putatively polymorphic in variant call file (VCF) format. It can also use an input set of variants (VCF) as a source of prior information, and a copy number variant map (BED) to define non-uniform ploidy variation across the samples under analysis.

VCardSplitter

  •    

When importing and exporting contact data, sometimes you will come across a single VCard (.vcf) that contains multiple contacts. Outlook has no native way to read this file. This simple Windows Forms application splits a single file into one VCard for each contact, allowing ...

VCF Builder IDE

  •    C++

The VCF Builder is an advanced development tool for creating C++ applications, and supporting a wide number of plugins for enhancing it's functionality. While the VCF Builder is capable of creating generic C++ applications, it's forte is building GUI ap


Wiki in a Jar

  •    Java

quot;Wiki in a Jarquot; is a small Wiki written in Java with great focus on simplicity. It is intended to run on a USB stick as a replacement for a paper note- and address book as a Personal Information Manager (PIM). It supports vcard (vcf) contact files.

vcf2csv

  •    

vcf2csv is a command line tool, written in ANSI-C (C90), converts a vcard (vcf) file to csv. Please note: the ability to generate HTML-files will be removed in the near future.

VCF Viewer

  •    

Shows you all VCF files, that are inside chosen folder. You can view and read them, nothing more (at least now).

GnuAccounting

  •    Java

An open-source java accounting application that integrates OpenOffice, Apache Derby and HBCI/FinTS to create and manage invoices, credit memos, delivery notes, bills etc. Imports from kTimeTracker, Task Coach, VCF, Hibiscus, Moneyplex, Starmoney, exports to Winston, VCF, openTrans et. al.

picard - A set of command line tools (in Java) for manipulating high-throughput sequencing (HTS) data and formats such as SAM/BAM/CRAM and VCF

  •    Java

A set of Java command line tools for manipulating high-throughput sequencing (HTS) data and formats. Picard is implemented using the HTSJDK Java library HTSJDK to support accessing file formats that are commonly used for high-throughput sequencing data such as SAM and VCF.

pysam - Pysam is a Python module for reading and manipulating SAM/BAM/VCF/BCF files

  •    C

Pysam is a python module for reading and manipulating files in the SAM/BAM format. The SAM/BAM format is a way to store efficiently large numbers of alignments (Li 2009), such as those routinely created by next-generation sequencing methods. Pysam is a lightweight wrapper of the samtools C-API. Pysam also includes an interface for tabix.

bioawk - BWK awk modified for biological data

  •    C

Bioawk is an extension to Brian Kernighan's awk, adding the support of several common biological data formats, including optionally gzip'ed BED, GFF, SAM, VCF, FASTA/Q and TAB-delimited formats with column names. It also adds a few built-in functions and an command line option to use TAB as the input/output delimiter. When the new functionality is not used, bioawk is intended to behave exactly the same as the original BWK awk. The original awk requires a YACC-compatible parser generator (e.g. Byacc or Bison). Bioawk further depends on zlib so as to work with gzip'd files.

nomnom - [UNMAINTAINED] Option parser for node with generated usage and commands

  •    Javascript

nomnom is an option parser for node. It noms your args and gives them back to you in a hash. Nomnom supports args like -d, --debug, --no-debug, --file=test.txt, --file test.txt, -f test.txt, -xvf, and positionals. Positionals are arguments that don't fit the -a or --atomic format and aren't attached to an option.

vCard/vCal Parser C Library

  •    C

A C callback orientated parser library for card (vCard/vCal) objects, reentrant amp; stream orientated, handles line folding, automatic decoding of params and data

vCard PHP Parser

  •    PHP

vCardPHP parses a vCard file and displays the results as a Web page. Many address book programs export their data as a vCard file such as gnomecard and the Palm desktop.

vcftools

  •    Perl

A set of tools written in Perl and C++ for working with VCF files, such as those generated by the 1000 Genomes Project

nucleus - Python and C++ code for reading and writing genomics data.

  •    Python

Nucleus is a library of Python and C++ code designed to make it easy to read, write and analyze data in common genomics file formats like SAM and VCF. In addition, Nucleus enables painless integration with the TensorFlow machine learning framework, as anywhere a genomics file is consumed or produced, a TensorFlow tfrecords file may be substituted. For all other systems, you will need to first install CLIF by following the instructions at https://github.com/google/clif#installation before running install.sh.

htslib - C library for high-throughput sequencing data formats

  •    C

HTSlib is an implementation of a unified C library for accessing common file formats, such as SAM, CRAM and VCF, used for high-throughput sequencing data, and is the core library used by samtools and bcftools. HTSlib only depends on zlib. It is known to be compatible with gcc, g++ and clang. HTSlib implements a generalized BAM index, with file extension .csi (coordinate-sorted index). The HTSlib file reader first looks for the new index and then for the old if the new index is absent.

bedtools2 - A powerful toolset for genome arithmetic.

  •    C++

Collectively, the bedtools utilities are a swiss-army knife of tools for a wide-range of genomics analysis tasks. The most widely-used tools enable genome arithmetic: that is, set theory on the genome. For example, bedtools allows one to intersect, merge, count, complement, and shuffle genomic intervals from multiple files in widely-used genomic file formats such as BAM, BED, GFF/GTF, VCF. While each individual tool is designed to do a relatively simple task (e.g., intersect two interval files), quite sophisticated analyses can be conducted by combining multiple bedtools operations on the UNIX command line.

gemini - a lightweight db framework for exploring genetic variation.

  •    Python

The intent of GEMINI (GEnome MINIing) is to provide a simple, flexible, and powerful framework for exploring genetic variation for personal and medical genetics. GEMINI is unique in that it integrates genetic variation (from VCF files) with a wealth of genome annotations into a unified database framework. Using this integrated database as the analysis framework, we aim to leverage the expressive power of SQL for data analysis, while attempting to overcome the fundamental challenges associated with using databases for very large (e.g. 1,000,000 variants times 1,000 samples yields one billion genotypes) datasets. In addition, by defining sample relationships with a PED file, GEMINI allows one to explore and test for variants that meet specific inheritance models (e.g., recessive, dominant, etc.). The following is a video of a high-level talk from SciPy 2013 describing GEMINI.