
ISO-3166-Countries-with-Regional-Codes - ISO 3166-1 country lists merged with their UN Geoscheme regional codes in ready-to-use JSON, XML, CSV data sets


These lists merge data from two sources: the Wikipedia ISO 3166-1 article, for alpha and numeric country codes, and the UN Statistics site, for countries' regional and sub-regional codes. In addition to countries, they include dependent territories. The International Organization for Standardization (ISO) site provides partial data (capitalised and sometimes stripped of non-Latin ornamentation), but sells the complete data set as a Microsoft Access 2003 database. Other sites give you the numeric and character codes, but there appeared to be no sites that included the associated UN-maintained regional codes in their data sets. I scraped the data, all of it already publicly available, from the two websites above to produce ready-to-use, complete data sets that will hopefully save time for someone with similar needs.
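As a rough illustration of the kind of merge described above, the following Python sketch joins a list of ISO 3166-1 codes with a list of UN regional codes on the numeric country code. The sample rows, the column names (`alpha-2`, `country-code`, `sub-region`, and so on) and the function `merge_country_data` are assumptions made for this example; they are not taken from the project's scraping code.

```python
import csv
import io
import json

# Hypothetical sample rows standing in for the two scraped sources.
ISO_CODES_CSV = """name,alpha-2,alpha-3,country-code
France,FR,FRA,250
Japan,JP,JPN,392
"""

UN_REGIONS_CSV = """country-code,region,sub-region
250,Europe,Western Europe
392,Asia,Eastern Asia
"""


def merge_country_data(iso_csv, regions_csv):
    """Join ISO code rows with UN region rows on the numeric country code."""
    # Index the regional rows by numeric code. Codes stay strings so that
    # zero-padded values such as "004" are not mangled.
    regions = {
        row["country-code"]: row
        for row in csv.DictReader(io.StringIO(regions_csv))
    }
    merged = []
    for row in csv.DictReader(io.StringIO(iso_csv)):
        region = regions.get(row["country-code"], {})
        merged.append({
            **row,
            # Entries without a regional match get empty strings.
            "region": region.get("region", ""),
            "sub-region": region.get("sub-region", ""),
        })
    return merged


if __name__ == "__main__":
    print(json.dumps(merge_country_data(ISO_CODES_CSV, UN_REGIONS_CSV), indent=2))
```

Keeping the numeric code as a string is a deliberate choice here: it is the join key, and converting it to an integer would lose the leading zeros.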

awesome-json-datasets - A curated list of awesome JSON datasets that don't require authentication.


A curated list of awesome JSON datasets that don't require authentication. Pro Tip: Check out Blockchain Data API for more options.

Mobius - C# and F# language binding and extensions to Apache Spark


Mobius provides a C# language binding to Apache Spark, enabling the implementation of Spark driver programs and data processing operations in languages supported by the .NET Framework, such as C# or F#. For more code samples, refer to the Mobius\examples directory or the Mobius\csharp\Samples directory.




pandas-datareader - Extract data from a wide range of Internet sources into a pandas DataFrame.


Up-to-date remote data access for pandas that works with multiple versions of pandas. As of v0.6.0, Yahoo!, Google Options, Google Quotes and EDGAR have been immediately deprecated due to large changes in their APIs and no stable replacement.

Xml To Csv Conversion Tool


This project contains an API that you can use to convert data stored in XML to comma-separated values (CSV). A Windows Forms client application is also included. It is programmed in C# 4.0.
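To make the conversion concrete, here is a minimal Python sketch of flattening repeated XML records into CSV rows. It is not the project's C# API; the sample document, the `record_tag` parameter and the function `xml_to_csv` are hypothetical, and the sketch assumes each record's fields are direct child elements with text content.

```python
import csv
import io
import xml.etree.ElementTree as ET

# Hypothetical input: one <book> element per record.
SAMPLE_XML = """<books>
  <book><title>Dune</title><year>1965</year></book>
  <book><title>Neuromancer</title><year>1984</year></book>
</books>"""


def xml_to_csv(xml_text, record_tag):
    """Turn every <record_tag> element into one CSV row."""
    root = ET.fromstring(xml_text)
    records = [
        {child.tag: (child.text or "").strip() for child in record}
        for record in root.iter(record_tag)
    ]
    # Column order follows the first appearance of each child tag.
    fieldnames = []
    for record in records:
        for key in record:
            if key not in fieldnames:
                fieldnames.append(key)
    out = io.StringIO()
    writer = csv.DictWriter(out, fieldnames=fieldnames, lineterminator="\n")
    writer.writeheader()
    writer.writerows(records)  # missing fields are written as empty cells
    return out.getvalue()


if __name__ == "__main__":
    print(xml_to_csv(SAMPLE_XML, "book"))
```

Nested elements and attributes are ignored in this sketch; a real converter would need a rule for mapping them to columns.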

DataFromFile


DataFromFile is a small class (written in C#) which makes it easy to deal with data files such as Excel (xls or xlsx) or CSV.



jLinq


LINQ-style functionality for JavaScript. Allows you to work with sets of data using query-style syntax to select, order and sort records. Useful for medium-sized arrays of information that shouldn't hit the server each time you need to select, sort or query them.

DataSetInspector plugin for Fiddler HTTP Debugger


This is an inspector plugin for the Fiddler HTTP Debugger that shows datasets in a .NET DataGridView control.

SharePoint List Reader


The SharePoint List Reader is a simple Windows application that reads data from a list in SharePoint and lets the user view the data and export it to an XML file.

telemetry-batch-view - A Scala framework to build derived datasets, aka batch views, of Telemetry data


This is a Scala application to build derived datasets, also known as batch views, of Telemetry data. Raw JSON pings are stored on S3 within files containing framed Heka records. Reading the raw data in through, e.g., Spark can be slow because a given analysis typically uses only a few fields, not to mention the cost of parsing the JSON blobs. Furthermore, Heka files might contain only a handful of records under certain circumstances.
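The idea of a derived dataset can be sketched in a few lines of Python: parse each raw JSON ping once and keep only the handful of fields an analysis needs. This is an illustration under stated assumptions, not the project's Scala code; the ping layout, the field names in `FIELDS` and the helper functions are all hypothetical, and the sketch skips the Heka framing and S3 access entirely.

```python
import json

# Hypothetical raw pings, one JSON document per string.
RAW_PINGS = [
    '{"clientId": "a1", "environment": {"build": {"version": "55.0"}},'
    ' "payload": {"info": {"subsessionLength": 120}}}',
    '{"clientId": "b2", "environment": {"build": {}},'
    ' "payload": {"info": {"subsessionLength": 45}}}',
    "not valid json",
]

# Output column name -> path of keys inside the ping (assumed layout).
FIELDS = {
    "client_id": ("clientId",),
    "version": ("environment", "build", "version"),
    "subsession_length": ("payload", "info", "subsessionLength"),
}


def get_path(obj, path):
    """Follow a sequence of keys through nested dicts; None if any is missing."""
    for key in path:
        if not isinstance(obj, dict) or key not in obj:
            return None
        obj = obj[key]
    return obj


def derive_view(raw_pings):
    """Project each parseable ping onto the flat set of columns in FIELDS."""
    rows = []
    for line in raw_pings:
        try:
            ping = json.loads(line)
        except json.JSONDecodeError:
            continue  # skip malformed pings instead of failing the batch
        rows.append({name: get_path(ping, path) for name, path in FIELDS.items()})
    return rows


if __name__ == "__main__":
    for row in derive_view(RAW_PINGS):
        print(row)
```

Paying the JSON parsing cost once, when the view is built, means later analyses can read the flat rows without touching the raw blobs again.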

hub-db - Dataset of Adult Image Metadata for ML and NLP and Whatever Else


hub-db is a dataset of information about albums on the adult website PornHub. This application crawls through the 'most viewed' search pages (which pages are defined in the config.json file) and recursively crawls all albums on those pages, and the images from those albums. No images are saved, but links to the images as well as tag metadata, upload timestamp, comments, etc. are saved. This repository includes both the code to crawl PornHub to get this data and the dataset itself (when crawling is done).

reuters-21578-json - A terse and JSONified version of the Reuters 21578 dataset


A JSONified and simplified version of the famous Reuters 21578 dataset.

clusterdata - cluster data collected from production clusters in Alibaba for cluster management research


The trace data, ClusterData201708, contains cluster information from a production cluster over a 24-hour period, covering about 1.3k machines that run both online services and batch jobs. Please let us know if you have any issues, ideas, or papers about these data by sending email to us aliababa-clusterdata. The more specific the feedback, the more likely we are to be able to help you.

jquery-linechart - JQuery plugin for creating charts


jQuery plugin for building a line chart. Chart ruler built entirely in HTML/CSS/JS. Bar chart, calendar view visualisation. Diagram, graph, pyramid visualisation of large datasets. Showreel. The source for this module is in the main repo. Please create issues and pull requests. An Angular plugin for the line chart also exists. Check angular-scale if you're using Angular.js. Inspired by the kinopoisk.ru chart written using Adobe Flash, but this chart uses only HTML/CSS, without libraries. Feel free to contribute.

scale - Angular plugin for creating charts


Angular plugin for building a scale of items. Chart ruler built entirely in HTML/CSS/JS. Bar chart, line chart, calendar view visualisation. Diagram, graph, pyramid visualisation of large datasets. Showreel. The source for this module is in the main repo. Please create issues and pull requests. Inspired by the kinopoisk.ru chart written using Adobe Flash, but this chart uses only HTML/CSS, without libraries. Feel free to contribute.

TextGenerator - TextGenerator is a PHP package that aims to generate automated texts from data.


TextGenerator is a PHP package that aims to generate automated texts from data. Feel free to comment and contribute. Aside from the PHP package, a Google Spreadsheet Add-on is available on the Chrome Web Store. It lets users produce automated text content from data directly within Google Spreadsheet. A complete tutorial for the Spreadsheet Add-on is available here.
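The general idea of producing text from data can be illustrated with a short Python sketch that fills sentence templates from a row of values. It does not use TextGenerator's own template syntax or API; the templates, the field names and the function `generate_text` are hypothetical and only show the technique in its simplest form.

```python
import random
from string import Template

# Hypothetical sentence variants; $name, $goals and $matches are placeholders.
TEMPLATES = [
    Template("$name scored $goals goals in $matches matches."),
    Template("In $matches matches, $name found the net $goals times."),
]


def generate_text(row, rng=None):
    """Pick one template at random and fill it with values from the row."""
    rng = rng or random.Random()
    return rng.choice(TEMPLATES).substitute(row)


if __name__ == "__main__":
    print(generate_text({"name": "Alex", "goals": 7, "matches": 10}))
```

Passing a seeded `random.Random` instance as `rng` makes the choice of variant reproducible, which helps when the generated texts need to be reviewed or tested.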