solrbulk - SOLR bulk indexing utility for the command line.

  •        7

solrbulk expects as input a file with line-delimited JSON. Each line represents a single document. solrbulk takes care of reformatting the documents into the bulk JSON format, that SOLR understands. solrbulk will send documents in batches and in parallel. The number of documents per batch can be set via -size, the number of workers with -w.

https://github.com/miku/solrbulk

Tags
Implementation
License
Platform

   




Related Projects

Solr - Blazing-fast, open source enterprise search platform

  •    Java

Solr is the popular, blazing fast open source enterprise search platform from the Apache Lucene project. Its major features include powerful full-text search, hit highlighting, faceted search, dynamic clustering, database integration, and rich document (e.g., Word, PDF) handling. Solr is highly scalable, providing distributed search and index replication, and it powers the search and navigation features of many of the world's largest internet sites.

solr-node-client - A solr client for node.js.

  •    Javascript

A solr client for indexing, adding, deleting,committing and searching documents within an Apache Solr installation. It supports commands like search(select), add, delete, update, commit, rollback, optimize, ping, real-time get, prepare commit, soft commit, arbitrary search handler (i.e: mlt, luke ...), Lucene query / DisMax query, Grouping / Field Collapsing, Facet, MoreLikeThis etc.

Lux - XML Search engine

  •    Java

Lux is an open source XML search engine using Lucene /Solr and Saxon XQuery/XSLT processor. Lux provides XML-aware indexing, an XQuery 1.0 optimizer that rewrites queries to use the indexes, and a function library for interacting with Lucene via XQuery. These capabilities are tightly integrated with Solr, and leverage its application framework in order to deliver a REST service, application server, and supporting tools.

Constellio - Enterprise Search engine

  •    Java

Constellio Open Source Enterprise Search is based on Apache Solr and using Google Search Appliances connectors architecture, it allows, with a single click, to find all relevant content in your organization (Web, email, ECM, CRM etc.).

Hydra - Distributed processing framework for search solutions

  •    Java

Hydra is designed to give the search solution the tools necessary to modify the data that is to be indexed in an efficient and flexible way. This is done by providing a scalable and efficient pipeline which the documents will have to pass through before being indexed into the search engine. Architecturally Hydra sits in between the search engine and the source integration.


YouSeer

  •    Java

YouSeer is an open source search engine framework, which was built on top of other open source components. It’s part of the general SeerSuite framework. YouSeer utilizes Hereitrix as a crawler and solr as an indexing system.

Solrj - Official Solr client

  •    Java

Solrj is the official java client to access Solr. It offers a java interface to add, update, and query the solr index. SolrJ hides a lot of the details of connecting to Solr and allows your application to interact with Solr with simple high-level methods. The SolrJ API is included with Solr, so you do not have to download or install anything else.

AJAX Solr - JavaScript library for creating user interfaces to Apache Solr

  •    Javascript

AJAX Solr is a JavaScript library for creating user interfaces to Apache Solr. AJAX Solr supports all features of Solr, including spell check, spatial local parameters, local parameters, facet parameters on both global and per-field bases, and more. It supports Calender, Results, Pager, Tagcloud, Autocomplete and few other widgets.

Solrstrap - Query-Result interface for Solr written in JavaScript, HTML and CSS

  •    Javascript

Solrstrap is a Query-Result interface for Solr. Solrstrap is intended to be a starting point for those building web interfaces that talk to Solr, or a very lightweight admin tool for querying Solr in a Googleish fashion.This is because it does everything in Javascript, CSS and HTML on the client side. JSON is shot back from the server and interpeted by the web browser.

Banana for Solr - A Port of Kibana

  •    Javascript

The Banana project was forked from Kibana, and works with all kinds of time series (and non-time series) data stored in Apache Solr. It uses Kibana's powerful dashboard configuration capabilities, ports key panels to work with Solr, and provides significant additional capabilities, including new panels that leverage D3.js.

solr-scala-client - Solr Client for Scala

  •    Scala

The simple Apache Solr client for Scala. This is based on the SolrJ and provides optimal interface for Scala.Add the following dependency into your build.sbt to use solr-scala-client.

Blacklight - Discovery Interface for any Apache Solr

  •    Ruby

Blacklight is an open source OPAC (online public access catalog). It is ruby-on-rails based discovery interface (a.k.a. “next-generation catalog”) especially optimized for heterogeneous collections. It could be used as a library catalog, as a front end for a digital repository, or as a single-search interface to aggregate digital content that would otherwise be siloed. Blacklight uses Solr, an enterprise-scale index for its search engine.

ajax-solr - A JavaScript framework for creating user interfaces to Solr.

  •    Javascript

AJAX Solr is a JavaScript library for creating user interfaces to Apache Solr.

Sunburnt - Python interface to Solr

  •    Python

Sunburnt is a library to interface with a Solr instance from Python. It helps to query Solr in a Pythonic way, without having to understand Solr’s query syntax. It provides support to insert Python objects into a Solr index.

Pysolr - Python wrapper for Apache Solr

  •    Python

pysolr is a lightweight Python wrapper for Apache Solr. It provides an interface that queries the server and returns results based on the query.Or if you want to install directly from the repository: python setup.py install, or drop the pysolr.py file anywhere on your PYTHONPATH.

Go-Solr - A solr library written in Go

  •    Go

An Apache Solr library written in Go, Its functionality includes Select queries, Function Queries, Update queries (add/replace/delete), Faceting.

Thoth - Real-time Solr Monitor and Search Analysis Engine

  •    Java

Thoth is a real-time solr monitor and search analysis engine. It's a set of tools that can help you collect, visualize and leverage data coming from your solr search infrastructure.

Spyglass - Simple search results with Solr and EmberJS

  •    Javascript

Spyglass provides a simple search interface for Solr using EmberJS. Searchers that return a result from a given Solr url. Result Sets that automatically show the results returned by their searchers, and Facets which toggle search parameters.

rsolr - A Ruby client for Apache Solr

  •    Ruby

A simple, extensible Ruby client for Apache Solr.

Solr/Lucene on Azure

  •    Lucene

This project hosts Solr/Lucene in Windows Azure using multi-instance replication for index-serving and single-instance for index generation with a persistent index mounted in Azure storage. Typical scenarios could be a commercial and publisher sites that need to scale the traffic with increasing query volume and need to index maximum 16 TB of data and require couple of index updates per day.