Free Open Source Code Search Engines
There are a couple of sites that index open source code and let you search it. Google recently announced that it removed code search support from Google Code. This article provides pointers to code search engine sites.
GitHub is the most widely used open source repository host. It lets you search code across its public repositories. https://github.com/
Ohloh Code describes itself as the world's largest, most comprehensive free code search engine. Koders has merged with Ohloh, making it a single resource for both code and analysis. http://code.ohloh.net/
Antepedia bills itself as the largest knowledge base of open source components, with over 2 million current projects. It continuously aggregates data from various directories, including Google Code, Apache, GitHub, Maven, and many more. http://www.antepedia.com/
searchcode is a free source code and documentation search engine. API documentation, code snippets and open source (free software) repositories are indexed and searchable. http://searchcode.com/
Krugle supports searching open source code in all programming languages. It has advanced search capabilities for searching by function name, class name, project name, and so on. http://opensearch.krugle.org/
GrepCode can be used to find Java projects, Java types (classes, interfaces, enums, annotations), and Java methods. In addition, GrepCode allows users to do stack trace searches and find usages of classes and methods. http://grepcode.com/
Codase is a source code search company with advanced source code understanding and XML index/search technologies. It hosts a huge amount of open source code, which it says gives much better coverage. With Codase, one can search functions, classes, strings, constants, macros, comments and other programming language constructs. http://www.codase.com/
Debian Code Search (DCS) is a search engine for source code. It searches all the open source projects that are included in the Debian archive (the "main" distribution only, not non-free or contrib). Currently, that includes about 18,000 packages with 140 GiB of source code. http://codesearch.debian.net/
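Most of these sites take the search terms as a URL query string, so a search can be bookmarked or generated from a script. Below is a minimal sketch using GitHub's web search as the example. The helper name github_code_search_url is hypothetical, the language: qualifier and type=code parameter reflect GitHub's search page at the time of writing, and the site may ask you to sign in before showing code results.

```python
from urllib.parse import urlencode


def github_code_search_url(terms, language=None):
    """Build a GitHub web search URL restricted to code results.

    Hypothetical helper for illustration; it only builds the URL
    and does not contact GitHub.
    """
    query = terms
    if language:
        # GitHub search qualifier that narrows results to one language.
        query += f" language:{language}"
    # urlencode escapes spaces and the colon so the query is URL-safe.
    return "https://github.com/search?" + urlencode({"q": query, "type": "code"})


print(github_code_search_url("binary search", language="java"))
```

Opening the printed URL in a browser runs the search. The other sites listed above use their own URL formats, so check each site before adapting this.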