Column database vs OLAP
OLAP (Online Analytical Processing), reporting, and data mining tasks are usually handled by business intelligence (BI) products. They perform powerful Extraction, Transformation and Loading (ETL) of the data and provide various reports, typically using a relational database as the back end. How could they generate better reports? Would a column database do a better job?
Business intelligence and data mining products have been available for years. They are proven and use different techniques to solve the problem. Basically, they aggregate the data and store it in such a way that multidimensional (MDX) queries can be run against it. The main drawback is that this pre-aggregation requires more disk space and processing power. To keep scheduled reports current, the aggregation task has to be rerun at frequent, scheduled intervals. On the plus side, they have a good GUI, where dragging and dropping fields produces good charts.
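To make the cost of pre-aggregation concrete, here is a minimal sketch in Python. It does not reflect how any particular BI product works internally; the `facts` table, the `DIMENSIONS` names, and the `build_cube` helper are all made up for illustration. It precomputes the sum of a measure for every combination of dimensions, which is roughly what an aggregate store needs in order to answer multidimensional queries quickly.

```python
from itertools import combinations

# Hypothetical fact table: (region, product, amount). Illustration only.
facts = [
    ("east", "widget", 120),
    ("west", "widget", 80),
    ("east", "gadget", 200),
]
DIMENSIONS = ("region", "product")


def build_cube(facts):
    """Precompute the sum of amount for every subset of the dimensions."""
    cube = {}
    for size in range(len(DIMENSIONS) + 1):
        for dims in combinations(range(len(DIMENSIONS)), size):
            names = tuple(DIMENSIONS[i] for i in dims)
            for fact in facts:
                values = tuple(fact[i] for i in dims)
                key = (names, values)
                # The measure (amount) is the last field of each fact.
                cube[key] = cube.get(key, 0) + fact[-1]
    return cube


cube = build_cube(facts)

# A report query is now a simple lookup instead of a scan.
east_total = cube[(("region",), ("east",))]
grand_total = cube[((), ())]
```

Even in this toy case, three facts turn into eight stored aggregates, and the whole cube has to be rebuilt or updated whenever new facts arrive. That is the extra disk space and scheduled processing described above.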
A column database stores every column separately. This lets it run analytics without pre-aggregating the data, so it is capable of producing real-time reports. A column database is also more flexible and dynamic. On the other hand, developers need to write some code to produce good visualizations and reports, whereas with a BI tool this is automatic and requires little programming skill or SQL knowledge.
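The idea can be sketched in a few lines of Python. This is only a toy model of column-wise storage, not the storage format of any real column database; the `rows` data and the `to_columns` and `sum_by` helpers are hypothetical. Each column is kept as its own list, and a report is computed at query time by reading only the columns it needs.

```python
# Hypothetical sales records, used only for illustration.
rows = [
    {"region": "east", "product": "widget", "amount": 120},
    {"region": "west", "product": "widget", "amount": 80},
    {"region": "east", "product": "gadget", "amount": 200},
]


def to_columns(rows):
    """Pivot a list of row dicts into one list per column."""
    columns = {}
    for row in rows:
        for name, value in row.items():
            columns.setdefault(name, []).append(value)
    return columns


def sum_by(columns, key, measure):
    """Aggregate at query time, touching only the two columns involved."""
    totals = {}
    for k, v in zip(columns[key], columns[measure]):
        totals[k] = totals.get(k, 0) + v
    return totals


columns = to_columns(rows)

# No precomputed aggregates: the "product" column is never read here.
totals_by_region = sum_by(columns, "region", "amount")
```

Because nothing is precomputed, newly appended values show up in the next query, which is what makes real-time reports possible. The trade-off is that the query and any charting on top of it have to be written by a developer.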
The best solution would be to combine the two. An OLAP server with a column database as its back end would give the benefits of both worlds.
See also: Open source column-oriented database
http://www.findbestopensource.com/tagged/data-mining
http://www.findbestopensource.com/tagged/business-intelligence