Apache POI - Java API To Access Microsoft Document File Formats

  •        0

APIs for manipulating various file formats based upon Open Office XML (ECMA-376) and Microsoft's OLE 2 Compound Document formats using pure Java. Apache POI is your Java Excel, Word and PowerPoint solution. We have a complete API for porting other OOXML and OLE 2 Compound Document formats and welcome others to participate.




Related Projects

Tikka - A content analysis toolkit

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

docx4j - JAXB-based Java library for Word docx, Powerpoint pptx, and Excel xlsx files

docx4j is a library which helps you to work with the Office OpenXML file format as used in docx documents, pptx presentations, and xlsx spreadsheets.

documents4j - Java library for converting documents into another document format

documents4j is a Java library for converting documents into another document format. This is achieved by delegating the conversion to any native application which understands the conversion of the given file into the desired target format.

doorstop - Requirements management using version control.

- talks: [GRDevDay](https://speakerdeck.com/jacebrowning/doorstop-requirements-management-using-python-and-version-control), [BarCamp](https://speakerdeck.com/jacebrowning/strip-searched-a-rough-introduction-to-requirements-management)- sample: [Generated HTML](http://doorstop.info/reqs/index.html)- documentation: [API](http://doorstop.info/docs/index.html), [Demo](http://nbviewer.ipython.org/gist/jacebrowning/9754157)Getting Started===============Requirements------------* Python 3.3+* A version

JODConverter - Automates document conversions using OpenOffice

JODConverter automates conversions between office document formats using OpenOffice.org or LibreOffice. Supported formats include OpenDocument, PDF, RTF, HTML, Word, Excel, PowerPoint, and Flash. It can be used as a Java library, a command line tool, or a web application.

delta_attack - extract text from MS Office document with Apache POI

extract text from MS Office document with Apache POI

Show SharePoint Version in Office Documents

The “Show SharePoint Version in Office Documents” solution can be used to extract version information stored in SharePoint document libraries and display it within Microsoft Office documents

Feng Office - A Collaboration Platform and online office

Feng Office is a web based collaboration platform. It helps to manage your projects and business services, Collaborate with your team and your customers, Organize and share documents and files. Feng Office allows businesses to manage project tasks, billing, documents, communication with co-workers, customers and vendors, schedule meetings and events, and share every kind of electronic information.

Joeffice - Office Written in Java

Joeffice is the first open source office suite written in Java. Its features include Docking system. Visualize several documents in the same window, It can have a lot of documents open at the same time and easily switch from one to another. It works with Microsoft document formats (docx, xslx, pptx). It can get data through web services (RMI, SOAP, REST).

LibreOffice - The Document foundation

LibreOffice is the free power-packed Open Source personal productivity suite for Windows, Macintosh and Linux. LibreOffice is the perfect choice for home users, businesses, government and other organizations. It's native file format is the ISO standardized ODF (Open Document Format), but LibreOffice can open and save Microsoft Word, PowerPoint and Excel files, as well as many other formats, bringing you the widest-available compatibility with other products.


Office Document Convertor (ODC) is an online convertor for office document which runs as a web service. Its aim is to provide the facility of converting almost all office documents into image which make office documents viewable even without any office suite software installed on your machines.

MOSS Document Converter

Microsoft Office SharePoint Server (MOSS) Document Converters with Word & Excel 2007 on the server. Converting Office 2003 file-types (doc, xls) to pdf and xps. Could easily be altered for work for docx and xlsx file-types. Desktop Automation on the Server: Previously, us...

Hydra - Distributed processing framework for search solutions

Hydra is designed to give the search solution the tools necessary to modify the data that is to be indexed in an efficient and flexible way. This is done by providing a scalable and efficient pipeline which the documents will have to pass through before being indexed into the search engine. Architecturally Hydra sits in between the search engine and the source integration.

OpenPipe - Document Pipeline

OpenPipe is an open source scalable platform for manipulating a stream of documents. A pipeline is an ordered set of steps / operations performed on a document to convert from its raw form to something ready to be put into the index. The operations performed on documents include language detection, field manipulation, POS tagging, entity extraction or submitting the document to a search engine.

WebSync - Document editing tool similar to Google Drive or Microsoft Skydrive

WebSync is a document editing tool similar to Google Drive or Microsoft Skydrive. A limitation of Google Drive is not having a note taking application and a reason WebSync was created. WebSync makes up for this by providing a OneNote-esqe Notebook file type. It is a self hostable document editing tool. It has real time collaborative editing built in.


The KineSis project allows controlling presentations using Microsoft Kinect. The user have the possibility of opening documents (Microsoft Office documents, images and plain text), and based on gestures, to control the presentation (move to next slide/page, scroll, zoom). Anot...


Wrapper around the open xml office package. You can easily generate xlsx documents based on a template xlsx document and reuse parts from that document, if you mark them as named ranges (i.e."names"). Requirement: .Net 3.5 or later. Microsoft Office does not need to be installed!

Simple OOXML

Simple OOXML makes the creation of Open Office XML documents easier for developers. Modify or create any .docx or .xlsx document without Microsoft Word or Microsoft Excel. Uses the Open Office SDK v 2.0.

Virtual Office - Document Mangament

Virtual Office - Document Mangament System is to computerize all the hard copies of documents in any office and to automate the document management process. If you are an Asp.Net/C# developer you welcome to contribute.

Open-XML-SDK - Open XML SDK by Microsoft Open Technologies, Inc.

The Open XML SDK provides open-source libraries for working with Open XML Documents (DOCX, XLSX, and PPTX). It supports scenarios such as: - High-performance generation of word-processing documents, spreadsheets, and presentations - Document modification, such as removing tracked revisions or removing unacceptable content from documents - Data and content querying and extraction, such as transformation from DOCX to HTML, or extraction of data from spreadsheets