mwparserfromhell - A Python parser for MediaWiki wikicode

  •        98

mwparserfromhell (the MediaWiki Parser from Hell) is a Python package that provides an easy-to-use and outrageously powerful parser for MediaWiki wikicode. It supports Python 2 and Python 3. Developed by Earwig with contributions from Σ, Legoktm, and others. Full documentation is available on ReadTheDocs. Development occurs on GitHub.

https://mwparserfromhell.readthedocs.io
https://github.com/earwig/mwparserfromhell

Tags
Implementation
License
Platform

   




Related Projects

MediaWiki - powers Wikipedia

  •    PHP

MediaWiki is powers Wikipedia.org. MediaWiki is designed to be run on a large server farm for a website that gets millions of hits per day. MediaWiki is an extremely powerful, scalable software and a feature-rich wiki implementation. Pages use MediaWiki's wikitext format, so that users without knowledge of XHTML or CSS can edit them easily.

Word2MediaWiki.NET

  •    

Do you hate trying to remember how to format text in Wikipedia? Do you have a bunch of Word documents that you'd like to convert to use in Wikipedia (or another site that uses the same wiki engine - aka "MediaWiki")? This .NET/VSTO add-in for Microsoft Word will allow you to...

apps-android-wikipedia - The official Wikipedia Android app

  •    Java

This repository contains the source code for the official Wikipedia Android app.

Semantic MediaWiki

  •    PHP

Semantic MediaWiki is an extension to the MediaWiki software (powering Wikipedia), which extends the Wiki with ideas from the Semantic Web. We focus on usability and tight integration. See the web site for further details.

Python Wikipedia Robot Framework

  •    Python

A framework for writing robots to crawl MediaWiki wikis like Wikipedia, with the goal of standardizing content or otherwise making minor modifications to the pages of the wiki including making interwiki links.


fact-extractor - Fact Extraction from Wikipedia Text

  •    Python

Wikipedia dumps are packaged as XML documents and contain text formatted according to the Mediawiki markup syntax, with templates to be transcluded. To obtain a raw text corpus, we use the WikiExtractor, integrated in a frozen version here. Pull requests not complying to these guidelines will be ignored.

SoloWiki editor

  •    Java

A stand-alone editor using Mediawiki markup language to generate HTML code. You can create and preview pages written using Mediawiki markup (i.e. Wikipedia pages) while off-line.

PHP mediawiki bot

  •    PHP

Use this to make automated changes to MediaWiki websites such as Wikipedia, Wikibooks, and Wiktionary. A useful bot and cron library providing everything you need to automatically correct or create articles. Runs on any platform with PHP.

parsimonious - The fastest pure-Python PEG parser I can muster

  •    Python

Parsimonious aims to be the fastest arbitrary-lookahead parser written in pure Python—and the most usable. It's based on parsing expression grammars (PEGs), which means you feed it a simplified sort of EBNF notation. Parsimonious was designed to undergird a MediaWiki parser that wouldn't take 5 seconds or a GB of RAM to do one page, but it's applicable to all sorts of languages. You can have forward references and even right recursion; it's all taken care of by the grammar compiler. The first rule is taken to be the default start symbol, but you can override that.

LicenseToKill

  •    CSharp

LicenseToKill is a tool made to help the images deletion tasks often performed on Wikipedia and Mediawiki-based sites.

WikiImporter for NodeXL

  •    

Import networks from MediaWikis directly into NodeXL for analysis and visualization.

Luasoacute;g

  •    Flex

A JavaScript framework and integrated development environment (IDE) for interacting with the MediaWiki API e.g. to create Wikipedia bots.

IpbWiki

  •    PHP

Integrates Invision Power Board forum software with Mediawiki (Wikipedia) wiki software

Txt2tags - Document generator: ONE source, MULTI targets

  •    Python

Txt2tags is a document generator. It reads a text file with minimal markup such as **bold** and //italic// and converts it to the formats like HTML, XHTML, SGML, DocBook (NEW), LaTeX, Lout, Man page, Creole (NEW), Wikipedia / MediaWiki, Google Code, Wiki, PmWiki (NEW), DokuWiki, MoinMoin, MagicPoint, PageMaker, AsciiDoc (NEW), ASCII Art (NEW), Plain text.

WikiFilter

  •    C

A wiki text parser and a web filter used for browsing a local copy of the Wikipedia and other wiki project dump files. Works with either Microsoft web server or Apache.

postcss - Transforming styles with JS plugins

  •    Javascript

PostCSS is a tool for transforming styles with JS plugins. These plugins can lint your CSS, support variables and mixins, transpile future CSS syntax, inline images, and more.PostCSS is used by industry leaders including Wikipedia, Twitter, Alibaba, and JetBrains. The Autoprefixer PostCSS plugin is one of the most popular CSS processors.

DBpedia - Wikipedia Data Extraction

  •    Scala

DBpedia has moved to GitHub: https://github.com/dbpedia/extraction-framework/wiki The mailing lists are still hosted by SourceForge. DBpedia is a community effort to extract structured information from Wikipedia and to make this information available on the Web. DBpedia allows you to ask sophisticated queries against Wikipedia and to link other datasets on the Web to Wikipedia data.

Wikipedia Concept Extractor

  •    Java

Extract a Term Concept Map from Wikipedia. It takes an export of the Wikipedia and imports it into a database. The extracted term concept map can then serve as a basis for semantically enhancing applications, with the depth and bredth of the Wikipedia.

Wikipedia browser for Microsoft Surface

  •    WPF

Wikipedia browser for Microsoft Surface allows to search Wikipedia and Wikimedia Commons on Microsoft Surface device.

wikipedia-ios - The official Wikipedia iOS app.

  •    Objective-C

The official Wikipedia iOS client.The app is primarily being developed by the Wikimedia Foundation's Mobile Apps team. This README provides high-level guidelines for getting started with the project. If you have any questions, comments, or issues, the easiest way to talk to us is joining the #wikimedia-mobile channel on the Freenode IRC server during Eastern and Pacific business hours. We'll also gladly accept any tickets filed against the project in Phabricator.