Info Extraction & Geo-inferencing Projects
keywords: geoparsing, geocoding, entity extraction, NER, REGEX
Xponents - Geotagging APIs to work with gazetteers and multilingual extraction of geography, date/time, patterns. A pre-built Docker microservice is available with a complete operational gazetteer and geotagger that accepts text. Example applications demonstrate the use of Tika for rendering inputs and GISCore for output formats to demonstrate end-to-end solutions. The geotagger features the Solr TextTagger.
Xponents is primarily a Java-based library and web service, with various Python client entry points. 5,000+ downloads on Docker Hub, so far!.
Last updated 2026-March.
Inactive Modules
This list of projects and experiments is no longer active, but worth listing here as part of the OpenSextant family. Last update is noted in each heading.
- HOWLER: Ontology translation work. Last update 2016.
- HOWLER - Translate between simple English text and OWL ontologies
- HOWLER Kanban - HOWLER combined with Kanban (based on Wekan )
- GISCore: GIS support. Last update 2019.
-
SolrTextTagger - (Retired) A text tagger based on Lucene/Solr. Lat updated 2023. NOTE: As of Solr 7.4 this tagger plugin was migrated to Apache Solr as a formal request handler. Xponents SDK uses the Solr TextTagger still.
- OpenSextant v1: Original Gangstah. Last updated 2017.
- OpenSextantToolbox - (Retired) A geotagger and entity extractor employing GATE.
- opensextant - (Retired) The original OpenSextant project.
NOTE: Xponents is the currently maintained geotagger solution that took over the main functionality. - Gazetteer - (Retired) Pipeline project to render world-wide “geo names” data into gazetteers used by these projects.
NOTE: This Gazetteer required Pentaho 6 or earlier and was stuck to Oracle JDK 8.
Xponents internal gazetteer is current and yields a flexible SQLite intermediate and complete worldwide gazetteer