Index

A B C D E F G H I J K L M N O P R S T U V W X Y 
All Classes and Interfaces|All Packages|Constant Field Values|Serialized Form

A

ABBREV_MAX_LEN - Static variable in class org.opensextant.util.TextUtils
 
ABBREV_TYPE - Static variable in class org.opensextant.util.GeonamesUtility
 
AbstractFlexPat - Class in org.opensextant.extractors.flexpat
FlexPat Extractor -- given a set of pattern families, extract, filter and normalize matches.
AbstractFlexPat() - Constructor for class org.opensextant.extractors.flexpat.AbstractFlexPat
 
AbstractFlexPat(boolean) - Constructor for class org.opensextant.extractors.flexpat.AbstractFlexPat
 
AbstractFormatter - Class in org.opensextant.output
Abstract class encapsulating basic results formatter functionality.
AbstractFormatter() - Constructor for class org.opensextant.output.AbstractFormatter
 
AbstractGenericFormatter - Class in org.opensextant.output
Abstract class encapsulating basic results formatter functionality without prescribing schema
AbstractGenericFormatter() - Constructor for class org.opensextant.output.AbstractGenericFormatter
 
accept(File, String) - Method in class org.opensextant.util.AnyFilenameFilter
FilenameFilter implementation for XML files
add(Collection<Annotation>) - Method in interface org.opensextant.annotations.DeepEyeStore
add a list of entity annotations.
add(Annotation) - Method in interface org.opensextant.annotations.DeepEyeStore
add a single Annotation.
add(Record) - Method in interface org.opensextant.annotations.DeepEyeStore
Add new record.
add_trace(String) - Method in class org.opensextant.extractors.flexpat.TextMatchResult
 
addAdditionalAttributes(Feature, Map<String, Object>) - Method in class org.opensextant.output.GISDataModel
If the caller has additional data to attach to records, allow them to add fields to schema at runtime and map their data to keys on GeocodingResult Similarly, you could have Geocoding row-level attributes unique to the geocoding whereas attrs on GeocodingResult are global for all geocodings in that result set.
addAlias(String) - Method in class org.opensextant.data.Country
Country is also known as some list of aliases
addAttribute(String, Object) - Method in class org.opensextant.annotations.DeepEyeData
 
addAttribute(String, Object) - Method in class org.opensextant.extraction.ExtractionResult
Add some piece of amplifying metadata about the record which may be carried through to output format in some way
addBytes(long) - Method in class org.opensextant.extraction.ExtractionMetrics
 
addCollectionTag(String) - Method in class org.opensextant.annotations.Record
"tags" are meant to be used at a data set or collection level.
addCollectionTags(Collection<String>) - Method in class org.opensextant.annotations.Record
Parses the given "a;b;c;..." format of tags into a Set.
addCollectionTags(Map<String, Object>) - Method in class org.opensextant.annotations.Record
 
addColumn(Map<String, String>, String, int) - Method in class org.opensextant.output.CSVFormatter
 
addColumn(Map<String, String>, String, int) - Method in class org.opensextant.output.CSVGenericFormatter
 
addColumn(Map<String, String>, String, String) - Method in class org.opensextant.output.CSVFormatter
Add a column of data to output; Field is validated ; value is not added if null
addColumn(Map<String, String>, String, String) - Method in class org.opensextant.output.CSVGenericFormatter
Add a column of data to output; Field is validated ; value is not added if null
addColumn(Feature, SimpleField, double) - Method in class org.opensextant.output.GISDataModel
Add a column of data to output; Field is validated.
addColumn(Feature, SimpleField, int) - Method in class org.opensextant.output.GISDataModel
Add a column of data to output; Field is validated.
addColumn(Feature, SimpleField, Object) - Method in class org.opensextant.output.GISDataModel
Add a column of data to output; Field is validated ; value is not added if null.
addConfidence(Feature, int) - Method in class org.opensextant.output.GISDataModel
Adds the confidence.
addContext(Feature, TextMatch) - Method in class org.opensextant.output.GISDataModel
Adds the context.
addDateField(String) - Static method in class org.opensextant.output.OpenSextantSchema
Cache an arbitrary date field in schema
addExtractor(Extractor) - Method in class org.opensextant.processing.XtractorGroup
 
addField(String) - Method in class org.opensextant.output.AbstractGenericFormatter
Adds the field.
addField(String) - Method in class org.opensextant.output.CSVFormatter
 
addField(String) - Method in class org.opensextant.output.GISDataFormatter
 
addField(String) - Method in class org.opensextant.output.GISDataModel
Add a field key to the field order; Caller must also be responsible for ensuring field is valid and exists in Schema.
addField(String) - Method in interface org.opensextant.output.ResultsFormatter
 
addFilePaths(Feature, String, String) - Method in class org.opensextant.output.GISDataModel
Adds the file paths.
addFormatter(ResultsFormatter) - Method in class org.opensextant.processing.XtractorGroup
 
addLang(String, String) - Method in class org.opensextant.util.GeonamesUtility
 
addLanguage(String) - Method in class org.opensextant.data.Country
When adding languages, please add the primary language FIRST.
addLanguage(Language) - Static method in class org.opensextant.util.TextUtils
 
addLanguage(Language, boolean) - Static method in class org.opensextant.util.TextUtils
Extend the basic language dictionary.
addLatLon(Feature, Geocoding) - Method in class org.opensextant.output.GISDataModel
Adds the lat lon.
addMatchMethod(Feature, String) - Method in class org.opensextant.output.GISDataModel
Allows caller to add a method or pattern id of sorts to denote how match was derived.
addMatchMethod(Feature, TextMatch) - Method in class org.opensextant.output.GISDataModel
Adds the match method.
addMatchText(Feature, TextMatch) - Method in class org.opensextant.output.GISDataModel
Adds the match text.
addMention(String, String) - Method in class org.opensextant.data.social.Tweet
If adding mentions one at a time, then only mention IDS map is used.
addNearByPlace(Place) - Method in class org.opensextant.extractors.xcoord.GeocoordMatch
Adds the near by place.
addOffset(int) - Method in class org.opensextant.annotations.Annotation
 
addOffsetAttribute() - Method in class org.opensextant.annotations.Annotation
add annot.offset = x or annot.attrs.offsets = "x1;x2;x3"...
addOffsets(Feature, TextMatch) - Method in class org.opensextant.output.GISDataModel
Adds the offsets.
addOffsetTo(Map<String, Object>) - Method in class org.opensextant.annotations.Annotation
Add offset information (offset, len, offsets, etc ) into the representation of this annotation.
addOtherInterpretation(GeocoordMatch) - Method in class org.opensextant.extractors.xcoord.GeocoordMatch
The current instance is the main match.
addOutputFormat(String) - Method in class org.opensextant.processing.Parameters
Processing will support multiple output formats
addPlaceData(Feature, Geocoding) - Method in class org.opensextant.output.GISDataModel
Adds the place data.
addPrecision(Feature, Geocoding) - Method in class org.opensextant.output.GISDataModel
Adds the precision.
addRegion(String) - Method in class org.opensextant.data.Country
Country is also known as some list of aliases
addState(String) - Method in class org.opensextant.annotations.Record
 
addState(String, int) - Method in class org.opensextant.annotations.Record
 
addStates(Collection<String>) - Method in class org.opensextant.annotations.Record
 
addStates(Map<String, Object>) - Method in class org.opensextant.annotations.Record
 
addTag(String) - Method in class org.opensextant.data.social.Tweet
add any tag you like.
addTag(String) - Method in class org.opensextant.data.Taxon
 
addTags(List<String>) - Method in class org.opensextant.data.Taxon
 
addTerm(String) - Method in class org.opensextant.data.Taxon
 
addTerms(Collection<String>) - Method in class org.opensextant.data.Taxon
 
addTerritory(Country) - Method in class org.opensextant.data.Country
 
addTextField(String) - Static method in class org.opensextant.output.OpenSextantSchema
Cache an arbitrary text field in schema
addTime(long) - Method in class org.opensextant.extraction.ExtractionMetrics
Add just a time delta.
addTime(long, int) - Method in class org.opensextant.extraction.ExtractionMetrics
Add just a time delta.
addTimeSince(long) - Method in class org.opensextant.extraction.ExtractionMetrics
Add time delta using NOW - time.
addTimezone(String, double) - Method in class org.opensextant.data.Country
Add a timezone and its offset.
addTimezone(Country.TZ) - Method in class org.opensextant.data.Country
Refactor -- use JodaTime and the TZDB more formally.
addURL(String) - Method in class org.opensextant.data.social.Tweet
 
admin1 - Variable in class org.opensextant.data.Place
For normalization purposes tracking the Province may be helpful.
admin1Name - Variable in class org.opensextant.data.Place
 
admin2 - Variable in class org.opensextant.data.Place
 
admin2Name - Variable in class org.opensextant.data.Place
 
adminName - Variable in class org.opensextant.data.Place
 
ALL_PATTERNS - Static variable in class org.opensextant.extractors.xcoord.XConstants
 
ALL_PATTERNS - Static variable in class org.opensextant.extractors.xtemporal.XTConstants
 
allowNonGeo - Variable in class org.opensextant.output.GISDataFormatter
 
alternativeCJKLangID(String) - Static method in class org.opensextant.extractors.langid.LangDetect
detecting if script of text is Japanese, Korean or Chinese.
alternativeLangID(String) - Static method in class org.opensextant.extractors.langid.LangDetect
Look at raw bytes/characters to see which Unicode block they fall into.
Annotation - Class in org.opensextant.annotations
An annotation is at least a typed name/value pair created by something.
Annotation() - Constructor for class org.opensextant.annotations.Annotation
 
Annotation(String) - Constructor for class org.opensextant.annotations.Annotation
 
Annotation(String, String) - Constructor for class org.opensextant.annotations.Annotation
 
Annotation(String, String, String, String, String) - Constructor for class org.opensextant.annotations.Annotation
 
AnnotationHelper - Class in org.opensextant.annotations
Basis for this optional helper class was three or four different projects using DeepEye as a model for persisting annotations from the typical Named Entity and Geo/Time extraction work.
AnnotationHelper() - Constructor for class org.opensextant.annotations.AnnotationHelper
 
AnyFilenameFilter - Class in org.opensextant.util
 
AnyFilenameFilter(String) - Constructor for class org.opensextant.util.AnyFilenameFilter
 
APP_MIMETYPE - Static variable in class org.opensextant.util.FileUtility
 
applyGenericRules() - Method in class org.opensextant.data.social.Tweet
 
approximateLongitudeForUTCOffset(int) - Static method in class org.opensextant.util.GeonamesUtility
This helps get the general area +/-5 degrees for a given UTC offset.
arabicLang - Static variable in class org.opensextant.util.TextUtils
 
asMap(JsonArray) - Static method in class org.opensextant.annotations.DeepEyeData
Convert an array to a trivial map, [i1, i2, i3,...] ==> { i1:"1", i2:"1", ...} UNUSED.
asPlace() - Method in class org.opensextant.extractors.xcoord.GeocoordMatch
Create a Place version of this coordinate -- that is, once we've found the coordinate match if the match data is no longer needed we can produce a geodetic Place from the TextMatch.
ATTR_AUTH_LANG - Static variable in class org.opensextant.data.social.Tweet
 
ATTR_AUTH_NAME - Static variable in class org.opensextant.data.social.Tweet
 
ATTR_DATE - Static variable in class org.opensextant.data.social.Tweet
 
ATTR_DESC - Static variable in class org.opensextant.data.social.Tweet
 
ATTR_EPOCH - Static variable in class org.opensextant.data.social.Tweet
 
ATTR_GENDER - Static variable in class org.opensextant.data.social.Tweet
 
ATTR_ID - Static variable in class org.opensextant.data.social.Tweet
 
ATTR_KLOUT - Static variable in class org.opensextant.data.social.Tweet
 
ATTR_LANG - Static variable in class org.opensextant.data.social.Tweet
 
ATTR_MENTIONS - Static variable in class org.opensextant.data.social.Tweet
 
ATTR_PROFILE_ID - Static variable in class org.opensextant.data.social.Tweet
 
ATTR_RETWEET - Static variable in class org.opensextant.data.social.Tweet
 
ATTR_RETWEET_ID - Static variable in class org.opensextant.data.social.Tweet
 
ATTR_SCREEN_NAME - Static variable in class org.opensextant.data.social.Tweet
 
ATTR_TAGS - Static variable in class org.opensextant.data.social.Tweet
 
ATTR_TEXT - Static variable in class org.opensextant.data.social.Tweet
 
ATTR_TZ - Static variable in class org.opensextant.data.social.Tweet
 
ATTR_URLS - Static variable in class org.opensextant.data.social.Tweet
 
ATTR_USES_GPS - Static variable in class org.opensextant.data.social.Tweet
 
ATTR_UTC_OFFSET - Static variable in class org.opensextant.data.social.Tweet
 
attributes - Variable in class org.opensextant.extraction.ExtractionResult
 
attrs - Variable in class org.opensextant.annotations.DeepEyeData
 
AUD_MIMETYPE - Static variable in class org.opensextant.util.FileUtility
 
authorCC - Variable in class org.opensextant.data.social.Message
raw country code
authorDesc - Variable in class org.opensextant.data.social.Tweet
 
authorGender - Variable in class org.opensextant.data.social.Tweet
M=male, F=female, T=transitional, transgender? Other? Gender is a string, could be an enum, but its no longer two values, I bet.
authorGeo - Variable in class org.opensextant.data.social.Message
Author's profile location -- If country and actual location is set, use Geocoding object, or opensextant.Place to capture the full metadata.
authorID - Variable in class org.opensextant.data.social.Message
Author ID is screen_name, user_id, user.name, etc.
authorLatLonText - Variable in class org.opensextant.data.social.Message
raw XY val, if present on author profile.
authorLocation - Variable in class org.opensextant.data.social.Message
raw location string
authorName - Variable in class org.opensextant.data.social.Message
A plain language Name, display name, native language name for the author
authorProfileID - Variable in class org.opensextant.data.social.Message
the numeric ID for a user/author profile.

B

b2hex(byte[]) - Static method in class org.opensextant.util.TextUtils
 
bahasaLang - Static variable in class org.opensextant.util.TextUtils
 
buffer - Variable in class org.opensextant.data.TextInput
 
buildRow(Map<String, String>, TextMatch) - Method in class org.opensextant.output.CSVFormatter
Pull in data from match into the output schema (map)
buildRows(int, Geocoding, TextMatch, Map<String, Object>, ExtractionResult) - Method in class org.opensextant.output.GISDataModel
Builds a GISCore feature array (rows) from a given array of TextMatches; Enrich the features with record-level attributes (columns).
buildSchema() - Method in class org.opensextant.output.CSVFormatter
Create a schema instance with the fields properly typed and ordered
buildSchema() - Method in class org.opensextant.output.CSVGenericFormatter
Create a schema instance with the fields properly typed and ordered
buildSchema(String) - Method in class org.opensextant.output.GISDataModel
Create a schema instance with the fields properly typed and ordered.

C

cacheAnnotation(String, String, String, int, String) - Method in class org.opensextant.annotations.AnnotationHelper
Cache entity annotations, accumulating unique offsets for a name/value pair.
cacheAnnotation(Annotation) - Method in class org.opensextant.annotations.AnnotationHelper
Cache annotation.
cacheAnnotation(Annotation, int) - Method in class org.opensextant.annotations.AnnotationHelper
Cache entity annotation - in Memory; Note, the actual ID or key in database is usually composed of name+value+contrib.
cacheAnnotation(Annotation, String) - Method in class org.opensextant.annotations.AnnotationHelper
Cache an annotation.
cacheTaxonAnnotation(String, Taxon, String, int, String) - Method in class org.opensextant.annotations.AnnotationHelper
Cache taxon entity annotation.
cal - Variable in class org.opensextant.extractors.xcoord.MGRSFilter
The cal.
canAdd(String) - Method in class org.opensextant.output.CSVFormatter
 
canAdd(String) - Method in class org.opensextant.output.CSVGenericFormatter
 
canAdd(SimpleField) - Method in class org.opensextant.output.GISDataModel
Can add.
CASE_LOWER - Static variable in class org.opensextant.util.TextUtils
 
CASE_UPPER - Static variable in class org.opensextant.util.TextUtils
 
catalog - Variable in class org.opensextant.data.Taxon
Catalog, for example "fruit"
CC_FIPS - Variable in class org.opensextant.data.Country
FIPS 10-4 2-character country code
CC_ISO2 - Variable in class org.opensextant.data.Country
ISO 2-character country code
CC_ISO3 - Variable in class org.opensextant.data.Country
ISO 3-character country code
cce_family - Variable in class org.opensextant.extractors.xcoord.GeocoordPattern
Only as defined in your configured patterns, e.g., "DM", "DMS"; However this set of values should align with XConstants enumerations.
cce_family_id - Variable in class org.opensextant.extractors.xcoord.GeocoordMatch
The cce family id.
cce_family_id - Variable in class org.opensextant.extractors.xcoord.GeocoordPattern
XConstants value for the family
CCE_family_state - Variable in class org.opensextant.extractors.xcoord.PatternManager
 
cce_variant - Variable in class org.opensextant.extractors.xcoord.GeocoordMatch
The cce variant.
cce_variant - Variable in class org.opensextant.extractors.xcoord.GeocoordPattern
Only as defined in your configured patterns, e.g., the "01" in "DM-01"
checkCase(String, int) - Static method in class org.opensextant.util.TextUtils
detects if string alpha chars are purely lower case.
checkOverwrite(File) - Method in class org.opensextant.output.AbstractFormatter
uniform helper for overwrite check.
checkOverwrite(File) - Method in class org.opensextant.output.AbstractGenericFormatter
uniform helper for overwrite check.
chineseLang - Static variable in class org.opensextant.util.TextUtils
 
chineseTradLang - Static variable in class org.opensextant.util.TextUtils
 
clean_input - Variable in class org.opensextant.processing.Parameters
 
cleanup() - Method in interface org.opensextant.extraction.Extractor
Resource management.
cleanup() - Method in class org.opensextant.extractors.flexpat.AbstractFlexPat
Extractor interface: extractors are responsible for cleaning up after themselves.
cleanupAll() - Method in class org.opensextant.processing.XtractorGroup
Use only if you intend to shutdown.
close() - Method in class org.opensextant.output.AbstractFormatter
 
close() - Method in class org.opensextant.output.CSVFormatter
 
close() - Method in class org.opensextant.output.GISDataFormatter
 
CLOSE_CHARS - Static variable in class org.opensextant.extraction.MatcherUtils
 
closeOutputStreams() - Method in class org.opensextant.output.AbstractGenericFormatter
Close output streams.
closeOutputStreams() - Method in class org.opensextant.output.CSVGenericFormatter
 
code - Variable in enum org.opensextant.extractors.xtemporal.DateMatch.TimeResolution
 
CODE_TYPE - Static variable in class org.opensextant.util.GeonamesUtility
 
COMMENT_CHAR - Static variable in class org.opensextant.util.FileUtility
Char used in config files, dict files.
compareTo(Place) - Method in class org.opensextant.data.Place
With multiple data sources there is no standard way of saying this place == that place.
compareTo(TextMatch) - Method in class org.opensextant.extraction.TextMatch
this match, A compared to B Order: A B then A > B Order: B A then A < B Order: same spans then A == B
compareTo(LangID) - Method in class org.opensextant.extractors.langid.LangID
If THIS langID score is less than argument, return -1 If THIS langID score is greater than argument, return 1
compress(String) - Static method in class org.opensextant.util.TextUtils
Compress bytes from a Unicode string.
compress(String, String) - Static method in class org.opensextant.util.TextUtils
 
CONFIDENCE - Static variable in class org.opensextant.output.OpenSextantSchema
confidence 0.000 to 1.000 suggests our confidence that we code the MATCH TEXT to the right LAT/LON this is a string for now to keep the actual sig-figs accurate.
ConfigException - Exception in org.opensextant
Generic OpenSextant Configuration Exception
ConfigException(String) - Constructor for exception org.opensextant.ConfigException
 
ConfigException(String, Exception) - Constructor for exception org.opensextant.ConfigException
 
configure() - Method in interface org.opensextant.extraction.Extractor
Configure an Extractor using defaults for that extractor.
configure() - Method in class org.opensextant.extractors.flexpat.AbstractFlexPat
Configures whatever default patterns file is named.
configure(InputStream, String) - Method in class org.opensextant.extractors.flexpat.AbstractFlexPat
 
configure(String) - Method in interface org.opensextant.extraction.Extractor
Configure an Extractor using a config file named by a path.
configure(String) - Method in class org.opensextant.extractors.flexpat.AbstractFlexPat
Configure using a particular pattern file.
configure(URL) - Method in interface org.opensextant.extraction.Extractor
Configure an Extractor using a config file named by a URL.
configure(URL) - Method in class org.opensextant.extractors.flexpat.AbstractFlexPat
Configure using a URL pointer to the pattern file.
connect() - Method in interface org.opensextant.annotations.DeepEyeStore
 
contains(int) - Method in class org.opensextant.extraction.TextEntity
Assess if an offset is within this span
containsDSTOffset(double) - Method in class org.opensextant.data.Country
 
containsTimezone(String) - Method in class org.opensextant.data.Country
 
containsUTCOffset(double) - Method in class org.opensextant.data.Country
Test if this Country contains the UTC offset.
content - Variable in class org.opensextant.annotations.Record
 
CONTEXT - Static variable in class org.opensextant.output.OpenSextantSchema
A text window around the MATCH TEXT delineated by START/END offsets.
CONTEXT_FILTERS_ON - Static variable in class org.opensextant.extractors.xcoord.XConstants
RUNTIME FLAGS: filter out coordinate matches that appear embedded in other text, e.g., ABC45.44,77.1W
contrib - Variable in class org.opensextant.annotations.Annotation
 
COORD_FLD - Static variable in class org.opensextant.data.social.Tweet
 
coord_text - Variable in class org.opensextant.extractors.xcoord.GeocoordMatch
Just the coordinate text normalized.
COORDINATE_SYMBOLS - Static variable in class org.opensextant.extractors.xcoord.DMSOrdinate
 
copy(TextEntity) - Method in class org.opensextant.extraction.TextEntity
 
copy(TextMatch) - Method in class org.opensextant.extraction.TextMatch
 
copyMetadata(GeocoordMatch) - Method in class org.opensextant.extractors.xcoord.GeocoordMatch
Copy metadata.
copyTo(Place) - Method in class org.opensextant.data.Place
Copy the basic gazetteer metadata to
count_DD_digits(String) - Static method in class org.opensextant.extractors.xcoord.PrecisionScales
Count the number of decimal places in a lat or lon text string.
count_digits(String) - Static method in class org.opensextant.util.TextUtils
Counts all digits in text.
count_DMS_digits(String) - Static method in class org.opensextant.extractors.xcoord.PrecisionScales
Counts all digits in latitude.
count_ws(String) - Static method in class org.opensextant.util.TextUtils
Counts all whitespace in text.
countASCIIChars(byte[]) - Static method in class org.opensextant.util.TextUtils
count the number of ASCII bytes
countCJKChars(char[]) - Static method in class org.opensextant.util.TextUtils
Counts the CJK characters in buffer, buf chars Inspiration: http://stackoverflow .com/questions/1499804/how-can-i-detect-japanese-text-in-a-java-string Assumption is that the char array is Unicode characters.
countDigits(String) - Static method in class org.opensextant.util.TextUtils
 
countFormattingSpace(String) - Static method in class org.opensextant.util.TextUtils
Count formatting whitespace.
countIrregularPunctuation(String) - Static method in class org.opensextant.util.TextUtils
 
countNonText(String) - Static method in class org.opensextant.util.TextUtils
Count number of non-alphanumeric chars are present.
countriesInDSTOffset(double) - Method in class org.opensextant.util.GeonamesUtility
This check only makes sense if you have date/time which is in a period of daylight savings.
countriesInTimezone(String) - Method in class org.opensextant.util.GeonamesUtility
List all countries in a particular TZ
countriesInUTCOffset(double) - Method in class org.opensextant.util.GeonamesUtility
List all countries in a particular UTC offset; These are usually -15.0 to 15.0 every 0.5 or 0.25 hrs.
countriesSpeaking(String) - Method in class org.opensextant.util.GeonamesUtility
Examples: what countries speak french (fr)? what countries speak Rwandan French? (fr-RW)?
country - Variable in class org.opensextant.data.Place
 
Country - Class in org.opensextant.data
Country metadata provided on this class includes: ISO-3166 country code 2-char and 3-char forms, aligned with US standard FIPS 10-4 codes Country aliases: nick names, variant names, abbreviations Affiliated territories Timezone and UTC offset for temporal calculations Primary and Secondary languages
Country(String, String) - Constructor for class org.opensextant.data.Country
A country abstraction that uses ISO 2-alpha as an ID, and any name given as the Place.name
COUNTRY_ADM0 - Static variable in class org.opensextant.util.GeonamesUtility
 
COUNTRY_ADM0_NORM - Static variable in class org.opensextant.util.GeonamesUtility
 
country_id - Variable in class org.opensextant.data.Place
 
Country.TZ - Class in org.opensextant.data
 
countrySpeaks(String, String) - Method in class org.opensextant.util.GeonamesUtility
Is language spoken in country ID'd by cc? See TextUtils for list of langauges provided by Library of Congress.
CR - Static variable in class org.opensextant.util.TextUtils
 
create_match(String, String, Map<String, String>) - Method in class org.opensextant.extractors.poli.PoliPatternManager
Deprecated.
logic for creation of a match is back in main PoLi match loop
create_pattern(String, String, String) - Method in class org.opensextant.extractors.flexpat.RegexPatternManager
Implementation must create a RegexPattern given the basic RULE define, #RULE FAMILY RID REGEX PatternManager here adds compiled pattern and DEFINES.
create_pattern(String, String, String) - Method in class org.opensextant.extractors.poli.PoliPatternManager
Pattern Factory Implementation must create a RegexPattern given the basic RULE define, #RULE FAMILY RID REGEX PatternManager here adds compiled pattern and DEFINES.
create_pattern(String, String, String) - Method in class org.opensextant.extractors.xcoord.PatternManager
Implementation must create a RegexPattern given the basic RULE define, #RULE FAMILY RID REGEX PatternManager here adds compiled pattern and DEFINES.
create_pattern(String, String, String) - Method in class org.opensextant.extractors.xtemporal.PatternManager
 
create_testcase(String, String, String) - Method in class org.opensextant.extractors.flexpat.RegexPatternManager
Implementation must create TestCases given the #TEST directive, #TEST RID TID TEXT
create_testcase(String, String, String) - Method in class org.opensextant.extractors.poli.PoliPatternManager
Implementation must create TestCases given the #TEST directive, #TEST RID TID TEXT
create_testcase(String, String, String) - Method in class org.opensextant.extractors.xcoord.PatternManager
Implementation must create TestCases given the #TEST directive, #TEST RID TID TEXT
create_testcase(String, String, String) - Method in class org.opensextant.extractors.xtemporal.PatternManager
 
createAnnotation(String, String, String, int, int, String) - Static method in class org.opensextant.annotations.AnnotationHelper
 
createAnnotation(String, String, String, int, String) - Static method in class org.opensextant.annotations.AnnotationHelper
Creates a standard named entity annotation.
createCountry(Annotation) - Static method in class org.opensextant.annotations.AnnotationHelper
Returns an instance of a Country object using annotation value as country name, and attr[cc] optionally as code.
createCountryAnnotation(String, String, String, int, String, String) - Static method in class org.opensextant.annotations.AnnotationHelper
Tracking a country name match of some sort.
createGeocoding(Annotation) - Static method in class org.opensextant.annotations.AnnotationHelper
Decode: Geocoding See OpenSextant Geocoding interface.
createGeocodingAnnotation(String, String, String, int, String, Geocoding) - Static method in class org.opensextant.annotations.AnnotationHelper
Encode geocoding annotations to be saved.
createOutputFileName() - Method in class org.opensextant.output.AbstractFormatter
 
createOutputFileName() - Method in class org.opensextant.output.AbstractGenericFormatter
Creates the output file name.
createOutputStreams() - Method in class org.opensextant.output.AbstractFormatter
Create the output stream appropriate for the output type.
createOutputStreams() - Method in class org.opensextant.output.AbstractGenericFormatter
Create the output stream appropriate for the output type.
createOutputStreams() - Method in class org.opensextant.output.CSVFormatter
 
createOutputStreams() - Method in class org.opensextant.output.CSVGenericFormatter
 
createOutputStreams() - Method in class org.opensextant.output.GDBFormatter
Create the output stream appropriate for the output type.
createOutputStreams() - Method in class org.opensextant.output.GeoCSVFormatter
Create the output stream appropriate for the output type.
createOutputStreams() - Method in class org.opensextant.output.KMLFormatter
Create the output stream appropriate for the output type.
createOutputStreams() - Method in class org.opensextant.output.ShapefileFormatter
Create the output stream appropriate for the output type.
createOutputStreams() - Method in class org.opensextant.output.WKTFormatter
Create the output stream appropriate for the output type.
createPatternManager(InputStream, String) - Method in class org.opensextant.extractors.flexpat.AbstractFlexPat
Create a pattern manager given the input stream and the file name.
createPatternManager(InputStream, String) - Method in class org.opensextant.extractors.poli.PatternsOfLife
 
createPatternManager(InputStream, String) - Method in class org.opensextant.extractors.xcoord.XCoord
 
createPatternManager(InputStream, String) - Method in class org.opensextant.extractors.xtemporal.XTemporal
 
createTaxon(Annotation) - Static method in class org.opensextant.annotations.AnnotationHelper
Recreates a Taxon from a stored annotation.
createTaxonAnnotation(String, String, String, int, String, Taxon) - Static method in class org.opensextant.annotations.AnnotationHelper
Create an annotation for a Taxon node that has a found value, val, in document, docid at offset.
createTempFolder(String) - Method in class org.opensextant.output.GISDataFormatter
 
createTemporalAnnotation(String, String, String, int, int, String, Date, String) - Static method in class org.opensextant.annotations.AnnotationHelper
Same createTemporalEntityAnnotation, just with len param.
createTemporalEntityAnnotation(String, String, String, int, String, Date, String) - Static method in class org.opensextant.annotations.AnnotationHelper
Creates the temporal entity annotation.
CSVFormatter - Class in org.opensextant.output
 
CSVFormatter() - Constructor for class org.opensextant.output.CSVFormatter
 
CSVGenericFormatter - Class in org.opensextant.output
Alternative to CSVFormatter which is schema-specific and a bit rigid.
CSVGenericFormatter(Parameters) - Constructor for class org.opensextant.output.CSVGenericFormatter
 
currency - Variable in class org.opensextant.extractors.poli.data.Money
 
CURRENT_YEAR - Variable in class org.opensextant.extractors.xcoord.MGRSFilter
The current year.
CURRENT_YY - Variable in class org.opensextant.extractors.xcoord.MGRSFilter
The current yy.
currErrors - Variable in class org.opensextant.processing.XtractorGroup
API: child implementations have access to accumulated errors; reset() clears errors and other state.

D

dashCount - Variable in class org.opensextant.extractors.xcoord.GeocoordMatch
count dashes other than hemispheres, +/-.
DATA_MIMETYPE - Static variable in class org.opensextant.util.FileUtility
 
date - Variable in class org.opensextant.data.social.Message
Date object for the message timestamp
date - Variable in class org.opensextant.extractors.xtemporal.TestCase
 
DateMatch - Class in org.opensextant.extractors.xtemporal
 
DateMatch(int, int) - Constructor for class org.opensextant.extractors.xtemporal.DateMatch
 
DateMatch.TimeResolution - Enum in org.opensextant.extractors.xtemporal
A simplistic way to capture resolution of the date/time reference.
datenorm - Variable in class org.opensextant.extractors.xtemporal.DateMatch
Just the coordinate text normalized
datenorm_text - Variable in class org.opensextant.extractors.xtemporal.DateMatch
 
DateNormalization - Class in org.opensextant.extractors.xtemporal
 
DateNormalization() - Constructor for class org.opensextant.extractors.xtemporal.DateNormalization
 
dateText - Variable in class org.opensextant.data.social.Message
Original text of the date, if given
DATETIME_FAMILY - Static variable in class org.opensextant.extractors.xtemporal.XTConstants
 
DateTimePattern - Class in org.opensextant.extractors.xtemporal
 
DateTimePattern(String, String, String) - Constructor for class org.opensextant.extractors.xtemporal.DateTimePattern
 
DAY - Enum constant in enum org.opensextant.extractors.xtemporal.DateMatch.TimeResolution
 
DD_FILTERS_ON - Static variable in class org.opensextant.extractors.xcoord.XConstants
RUNTIME FLAGS: filter Decimal Degree coordinates -- primarily for bare floating point numbers w/out hemisphere or other symbols
DD_PATTERN - Static variable in class org.opensextant.extractors.xcoord.XConstants
 
DD_precision_list - Static variable in class org.opensextant.extractors.xcoord.PrecisionScales
Maximal error in METERS in coordinate with N decimal places; for up to 12 decimal places.
debug - Variable in class org.opensextant.extractors.flexpat.AbstractFlexPat
 
debug - Variable in class org.opensextant.extractors.flexpat.RegexPatternManager
 
debug - Variable in class org.opensextant.output.AbstractFormatter
 
debug - Variable in class org.opensextant.output.AbstractGenericFormatter
 
decodeAnnotations(List<Annotation>) - Static method in class org.opensextant.annotations.AnnotationHelper
Given encoded annotations from db, decode them and yield a flattened set of annotations, e.g., for use with MAT
decodeOffsets(String) - Static method in class org.opensextant.annotations.AnnotationHelper
Take a list of numbers and convert to Integer list "1;5;89;777" => List<> [ 1, 5, 89, 777 ].
decodeOffsets(Annotation, String) - Static method in class org.opensextant.annotations.AnnotationHelper
Generate annotations in a linear fashion.
DeepEyeData - Class in org.opensextant.annotations
A base class for Record, Annotation and other structures.
DeepEyeData() - Constructor for class org.opensextant.annotations.DeepEyeData
 
DeepEyeException - Exception in org.opensextant.annotations
Exception used when there is a user or system error related to data serialization or any sort of Java object - to JSONification error.
DeepEyeException(Exception) - Constructor for exception org.opensextant.annotations.DeepEyeException
 
DeepEyeException(String) - Constructor for exception org.opensextant.annotations.DeepEyeException
 
DeepEyeException(String, Exception) - Constructor for exception org.opensextant.annotations.DeepEyeException
 
DeepEyeStore - Interface in org.opensextant.annotations
DeepEyeStore is an abstraction of a data store that stores records and annotations.
DEFAULT_ENCODING - Static variable in class org.opensextant.util.FileUtility
 
DEFAULT_FILTERS - Static variable in class org.opensextant.extraction.ExtractionResult
 
DEFAULT_GEOHASH_PRECISION - Static variable in class org.opensextant.util.GeodeticUtility
 
DEFAULT_POLI_CFG - Static variable in class org.opensextant.extractors.poli.PatternsOfLife
 
DEFAULT_PRECISION - Static variable in class org.opensextant.util.GeodeticUtility
 
DEFAULT_UNKNOWN_RESOLUTION - Static variable in class org.opensextant.extractors.xcoord.PrecisionScales
 
DEFAULT_WORKING_SIZE - Static variable in class org.opensextant.extractors.langid.LangDetect
If working size, in CHARS, is less than 180 (20 8 char words + 1 whitespace word break);
DEFAULT_XCOORD_CFG - Static variable in class org.opensextant.extractors.xcoord.XCoord
 
DEFAULT_XTEMP_CFG - Static variable in class org.opensextant.extractors.xtemporal.XTemporal
The Constant DEFAULT_XTEMP_CFG.
defaultFields() - Method in class org.opensextant.output.AbstractGenericFormatter
Default fields for generic CSV output.
defaultFields() - Method in class org.opensextant.output.CSVFormatter
Default fields for generic CSV output.
defaultFields() - Method in class org.opensextant.output.GISDataModel
Default fields.
defaultHierarchicalPath() - Method in class org.opensextant.data.Place
This sets the default to non-null value.
defaultMatchId() - Method in class org.opensextant.extraction.TextMatch
If called, this overwrites existing match_id Match ID is typically entity label @ offset.
DEG - Enum constant in enum org.opensextant.extractors.xcoord.DMSOrdinate.Resolution
 
degLatFields - Static variable in class org.opensextant.extractors.xcoord.DMSOrdinate
 
degLonFields - Static variable in class org.opensextant.extractors.xcoord.DMSOrdinate
 
DEL - Static variable in class org.opensextant.util.TextUtils
 
delete_controls(String) - Static method in class org.opensextant.util.TextUtils
Delete control chars from text data; leaving text and whitespace only.
delete_eol(String) - Static method in class org.opensextant.util.TextUtils
Replace line endings with SPACE
delete_whitespace(String) - Static method in class org.opensextant.util.TextUtils
Delete whitespace of any sort.
deleteOutput(File) - Method in class org.opensextant.output.AbstractFormatter
This is checked only by internal classes as they create output streams.
deleteOutput(File) - Method in class org.opensextant.output.AbstractGenericFormatter
This is checked only by internal classes as they create output streams.
description - Variable in class org.opensextant.extractors.flexpat.RegexPattern
 
detect(String) - Method in class org.opensextant.extractors.langid.LangDetect
API for LangDetect, cybozu.labs
detect(String, boolean) - Method in class org.opensextant.extractors.langid.LangDetect
API for LangDetect, cybozu.labs.
detectSocialMediaLang(String, String) - Method in class org.opensextant.extractors.langid.LangDetect
Find best lang ID for short texts.
detectSocialMediaLang(String, String, boolean) - Method in class org.opensextant.extractors.langid.LangDetect
EXPERIMENTAL , EXPERIMENTAL, EXPERIMENTAL UPDATE, 2015.
df - Variable in class org.opensextant.extractors.xcoord.MGRSFilter
DateFormat used to check for dates that look like MGRS i.e.
digest_latitude_match() - Method in class org.opensextant.extractors.xcoord.DMSOrdinate
This should cascade.
digest_longitude_match() - Method in class org.opensextant.extractors.xcoord.DMSOrdinate
This is a copy of the logic for digest_latitude_match; All I replace is "Lat" with "Lon"
digits - Variable in class org.opensextant.extractors.xcoord.GeocoordPrecision
# of decimal places in D.ddd...
disable_patterns(String) - Method in class org.opensextant.extractors.poli.PoliPatternManager
Enable a family of patterns
disableAll() - Method in class org.opensextant.extractors.flexpat.AbstractFlexPat
 
disableAll() - Method in class org.opensextant.extractors.flexpat.RegexPatternManager
Enable a family of patterns
disableAll() - Method in class org.opensextant.extractors.xcoord.XCoord
 
disconnect() - Method in interface org.opensextant.annotations.DeepEyeStore
 
distanceDegrees(double, double, double, double) - Static method in class org.opensextant.util.GeodeticUtility
This returns distance in degrees, e.g., this is a Cartesian distance.
distanceDegrees(GeoBase, GeoBase) - Static method in class org.opensextant.util.GeodeticUtility
This returns distance in degrees, e.g., this is a Cartesian distance.
distanceMeters(LatLon, LatLon) - Static method in class org.opensextant.util.GeodeticUtility
Haversine distance using LL1 to LL2;
DM_PATTERN - Static variable in class org.opensextant.extractors.xcoord.XConstants
 
DMS_FILTERS_ON - Static variable in class org.opensextant.extractors.xcoord.XConstants
RUNTIME FLAGS: filter DMS coordinates
DMS_MIN_PREC - Static variable in class org.opensextant.extractors.xcoord.PrecisionScales
 
DMS_PATTERN - Static variable in class org.opensextant.extractors.xcoord.XConstants
 
DMS_SEC_PREC - Static variable in class org.opensextant.extractors.xcoord.PrecisionScales
 
DMSFilter - Class in org.opensextant.extractors.xcoord
DMS Filters include ignoring these patterns: dd-dd-dd HH:MM:ss (where dd-dd-dd HH-MM-ss would be a valid coordinate as the field separators for lat/lon are the same).
DMSFilter() - Constructor for class org.opensextant.extractors.xcoord.DMSFilter
 
DMSOrdinate - Class in org.opensextant.extractors.xcoord
DMSOrdinate represents all the various fields a WGS84 cartesian coordinate could have.
DMSOrdinate(int, int, int, boolean, int) - Constructor for class org.opensextant.extractors.xcoord.DMSOrdinate
Deprecated.
For XCoord purposes use the DMSOrdinate(map, map, boolean, text) form.
DMSOrdinate(int, int, int, int, boolean, int) - Constructor for class org.opensextant.extractors.xcoord.DMSOrdinate
Deprecated.
For XCoord purposes use the DMSOrdinate(map, map, boolean, text) form.
DMSOrdinate(String, String, String, boolean, int) - Constructor for class org.opensextant.extractors.xcoord.DMSOrdinate
Deprecated.
For XCoord purposes use the DMSOrdinate(map, map, boolean, text) form.
DMSOrdinate(Map<String, TextEntity>, Map<String, String>, boolean, String) - Constructor for class org.opensextant.extractors.xcoord.DMSOrdinate
DMS ordinates can be made up of degrees, minutes, seconds and then decimal minutes and decimal seconds fractional minutes dddd are formatted as ".dddd" then parsed; ADD to existing minutes fractional seconds dddd are formatted as ".dddd" then parsed; ADD to existing seconds decimal minutes d.ddd..
DMSOrdinate.Resolution - Enum in org.opensextant.extractors.xcoord
Resolution field for DMS.ms
DMY_FAMILY - Static variable in class org.opensextant.extractors.xtemporal.XTConstants
 
DOC_MIMETYPE - Static variable in class org.opensextant.util.FileUtility
 
doc_type - Variable in class org.opensextant.output.GISDataFormatter
 
DocInput - Class in org.opensextant.data
Use only for cases where you have document inputs instead of raw records.
DocInput(String, String) - Constructor for class org.opensextant.data.DocInput
 
dstDelta - Variable in class org.opensextant.data.Country.TZ
 
dstOffset - Variable in class org.opensextant.data.Country.TZ
 

E

EARTH_RADIUS - Static variable in class org.opensextant.util.GeodeticUtility
 
EAST - Static variable in class org.opensextant.extractors.xcoord.DMSOrdinate
 
EmailAddress - Class in org.opensextant.extractors.poli.data
 
EmailAddress() - Constructor for class org.opensextant.extractors.poli.data.EmailAddress
 
EmailAddress(String) - Constructor for class org.opensextant.extractors.poli.data.EmailAddress
 
EmailAddress(Map<String, String>, String) - Constructor for class org.opensextant.extractors.poli.data.EmailAddress
 
enable_CCE_family(int, boolean) - Method in class org.opensextant.extractors.xcoord.PatternManager
 
enable_pattern(RegexPattern) - Method in class org.opensextant.extractors.flexpat.RegexPatternManager
enable an instance of a pattern based on the global settings.
enable_pattern(RegexPattern) - Method in class org.opensextant.extractors.poli.PoliPatternManager
enable an instance of a pattern based on the global settings.
enable_pattern(RegexPattern) - Method in class org.opensextant.extractors.xcoord.PatternManager
enable an instance of a pattern based on the global settings.
enable_pattern(RegexPattern) - Method in class org.opensextant.extractors.xtemporal.PatternManager
enable an instance of a pattern based on the global settings.
enable_pattern_family(int, boolean) - Method in class org.opensextant.extractors.xtemporal.PatternManager
 
enable_patterns(String) - Method in class org.opensextant.extractors.flexpat.RegexPatternManager
default adapter -- you must override.
enable_patterns(String) - Method in class org.opensextant.extractors.poli.PoliPatternManager
 
enableAll() - Method in class org.opensextant.extractors.flexpat.AbstractFlexPat
 
enableAll() - Method in class org.opensextant.extractors.flexpat.RegexPatternManager
 
enableAll() - Method in class org.opensextant.extractors.poli.PoliPatternManager
You don't really want to enable All patterns...
enableAll() - Method in class org.opensextant.extractors.xcoord.XCoord
 
enabled - Variable in class org.opensextant.extractors.flexpat.RegexPattern
 
encodeOffsets(Collection<Integer>) - Static method in class org.opensextant.annotations.AnnotationHelper
Encode offsets.
end - Variable in class org.opensextant.extraction.TextEntity
char offset of entity; location in document where entity ends.
END_OFFSET - Static variable in class org.opensextant.output.OpenSextantSchema
 
englishLang - Static variable in class org.opensextant.util.TextUtils
 
entitiesKey - Variable in class org.opensextant.data.social.Tweet
 
evaluated - Variable in class org.opensextant.extractors.flexpat.TextMatchResult
 
evaluateInvalidDashes(Map<String, String>) - Method in class org.opensextant.extractors.xcoord.GeocoordMatch
Evaluate DMS patterns only...
evaluateInvalidPunct(Map<String, String>) - Method in class org.opensextant.extractors.xcoord.GeocoordMatch
Test punctuation separators.
evaluateSpecificity(DMSOrdinate, DMSOrdinate) - Static method in class org.opensextant.extractors.xcoord.GeocoordNormalization
Hueuristic for what style of fields are allowed in valid DD or DM/DMS coordinates.
extract(String) - Method in interface org.opensextant.extraction.Extractor
Useful for working with text buffers adhoc.
extract(String) - Method in class org.opensextant.extractors.poli.PatternsOfLife
 
extract(String) - Method in class org.opensextant.extractors.xcoord.XCoord
Support the standard Extractor interface.
extract(String) - Method in class org.opensextant.extractors.xtemporal.XTemporal
Support the standard Extractor interface.
extract(TextInput) - Method in interface org.opensextant.extraction.Extractor
Useuful for working with batches of inputs that have an innate row ID + buffer pairing.
extract(TextInput) - Method in class org.opensextant.extractors.poli.PatternsOfLife
Support the standard Extractor interface.
extract(TextInput) - Method in class org.opensextant.extractors.xcoord.XCoord
Support the standard Extractor interface.
extract(TextInput) - Method in class org.opensextant.extractors.xtemporal.XTemporal
Support the standard Extractor interface.
extract(TextInput, String) - Method in class org.opensextant.extractors.poli.PatternsOfLife
 
extract_coordinates(String, String) - Method in class org.opensextant.extractors.xcoord.XCoord
Assess all enabled patterns against the given text.
extract_coordinates(String, String, int) - Method in class org.opensextant.extractors.xcoord.XCoord
Limit the extraction to a particular family of coordinates.
extract_dates(String, String) - Method in class org.opensextant.extractors.xtemporal.XTemporal
A direct call to extract dates; which is useful for diagnostics and development/testing.
extract_patterns(String, String, String) - Method in class org.opensextant.extractors.poli.PatternsOfLife
Extract patterns of a certain family from a block of text.
ExtractionException - Exception in org.opensextant.extraction
An exception to be thrown when place name matching goes awry.
ExtractionException() - Constructor for exception org.opensextant.extraction.ExtractionException
 
ExtractionException(String) - Constructor for exception org.opensextant.extraction.ExtractionException
 
ExtractionException(String, Throwable) - Constructor for exception org.opensextant.extraction.ExtractionException
 
ExtractionMetrics - Class in org.opensextant.extraction
This is a holder for tracking various common measures: No.
ExtractionMetrics(String) - Constructor for class org.opensextant.extraction.ExtractionMetrics
A named metric
ExtractionResult - Class in org.opensextant.extraction
 
ExtractionResult(String) - Constructor for class org.opensextant.extraction.ExtractionResult
Given a record ID, create a container for holding onto all the geocodes for that particular data object.
Extractor - Interface in org.opensextant.extraction
For now, this interface is closer to an AbstractExtractor where a clean interface might be output = Extractor.extract(input) This interface specifies more
extractors - Variable in class org.opensextant.processing.XtractorGroup
API: child implementations have access to the core list of extractors.

F

family - Variable in class org.opensextant.extractors.flexpat.PatternTestCase
 
family - Variable in class org.opensextant.extractors.flexpat.RegexPattern
 
family_id - Variable in class org.opensextant.extractors.flexpat.PatternTestCase
 
family_id - Variable in class org.opensextant.extractors.xtemporal.DateTimePattern
 
familyInt - Static variable in class org.opensextant.extractors.xcoord.XConstants
 
familyInt - Static variable in class org.opensextant.extractors.xtemporal.XTConstants
 
familyLabel - Static variable in class org.opensextant.extractors.xcoord.XConstants
 
familyLabel - Static variable in class org.opensextant.extractors.xtemporal.XTConstants
 
farsiLang - Static variable in class org.opensextant.util.TextUtils
 
fast_replace(String, String, String) - Static method in class org.opensextant.util.TextUtils
Given a string S and a list of characters to replace with a substitute, return the new string, S'.
FEATURE_CLASS - Static variable in class org.opensextant.output.OpenSextantSchema
Geonames Feature class
FEATURE_CODE - Static variable in class org.opensextant.output.OpenSextantSchema
Geonames style feature coding
FEATURE_GEOHASH_PRECISION - Static variable in class org.opensextant.util.GeodeticUtility
 
FEATURE_PRECISION - Static variable in class org.opensextant.util.GeodeticUtility
Precision -- this is a first draft attempt at assigning some error bars to geocoding results.
FEED_MIMETYPE - Static variable in class org.opensextant.util.FileUtility
 
field_order - Variable in class org.opensextant.output.GISDataModel
 
field_set - Variable in class org.opensextant.output.GISDataModel
 
fieldOrder - Variable in class org.opensextant.output.AbstractGenericFormatter
The field_order.
fieldOrder - Variable in class org.opensextant.output.CSVFormatter
 
fieldSet - Variable in class org.opensextant.output.AbstractGenericFormatter
Distinct set of fields in your output schema.
FILENAME - Static variable in class org.opensextant.output.OpenSextantSchema
 
FILENAME_REPLACE_CHAR - Static variable in class org.opensextant.util.FileUtility
Char to use in place of special chars when scrubbing filenames.
filenameCleaner(String) - Static method in class org.opensextant.util.FileUtility
Another utility to deal with unicode in filenames
FILEPATH - Static variable in class org.opensextant.output.OpenSextantSchema
Optionally the File path for the text
FILEPATH_FLD - Static variable in class org.opensextant.output.OpenSextantSchema
Field names: filepath
FileUtility - Class in org.opensextant.util
 
filter_out(GeocoordMatch) - Static method in class org.opensextant.extractors.xcoord.GeocoordNormalization
Not all pattens might have filters.
FILTERED_IN - Static variable in class org.opensextant.extractors.xcoord.XConstants
 
FILTERED_OUT - Static variable in class org.opensextant.extractors.xcoord.XConstants
 
filterMatchesBySpans(String, List<TextMatch>) - Static method in class org.opensextant.extraction.MatcherUtils
A simple demonstration of how to sift through matches identifying which matches appear within tags.
filterOut(String) - Method in class org.opensextant.extraction.MatchFilter
If value is in stop list, then filter it out.
filterOut(TextMatch) - Method in class org.opensextant.output.GISDataFormatter
This helps you figure out what to put in the GIS products.
findRecord(String) - Method in interface org.opensextant.annotations.DeepEyeStore
find a single record.
findSimilar(Annotation) - Method in interface org.opensextant.annotations.DeepEyeStore
given the shell of a annot, find similar records.
findSimilar(Record) - Method in interface org.opensextant.annotations.DeepEyeStore
given the shell of a record, find similar records.
findTagSpans(String) - Static method in class org.opensextant.extraction.MatcherUtils
Trivial attempt at locating edges of tags in data.
finish() - Method in class org.opensextant.output.AbstractFormatter
 
finish() - Method in class org.opensextant.output.AbstractGenericFormatter
Finish.
finish() - Method in class org.opensextant.output.CSVFormatter
 
finish() - Method in class org.opensextant.output.CSVGenericFormatter
 
finish() - Method in class org.opensextant.output.GISDataFormatter
 
finish() - Method in interface org.opensextant.output.ResultsFormatter
 
FIPS2ISO(String) - Method in class org.opensextant.util.GeonamesUtility
Find an ISO code for a given FIPS entry.
fixNull(String) - Static method in class org.opensextant.data.social.Tweet
 
FLAG_ALL_FILTERS - Static variable in class org.opensextant.extractors.xcoord.XConstants
RUNTIME FLAGS: filter all coordinate patterns that have filters
FLAG_EXTRACT_CONTEXT - Static variable in class org.opensextant.extractors.xcoord.XConstants
RUNTIME FLAGS: extract context or not
FLAG_EXTRACT_CONTEXT - Static variable in class org.opensextant.processing.Parameters
 
FLAG_NO_COORDINATES - Static variable in class org.opensextant.processing.Parameters
 
FOLDER_MIMETYPE - Static variable in class org.opensextant.util.FileUtility
 
format(double, int) - Static method in class org.opensextant.extractors.xcoord.PrecisionScales
This was deemed to be more Java like, however performs 10x slower than format2() -- which unfortunately rounds too early.
format(ExtractionResult) - Method in class org.opensextant.processing.XtractorGroup
Format each result; Some formatters may pass on results For example, Shapefile formatter accepts only Geocoding-capable TextMatch.
format_date(Date) - Static method in class org.opensextant.extractors.xtemporal.DateNormalization
Format_date.
formatConfidence(double) - Method in class org.opensextant.output.GISDataModel
Convenience method for managing how confidence number is reported in output.
formatConfidence(double) - Static method in class org.opensextant.processing.ResultsUtility
Format confidence.
formatLatitude() - Method in class org.opensextant.extractors.xcoord.GeocoordMatch
Format latitude.
formatLatLon(LatLon) - Static method in class org.opensextant.util.GeodeticUtility
Create a string representation of a decimal lat/lon.
formatLongitude() - Method in class org.opensextant.extractors.xcoord.GeocoordMatch
Format longitude.
formatPrecision() - Method in class org.opensextant.extractors.xcoord.GeocoordMatch
Precision value is in Meters.
formatResults(ExtractionResult) - Method in class org.opensextant.output.AbstractFormatter
Write to a file and return HTML containing a link to the file.
formatResults(ExtractionResult) - Method in class org.opensextant.output.CSVGenericFormatter
Not implemented: this formatter is more flexible than requiring you to use it for Extraction output
formatResults(ExtractionResult) - Method in interface org.opensextant.output.ResultsFormatter
Formats the results obtained from processing a corpus through OpenSextant.
FormatterFactory - Class in org.opensextant.output
 
FormatterFactory() - Constructor for class org.opensextant.output.FormatterFactory
 
formatters - Variable in class org.opensextant.processing.XtractorGroup
API: child implementations have access to the core list of extractors.
FOUND_CASE - Static variable in class org.opensextant.extractors.poli.PoliMatch
 
frenchLang - Static variable in class org.opensextant.util.TextUtils
 
fromJSON(JsonObject) - Method in class org.opensextant.data.social.Tweet
If "base data" has been filled in by other method, e.g., TW4J or other formal API, then avoid parsing the basics here: id, text ,author*, date, lang, are checked if null.

G

GDBFormatter - Class in org.opensextant.output
A results formatter for FileGDB output.
GDBFormatter() - Constructor for class org.opensextant.output.GDBFormatter
 
general_dates - Variable in class org.opensextant.extractors.xcoord.DMSFilter
 
general_formats - Static variable in class org.opensextant.extractors.xcoord.DMSFilter
Date patterns to test matches; Avoid coordinate interpretation if date parses.
generateUniqueFilename(String, String) - Static method in class org.opensextant.util.FileUtility
Generate some filename with a unique date/time stamp
generateUniquePath(String, String, String) - Static method in class org.opensextant.util.FileUtility
Generate some path with a unique date/time stamp
GeoBase - Class in org.opensextant.data
An intermediary between the simple LatLon and other conceptual classes: Place, Country, etc.
GeoBase() - Constructor for class org.opensextant.data.GeoBase
Creates an empty GeoBase location object
GeoBase(double, double) - Constructor for class org.opensextant.data.GeoBase
 
GeoBase(String, String) - Constructor for class org.opensextant.data.GeoBase
Creates a new instance of GeoBase
Geocoding - Interface in org.opensextant.data
An interface that describes any data that can be geocoded -- the metadata behind deriving location is as important as the actual location is.
GeocoordMatch - Class in org.opensextant.extractors.xcoord
GeocoordMatch holds all the annotation data for the actual raw and normalized coordinate.
GeocoordMatch(int, int) - Constructor for class org.opensextant.extractors.xcoord.GeocoordMatch
a TextMatch that represents a coordinate found in free text.
GeocoordMatchFilter - Interface in org.opensextant.extractors.xcoord
Filtering matches is a matter of practicality.
GeocoordNormalization - Class in org.opensextant.extractors.xcoord
 
GeocoordNormalization() - Constructor for class org.opensextant.extractors.xcoord.GeocoordNormalization
 
GeocoordPattern - Class in org.opensextant.extractors.xcoord
 
GeocoordPattern(String, String, String) - Constructor for class org.opensextant.extractors.xcoord.GeocoordPattern
 
GeocoordPrecision - Class in org.opensextant.extractors.xcoord
 
GeocoordPrecision() - Constructor for class org.opensextant.extractors.xcoord.GeocoordPrecision
 
GeocoordTestCase - Class in org.opensextant.extractors.xcoord
 
GeocoordTestCase(String, String, String) - Constructor for class org.opensextant.extractors.xcoord.GeocoordTestCase
 
GeoCSVFormatter - Class in org.opensextant.output
A results formatter for shapefile output.
GeoCSVFormatter() - Constructor for class org.opensextant.output.GeoCSVFormatter
 
GeodeticUtility - Class in org.opensextant.util
A collection of geodetic routines used within OpenSextant.
geoEnabled - Variable in class org.opensextant.data.social.Tweet
 
geohash(double, double) - Static method in class org.opensextant.util.GeodeticUtility
 
geohash(LatLon) - Static method in class org.opensextant.util.GeodeticUtility
 
geoInterpreter - Variable in class org.opensextant.output.AbstractFormatter
 
GeonamesUtility - Class in org.opensextant.util
 
GeonamesUtility() - Constructor for class org.opensextant.util.GeonamesUtility
A utility class that offers many static routines; If you instantiate this class it will require metadata files for country-names and feature-codes in your classpath
germanLang - Static variable in class org.opensextant.util.TextUtils
 
get_CCE_family(int) - Static method in class org.opensextant.extractors.xcoord.XConstants
Get the CCE family for the given XConstants enum id
get_CCE_family(String) - Static method in class org.opensextant.extractors.xcoord.XConstants
Get the CCE family for the given string/key
get_hemisphere_sign(String) - Static method in class org.opensextant.extractors.xcoord.DMSOrdinate
 
get_hemisphere_symbol(int) - Static method in class org.opensextant.extractors.xcoord.DMSOrdinate
 
get_pattern(String) - Method in class org.opensextant.extractors.flexpat.RegexPatternManager
Access the paterns by ID
get_patterns() - Method in class org.opensextant.extractors.flexpat.RegexPatternManager
 
get_text_window(int, int, int) - Static method in class org.opensextant.util.TextUtils
Get a single text window around the offset.
get_text_window(int, int, int, int) - Static method in class org.opensextant.util.TextUtils
Find the text window(s) around a match.
get_trace() - Method in class org.opensextant.extractors.flexpat.TextMatchResult
 
getAdmin1() - Method in interface org.opensextant.data.Geocoding
 
getAdmin1() - Method in class org.opensextant.data.Place
 
getAdmin1() - Method in class org.opensextant.extractors.xcoord.GeocoordMatch
Gets the admin 1.
getAdmin1Metadata() - Method in class org.opensextant.util.GeonamesUtility
Deprecated.
Use getUSStateMetadata
getAdmin1Name() - Method in interface org.opensextant.data.Geocoding
 
getAdmin1Name() - Method in class org.opensextant.data.Place
 
getAdmin1Name() - Method in class org.opensextant.extractors.xcoord.GeocoordMatch
Null implementation -- Geocoding interface.
getAdmin1Place(String, String) - Method in class org.opensextant.util.GeonamesUtility
Retrieve a Place object with the semi-official name (in Latin/Anglo terms) given CC and ADM1 code.
getAdmin1PlaceByHASC(String) - Method in class org.opensextant.util.GeonamesUtility
Lookup by coded path, CC.ADM1.
getAdmin1PostalCode() - Method in interface org.opensextant.data.Geocoding
State-level postal code, the corresponds usually to ADM1
getAdmin1PostalCode() - Method in class org.opensextant.data.Place
State-level postal code, the corresponds usually to ADM1
getAdmin1PostalCode() - Method in class org.opensextant.extractors.xcoord.GeocoordMatch
State-level postal code, the corresponds usually to ADM1.
getAdmin2() - Method in interface org.opensextant.data.Geocoding
 
getAdmin2() - Method in class org.opensextant.data.Place
 
getAdmin2() - Method in class org.opensextant.extractors.xcoord.GeocoordMatch
Gets the admin 2.
getAdmin2Name() - Method in interface org.opensextant.data.Geocoding
 
getAdmin2Name() - Method in class org.opensextant.data.Place
 
getAdmin2Name() - Method in class org.opensextant.extractors.xcoord.GeocoordMatch
Null implementation -- Geocoding interface Coordinates from XCoord do not automatically report name of admin boundaries.
getAdminName() - Method in interface org.opensextant.data.Geocoding
 
getAdminName() - Method in class org.opensextant.data.Place
 
getAdminName() - Method in class org.opensextant.extractors.xcoord.GeocoordMatch
GeoBase interface.
getAliases() - Method in class org.opensextant.data.Country
 
getAllTimezones() - Method in class org.opensextant.data.Country
A full list/map of all timezone labels mapped to UTC offsets present in this country.
getAnnotationId(String, String, String, String) - Static method in class org.opensextant.annotations.AnnotationHelper
New, required format for an annotation ID: Md5 hash made up of:
getAttribute(String) - Method in class org.opensextant.annotations.Annotation
 
getAttributeNames() - Method in class org.opensextant.annotations.DeepEyeData
 
getAttributes() - Method in class org.opensextant.annotations.DeepEyeData
Converts internal JSON store to a key/value map.
getAverageTime() - Method in class org.opensextant.extraction.ExtractionMetrics
avg time spent for this metric in milliseconds
getBasename(String, String) - Static method in class org.opensextant.util.FileUtility
get the base name of a file, given any file extension.
getByteCount() - Method in class org.opensextant.extraction.ExtractionMetrics
 
getCachedAnnotation(String, String) - Method in class org.opensextant.annotations.AnnotationHelper
Careful -- no guarntee that two entity annotations could share the same type/value unintentionally.
getCachedAnnotations() - Method in class org.opensextant.annotations.AnnotationHelper
Gets the cached annotations, unordered.
getCallCount() - Method in class org.opensextant.extraction.ExtractionMetrics
 
getCode() - Method in class org.opensextant.data.Language
 
getConfidence() - Method in interface org.opensextant.data.Geocoding
Confidence metric is a normalized 100-point scale.
getConfidence() - Method in class org.opensextant.data.Place
 
getConfidence() - Method in class org.opensextant.extractors.xcoord.GeocoordMatch
TOOD: convey a realistic confidence metric for what was actually matched.
getConfigurationDebug() - Method in class org.opensextant.extractors.flexpat.RegexPatternManager
Instead of relying on a logging API, we now throw Exceptionsages for real configuration errors, and capture configuration details in a buffer if debug is on.
getContentId() - Method in class org.opensextant.extraction.TextMatch
create a simple text-based identifier with form of value + start offset ...
getContext() - Method in class org.opensextant.extraction.TextEntity
 
getContextAfter() - Method in class org.opensextant.extraction.TextEntity
 
getContextBefore() - Method in class org.opensextant.extraction.TextEntity
 
getCountries() - Method in class org.opensextant.util.GeonamesUtility
 
getCountry() - Method in class org.opensextant.data.Place
get the country object; generally optional.
getCountry(String) - Method in class org.opensextant.util.GeonamesUtility
Get Country by the default ISO digraph returns the Unknown country if you are not using an ISO2 code.
getCountryByAnyCode(String) - Method in class org.opensextant.util.GeonamesUtility
Find distinct country object by a code.
getCountryByFIPS(String) - Method in class org.opensextant.util.GeonamesUtility
 
getCountryCode() - Method in interface org.opensextant.data.Geocoding
 
getCountryCode() - Method in class org.opensextant.data.Place
 
getCountryCode() - Method in class org.opensextant.extractors.xcoord.GeocoordMatch
Gets the country code.
getDDPrecision(String) - Static method in class org.opensextant.extractors.xcoord.PrecisionScales
Return XCoord precision (+/- meters) in latitude.
getDefaultCountryName(String) - Method in class org.opensextant.util.GeonamesUtility
Finds a default country name for a CC if one exists.
getEndOffset() - Method in class org.opensextant.annotations.Annotation
Compute span end offset.
getFeatureClass() - Method in interface org.opensextant.data.Geocoding
 
getFeatureClass() - Method in class org.opensextant.data.Place
 
getFeatureClass() - Method in class org.opensextant.extractors.xcoord.GeocoordMatch
Gets the feature class.
getFeatureCode() - Method in interface org.opensextant.data.Geocoding
 
getFeatureCode() - Method in class org.opensextant.data.Place
 
getFeatureCode() - Method in class org.opensextant.extractors.xcoord.GeocoordMatch
Gets the feature code.
getFeatureDesignation() - Method in class org.opensextant.data.Place
Returns a dynamically formatted feature string C/CODE for class/code.
getFeatureDesignation(String, String) - Static method in class org.opensextant.util.GeonamesUtility
 
getFeatureName(String, String) - Method in class org.opensextant.util.GeonamesUtility
Find a readable name or description of a class/code
getFeaturePrecision(String, String) - Static method in class org.opensextant.util.GeodeticUtility
For a given feature type and code, determine what sort of resolution or precision should be considered for that place, approximately.
getField(String) - Method in class org.opensextant.output.GISDataModel
Gets the field.
getField(String) - Static method in class org.opensextant.output.OpenSextantSchema
 
getFileDescription(String) - Static method in class org.opensextant.util.FileUtility
Get a plain language name of the type of file.
getFilenameFilter(String) - Static method in class org.opensextant.util.FileUtility
Simple filter
getFilepath() - Method in class org.opensextant.data.DocInput
get the original document
getFirstOffset(Annotation) - Static method in class org.opensextant.annotations.AnnotationHelper
Gets the first offset.
getGeo() - Method in class org.opensextant.extractors.xcoord.GeocoordTestCase
 
getGeo(JsonObject, boolean) - Static method in class org.opensextant.data.social.TweetUtility
TOOD: validate available data for this schema.
getGeocoding(TextMatch) - Method in class org.opensextant.output.AbstractFormatter
The default geocoding interpretation is here.
getGeocoding(TextMatch) - Method in interface org.opensextant.output.MatchInterpreter
For a given match in text return the geocoding object for that match, if one exists.
getGeohash() - Method in class org.opensextant.data.GeoBase
 
getGeohashPrecision(String, String) - Static method in class org.opensextant.util.GeodeticUtility
For a given Geonames feature class/designation provide a guess about how long geohash should be.
getGeoMethod() - Method in class org.opensextant.data.social.Tweet
TODO: this is not clear.
getHASC(String, String) - Static method in class org.opensextant.util.GeonamesUtility
Get a hiearchical path for a boundar or a place.
getHASC(String, String, String) - Static method in class org.opensextant.util.GeonamesUtility
 
getHierarchicalPath() - Method in class org.opensextant.data.Place
This ensures at least a default hierarchichal path is set.
getId_bias() - Method in class org.opensextant.data.Place
The ID bias is a measure of the a priori likelihood that a mention of this name refers to this particular place.
getInputStream(File, String) - Static method in class org.opensextant.util.FileUtility
 
getInputStream(String, String) - Static method in class org.opensextant.util.FileUtility
 
getInputStreamReader(File, String) - Static method in class org.opensextant.util.FileUtility
Getting an input stream from a file.
getInstance(String) - Static method in class org.opensextant.output.FormatterFactory
Supported formats are CSV, WKT, HTML, KML, Shapefile, GDB, JSON
getInstanceId() - Method in class org.opensextant.data.Place
getInteger(JsonObject, String, int) - Static method in class org.opensextant.data.social.Tweet
 
getIntValue(String) - Static method in class org.opensextant.extractors.xtemporal.DateNormalization
Gets the int value.
getISO639_1_Code() - Method in class org.opensextant.data.Language
 
getISOCountries() - Method in class org.opensextant.util.GeonamesUtility
List all country names, official and variant names.
getISOTimestamp() - Method in class org.opensextant.data.social.Tweet
 
getJobName() - Method in class org.opensextant.output.AbstractFormatter
A basic job name that reflects file name
getJobName() - Method in class org.opensextant.output.AbstractGenericFormatter
A basic job name that reflects file name.
getJobName() - Method in interface org.opensextant.output.ResultsFormatter
 
getJobName() - Method in class org.opensextant.processing.Parameters
 
getJobTimestamp() - Static method in class org.opensextant.processing.Parameters
Generates a simple job date/time key for the job
getKey() - Method in class org.opensextant.data.GeoBase
 
getLang(String) - Static method in class org.opensextant.util.GeonamesUtility
Parse lang ID from Locale.
getLanguage(String) - Static method in class org.opensextant.util.TextUtils
ISO2 and ISO3 char codes for languages are unique.
getLanguageCode(String) - Static method in class org.opensextant.util.TextUtils
ISO2 and ISO3 char codes for languages are unique.
getLanguageMap() - Static method in class org.opensextant.util.TextUtils
If caller wants to add language they can.
getLanguageName(String) - Static method in class org.opensextant.util.TextUtils
Given an ISO2 char code (least common denominator) retrieve Language Name.
getLanguages() - Method in class org.opensextant.data.Country
 
getLatHemisphereSign(Map<String, String>) - Method in class org.opensextant.extractors.xcoord.DMSOrdinate
 
getLatitude() - Method in class org.opensextant.data.GeoBase
 
getLatitude() - Method in interface org.opensextant.data.LatLon
 
getLatitude() - Method in class org.opensextant.extractors.xcoord.GeocoordMatch
Gets the latitude.
getLatText() - Method in class org.opensextant.extractors.xcoord.GeocoordMatch
Gets the lat text.
getLength() - Method in class org.opensextant.annotations.Annotation
Compute length, as logic is related to defaults, if value is set or not.
getLength() - Method in class org.opensextant.extraction.TextEntity
get the length of the matched text
getLongitude() - Method in class org.opensextant.data.GeoBase
 
getLongitude() - Method in interface org.opensextant.data.LatLon
 
getLongitude() - Method in class org.opensextant.extractors.xcoord.GeocoordMatch
Gets the longitude.
getLonHemisphereSign(Map<String, String>) - Method in class org.opensextant.extractors.xcoord.DMSOrdinate
Given a list of match groups find the first Longitude Hemisphere group hemiLon W, E group used mostly for DMS, DM, DD formats hemiLonSign +, - group allowed only for specific formats; +/- may appear before any number not just coords.
getLonText() - Method in class org.opensextant.extractors.xcoord.GeocoordMatch
Gets the lon text.
getMap() - Method in class org.opensextant.annotations.Annotation
Generate a key-value representation of the object with its current values.
getMap() - Method in class org.opensextant.annotations.DeepEyeData
 
getMap() - Method in class org.opensextant.annotations.Record
 
getMatchId() - Method in class org.opensextant.extraction.TextMatch
Future planning -- match_id may become private field in future API.
getMentionIDs() - Method in class org.opensextant.data.social.Tweet
fully qualified Twitter user profiles: screen_name : user ID pairings.
getMentions() - Method in class org.opensextant.data.social.Tweet
Found user screen_names, no user ID
getMethod() - Method in interface org.opensextant.data.Geocoding
 
getMethod() - Method in class org.opensextant.data.Place
The method by which the geolocation was determined; GAZ, COUNTRY, etc.
getMethod() - Method in class org.opensextant.extractors.xcoord.GeocoordMatch
Returns the exact pattern that matched.
getName() - Method in class org.opensextant.data.GeoBase
 
getName() - Method in class org.opensextant.data.Language
 
getName() - Method in interface org.opensextant.extraction.Extractor
 
getName() - Method in class org.opensextant.extractors.poli.PatternsOfLife
Extractor interface: getName
getName() - Method in class org.opensextant.extractors.xcoord.XCoord
Extractor interface: getName
getName() - Method in class org.opensextant.extractors.xtemporal.XTemporal
Extractor interface: getName.
getName_bias() - Method in class org.opensextant.data.Place
The name bias is a measure of the a priori likelihood that a mention of this place's name actually refers to a place.
getName_type() - Method in class org.opensextant.data.Place
 
getNameCode() - Method in class org.opensextant.data.Language
 
getNamenorm() - Method in class org.opensextant.data.Country
Return name normalized, e.g., lowercase, w/out diacritics.
getNamenorm() - Method in class org.opensextant.data.GeoBase
 
getNDNamenorm() - Method in class org.opensextant.data.Place
Returns a pre-computed Non-diacritic name
getNearByPlaces() - Method in class org.opensextant.extractors.xcoord.GeocoordMatch
Gets the near by places.
getOffsets() - Method in class org.opensextant.annotations.Annotation
 
getOtherInterpretations() - Method in class org.opensextant.extractors.xcoord.GeocoordMatch
Gets the other interpretations.
getOutputFilepath() - Method in class org.opensextant.output.AbstractFormatter
 
getOutputFilepath() - Method in class org.opensextant.output.AbstractGenericFormatter
Gets the output filepath.
getOutputFilepath() - Method in interface org.opensextant.output.ResultsFormatter
Get the path to the output file.
getOutputFormats() - Method in class org.opensextant.processing.Parameters
 
getOutputStream(String, String) - Static method in class org.opensextant.util.FileUtility
Caller is responsible for write flush, close, etc.
getOutputStream(String, String, boolean) - Static method in class org.opensextant.util.FileUtility
Caller is responsible for write flush, close, etc.
getOutputType() - Method in class org.opensextant.output.AbstractFormatter
 
getOutputType() - Method in class org.opensextant.output.AbstractGenericFormatter
Gets the output type.
getOutputType() - Method in interface org.opensextant.output.ResultsFormatter
Get the type of output produced by this formatter.
getParent(File) - Static method in class org.opensextant.util.FileUtility
 
getPatternFamily(int) - Static method in class org.opensextant.extractors.xtemporal.XTConstants
 
getPatternFamily(String) - Static method in class org.opensextant.extractors.xtemporal.XTConstants
 
getPatternManager() - Method in class org.opensextant.extractors.flexpat.AbstractFlexPat
 
getPlaceID() - Method in interface org.opensextant.data.Geocoding
 
getPlaceID() - Method in class org.opensextant.data.Place
 
getPlaceID() - Method in class org.opensextant.extractors.xcoord.GeocoordMatch
Gets the place ID.
getPlaceName() - Method in interface org.opensextant.data.Geocoding
 
getPlaceName() - Method in class org.opensextant.data.Place
 
getPlaceName() - Method in class org.opensextant.extractors.xcoord.GeocoordMatch
Gets the place name.
getPlacePostalCode() - Method in interface org.opensextant.data.Geocoding
City-level postal code, that may be something like a zip.
getPlacePostalCode() - Method in class org.opensextant.data.Place
City-level postal code, that may be something like a zip.
getPlacePostalCode() - Method in class org.opensextant.extractors.xcoord.GeocoordMatch
City-level postal code, that may be something like a zip.
getPopulation() - Method in class org.opensextant.data.Place
 
getPrecision() - Method in interface org.opensextant.data.Geocoding
Precision - radius in meters of possible error
getPrecision() - Method in class org.opensextant.data.Place
Get the relative precision of this feature; in meters of error
getPrecision() - Method in class org.opensextant.extractors.xcoord.GeocoordMatch
Gets the precision.
getPrimaryLanguage() - Method in class org.opensextant.data.Country
Certain island nations, areas, and territories that have ISO country codes may not have a language.
getProvince(String, String) - Method in class org.opensextant.util.GeonamesUtility
Lookup by coded path, CC.ADM1.
getProvinceMetadata() - Method in class org.opensextant.util.GeonamesUtility
Alias for getWorldAdmin1Metadata.
getRegions() - Method in class org.opensextant.data.Country
 
getRelatedPlace() - Method in class org.opensextant.extractors.xcoord.GeocoordMatch
Gets the related place.
getSafeDir(File, String, int) - Static method in class org.opensextant.util.FileUtility
Get a directory that does not conflict with an existing directory.
getSafeFile(File, String, int) - Static method in class org.opensextant.util.FileUtility
 
getSchema() - Method in class org.opensextant.output.GISDataModel
Gets the schema.
getSource() - Method in class org.opensextant.data.Place
Get the original source of this information.
getStrictMode() - Static method in class org.opensextant.extractors.xcoord.XCoord
 
getSupportedFormats() - Static method in class org.opensextant.output.FormatterFactory
 
getTags() - Method in class org.opensextant.data.social.Tweet
 
getTerritories() - Method in class org.opensextant.data.Country
List the territories for this country.
getText() - Method in class org.opensextant.data.DocInput
 
getText() - Method in class org.opensextant.data.social.Message
get the message text.
getText() - Method in class org.opensextant.extraction.TextEntity
 
getTextNatural() - Method in class org.opensextant.data.social.Message
Get the natural language version of the raw text.
getTextnorm() - Method in class org.opensextant.extraction.TextMatch
Get a normalized version of the text, lower case, punctuation and diacritics removed.
getTextpath() - Method in class org.opensextant.data.DocInput
get the optional text version of the document;
getTotalTime() - Method in class org.opensextant.extraction.ExtractionMetrics
 
getType() - Method in class org.opensextant.extraction.TextMatch
 
getTZDatabase() - Method in class org.opensextant.data.Country
Return the full list of TZ.
getURLs() - Method in class org.opensextant.data.social.Tweet
 
getUSStateMetadata() - Method in class org.opensextant.util.GeonamesUtility
Provides access to a array of ADM1 metadata.
getValidFilename(String) - Static method in class org.opensextant.util.FileUtility
On occasion file path may contain unicode chars, however as the is encoded, it may not be decodable by OS/FS.
getValue() - Method in class org.opensextant.extractors.xcoord.DMSOrdinate
Get the cartesian value for this ordinate
getWorldAdmin1Metadata() - Method in class org.opensextant.util.GeonamesUtility
Get the array of Place objects representing ADM1 level boundaries.
GIS_MIMETYPE - Static variable in class org.opensextant.util.FileUtility
 
GISDataFormatter - Class in org.opensextant.output
This is the base class for classes that convert document annotations to GISCore features.
GISDataFormatter() - Constructor for class org.opensextant.output.GISDataFormatter
 
gisDataModel - Variable in class org.opensextant.output.GISDataFormatter
 
GISDataModel - Class in org.opensextant.output
 
GISDataModel(String, boolean, boolean) - Constructor for class org.opensextant.output.GISDataModel
Instantiates a new GIS data model.
GISDataModel(String, boolean, boolean, boolean) - Constructor for class org.opensextant.output.GISDataModel
Instantiates a new GIS data model.
gridzone() - Method in class org.opensextant.extractors.xcoord.GeocoordMatch
Identifies the 100KM quad in which this point is contained.
group_map(RegexPattern, Matcher) - Method in class org.opensextant.extractors.flexpat.RegexPatternManager
NOTE: We're dealing with Java6's inability to use named groups.
group_matches(RegexPattern, Matcher) - Method in class org.opensextant.extractors.flexpat.RegexPatternManager
Matched fields as TextEntities
groupByDocument - Variable in class org.opensextant.output.GISDataFormatter
 
guessLanguage(String) - Method in class org.opensextant.extractors.langid.LangDetect
Routine to guess the language ID Scrub data prior to guessing language.

H

has_degrees - Variable in class org.opensextant.extractors.xcoord.DMSOrdinate
 
has_hemi - Variable in class org.opensextant.extractors.xcoord.DMSOrdinate
 
has_minutes - Variable in class org.opensextant.extractors.xcoord.DMSOrdinate
 
has_seconds - Variable in class org.opensextant.extractors.xcoord.DMSOrdinate
 
hasCachedAnnotation(String, String) - Method in class org.opensextant.annotations.AnnotationHelper
Checks for cached annotation.
hasCJKText(String) - Static method in class org.opensextant.util.TextUtils
A simple test to see if text has any CJK characters at all.
hasCoordinate() - Method in class org.opensextant.data.GeoBase
Convenience method for checking if lat/lon was set to other than 0,0 (default)
hasCoordinate() - Method in interface org.opensextant.data.Geocoding
has Coordinate: if this named place object has a coordinate.
hasCoordinate() - Method in class org.opensextant.extractors.xcoord.GeocoordMatch
Checks for coordinate.
hasDegrees() - Method in class org.opensextant.extractors.xcoord.DMSOrdinate
 
hasDiacritics(String) - Static method in class org.opensextant.util.TextUtils
If a string has extended latin diacritics.
hasDigits(String) - Static method in class org.opensextant.util.TextUtils
 
hasHemisphere() - Method in class org.opensextant.extractors.xcoord.DMSOrdinate
 
hashtagPattern1 - Static variable in class org.opensextant.util.TextUtils
Find any pattern "ABC#[ABC 123]" -- a hashtag with whitespace.
hashtagPattern2 - Static variable in class org.opensextant.util.TextUtils
Find any pattern "#ABC123" -- normal hashtag, Java Regex note: UNICODE flags are important, otherwise "\w" and other classes match only ASCII.
hasIrregularPunctuation(String) - Static method in class org.opensextant.util.TextUtils
Simple triage of punctuation.
hasLength() - Method in class org.opensextant.annotations.Annotation
 
hasMiddleEasternText(String) - Static method in class org.opensextant.util.TextUtils
Detects the first Arabic or Hewbrew character for now -- will be more comprehensive in scoping "Middle Eastern" scripts in text.
hasMinutes() - Method in class org.opensextant.extractors.xcoord.DMSOrdinate
 
hasMinutes() - Method in class org.opensextant.extractors.xcoord.GeocoordMatch
Checks for minutes.
hasOtherIterpretations() - Method in class org.opensextant.extractors.xcoord.GeocoordMatch
Checks for other interpretations.
hasSeconds() - Method in class org.opensextant.extractors.xcoord.DMSOrdinate
 
hasSeconds() - Method in class org.opensextant.extractors.xcoord.GeocoordMatch
Checks for seconds.
hasSubDegrees() - Method in class org.opensextant.extractors.xcoord.DMSOrdinate
 
hasSubMinutes() - Method in class org.opensextant.extractors.xcoord.DMSOrdinate
 
hasSubSeconds() - Method in class org.opensextant.extractors.xcoord.DMSOrdinate
 
hasSymbols() - Method in class org.opensextant.extractors.xcoord.DMSOrdinate
 
hasTags() - Method in class org.opensextant.data.Taxon
 
hasTerritories() - Method in class org.opensextant.data.Country
 
hasUniqueName() - Method in class org.opensextant.data.Country
 
HEMI_MAP - Static variable in class org.opensextant.extractors.xcoord.DMSOrdinate
 
hemiLatFields - Static variable in class org.opensextant.extractors.xcoord.DMSOrdinate
 
hemiLonFields - Static variable in class org.opensextant.extractors.xcoord.DMSOrdinate
 
hemisphere - Variable in class org.opensextant.extractors.xcoord.DMSOrdinate
 
Hemisphere - Class in org.opensextant.extractors.xcoord
Represent a Hemisphere symbol and value.
Hemisphere() - Constructor for class org.opensextant.extractors.xcoord.Hemisphere
 
HOUR - Enum constant in enum org.opensextant.extractors.xtemporal.DateMatch.TimeResolution
 

I

id - Variable in class org.opensextant.annotations.DeepEyeData
A base class to keep all data organized
id - Variable in class org.opensextant.data.social.Message
Message ID
id - Variable in class org.opensextant.data.Taxon
Nod OID: 1.2.3
id - Variable in class org.opensextant.data.TextInput
 
id - Variable in class org.opensextant.extractors.flexpat.PatternTestCase
 
id - Variable in class org.opensextant.extractors.flexpat.RegexPattern
 
IGNORE_SEQ - Variable in class org.opensextant.extractors.xcoord.MGRSFilter
The ignore seq.
IMAGE_MIMETYPE - Static variable in class org.opensextant.util.FileUtility
 
includeCoordinate - Variable in class org.opensextant.output.AbstractFormatter
GIS formats may optionally include coordinates as fields.
includeCoordinate - Variable in class org.opensextant.output.GISDataModel
 
includeOffsets - Variable in class org.opensextant.output.AbstractFormatter
Schema-specific stuff.
includeOffsets - Variable in class org.opensextant.output.GISDataModel
 
initialize(InputStream) - Method in class org.opensextant.extractors.flexpat.RegexPatternManager
Initializes the pattern manager implementations.
initialize(InputStream) - Method in class org.opensextant.extractors.xcoord.PatternManager
 
initialize(InputStream) - Method in class org.opensextant.extractors.xtemporal.PatternManager
 
initLangId() - Method in class org.opensextant.extractors.langid.LangDetect
Taken straight from LangDetect example NOTE: /langdetect/profiles must be a folder on disk, although I have a variation that could work with JAR resources.
initLanguageData() - Static method in class org.opensextant.util.TextUtils
Initialize language codes and metadata.
initLOCLanguageData() - Static method in class org.opensextant.util.TextUtils
This is Libray of Congress data for language IDs.
input - Variable in class org.opensextant.extraction.ExtractionResult
 
inputFile - Variable in class org.opensextant.processing.Parameters
 
insertGeo(JsonObject, Geocoding, boolean) - Static method in class org.opensextant.data.social.TweetUtility
TOOD: validate available data for this schema.
insertMention(JsonArray, String, String) - Static method in class org.opensextant.data.social.TweetUtility
Serialize a user screen name / ID pairing., e.g.
insertMention(JsonArray, Tweet.Mention) - Static method in class org.opensextant.data.social.TweetUtility
Variation on insertMention(a, name, uid)
interpretations - Variable in class org.opensextant.extractors.xcoord.GeocoordMatch
The interpretations.
INVALID_DATE - Static variable in class org.opensextant.extractors.xtemporal.DateNormalization
 
INVALID_DAY - Static variable in class org.opensextant.extractors.xtemporal.DateNormalization
 
INVALID_FCNAME_CHAR - Static variable in class org.opensextant.processing.Parameters
 
is_duplicate - Variable in class org.opensextant.extraction.TextEntity
If this entity is a duplicate of some other
is_overlap - Variable in class org.opensextant.extraction.TextEntity
If this entity is a overlaps with some other
is_submatch - Variable in class org.opensextant.extraction.TextEntity
If this entity is contained completely within some other
isAbbreviation() - Method in class org.opensextant.data.Place
 
isAbbreviation(char) - Static method in class org.opensextant.util.GeonamesUtility
Check if name type is an Abbreviation
isAbbreviation(String) - Static method in class org.opensextant.util.TextUtils
 
isAbbreviation(String, boolean) - Static method in class org.opensextant.util.TextUtils
Define what an acronym is: A.B.
isAbbreviation(Place) - Static method in class org.opensextant.util.GeonamesUtility
Wrapper for isAbbreviation(name type)
isAcronym - Variable in class org.opensextant.data.Taxon
Acronyms are assumed to be ALL UPPER CASE; granted this does not always apply.
isAdmin1() - Method in class org.opensextant.data.Place
Is this Place a State or Province?
isAdmin1(String) - Static method in class org.opensextant.util.GeonamesUtility
Is this Place a State or Province?
isAdmin1(Place) - Static method in class org.opensextant.util.GeonamesUtility
 
isAdmin2(String) - Static method in class org.opensextant.util.GeonamesUtility
 
isAdministrative() - Method in interface org.opensextant.data.Geocoding
 
isAdministrative() - Method in class org.opensextant.data.Place
 
isAdministrative() - Method in class org.opensextant.extractors.xcoord.GeocoordMatch
Checks if is administrative.
isAdministrative(String) - Static method in class org.opensextant.util.GeonamesUtility
if a place or feature represents an administrative boundary.
isAdministrative(String, String) - Static method in class org.opensextant.util.GeonamesUtility
Administrative feat class + code test.
isAfter(TextEntity) - Method in class org.opensextant.extraction.TextEntity
Assuming simple whitespace separation or other simple delimiters, is this term following the argument entity?
isAlpha() - Method in class org.opensextant.extractors.xcoord.Hemisphere
Checks if is alpha.
isArchiveFile(String) - Static method in class org.opensextant.util.FileUtility
Check if a file is an archive
isArchiveFileType(String) - Static method in class org.opensextant.util.FileUtility
Allow checking of a file extention; NO prefix "."
isASCII - Variable in class org.opensextant.data.social.Message
True if text is purely ASCII
isASCII() - Method in class org.opensextant.extraction.TextEntity
If non-punctuation content is purely ASCII vs.
isASCII(byte[]) - Static method in class org.opensextant.util.TextUtils
 
isASCII(char) - Static method in class org.opensextant.util.TextUtils
 
isASCII(String) - Static method in class org.opensextant.util.TextUtils
Early exit test -- return false on first non-ASCII character found.
isASCIILetter(char) - Static method in class org.opensextant.util.TextUtils
 
isASCIIName() - Method in class org.opensextant.data.Place
 
isAudio(String) - Static method in class org.opensextant.util.FileUtility
Checks file extension of given filepath to see if the format is a known audio type.
isBalanced() - Method in class org.opensextant.extractors.xcoord.GeocoordMatch
Checks if is balanced.
isBefore(TextEntity) - Method in class org.opensextant.extraction.TextEntity
Assuming simple whitespace separation or other simple delimiters, is this term preceeding the argument entity?
isChinese(Character.UnicodeBlock) - Static method in class org.opensextant.util.TextUtils
 
isChinese(String) - Static method in class org.opensextant.util.TextUtils
Utility method to check if lang ID is Chinese(Traditional or Simplified)...
isCJK(Character.UnicodeBlock) - Static method in class org.opensextant.util.TextUtils
 
isCJK(String) - Static method in class org.opensextant.util.TextUtils
Utility method to check if lang ID is Chinese, Korean, or Japanese
isCode() - Method in class org.opensextant.data.Place
 
isCode(char) - Static method in class org.opensextant.util.GeonamesUtility
 
isCoord(double, double) - Static method in class org.opensextant.util.GeodeticUtility
 
isCoord(Geocoding) - Static method in class org.opensextant.util.GeodeticUtility
 
isCoord(LatLon) - Static method in class org.opensextant.util.GeodeticUtility
 
isCoordinate() - Method in interface org.opensextant.data.Geocoding
isCoordinate: if this object represents a coordinate
isCoordinate() - Method in class org.opensextant.data.Place
 
isCoordinate() - Method in class org.opensextant.extractors.xcoord.GeocoordMatch
Note the coordinate nature of this TextMatch/Geocoding takes precedence over other flags isPlace, isCountry, etc.
isCountry() - Method in interface org.opensextant.data.Geocoding
 
isCountry() - Method in class org.opensextant.data.Place
Is this Place a Country?
isCountry() - Method in class org.opensextant.extractors.xcoord.GeocoordMatch
Checks if is country.
isCountry(String) - Static method in class org.opensextant.util.GeonamesUtility
Is this Place a Country?
isCountry(Place) - Static method in class org.opensextant.util.GeonamesUtility
Wrapper for isCountry(feat code)
isCountryNameCollision(String) - Static method in class org.opensextant.util.GeonamesUtility
Experimental.
isDatetime(TextMatch) - Static method in class org.opensextant.processing.ResultsUtility
Testers for TextMatch: isDatetime macro.
isdefault - Variable in class org.opensextant.processing.Parameters
You the caller must explicitly set isdefault = false; forcing you to actually look at these parameters.
isDefault() - Method in class org.opensextant.extraction.TextMatch
Users of this class should set a non-default type via setType(String), otherwise the match remains default and generic.
isDistantPast - Variable in class org.opensextant.extractors.xtemporal.DateMatch
Flag caller can use to classify if a date match is distant
isDistantPast(long) - Method in class org.opensextant.extractors.xtemporal.XTemporal
Checks if is distant past.
isDistantPast(Date) - Method in class org.opensextant.extractors.xtemporal.XTemporal
Checks if is distant past.
isDistantPastYMD(Date) - Method in class org.opensextant.extractors.xtemporal.XTemporal
if a date is too far in past to likley be a date of the format YYYY-MM-DD.
isDone() - Method in interface org.opensextant.data.social.JSONListener
If listener isDone, then caller should exit
isDST - Variable in class org.opensextant.data.social.Message
isDST = is Daylight Savings Time = true if we think the message timestamp is in a period of DST recognized by the country of origin, or UTC offset.
isEnglish - Variable in class org.opensextant.data.social.Message
True if processing of text determines tweet is nearly all English.
isEnglish(String) - Static method in class org.opensextant.util.TextUtils
Utility method to check if lang ID is English...
isEuroLanguage(String) - Static method in class org.opensextant.util.TextUtils
European languages = Romance + GER + ENG Extend definition as needed.
isFilteredOut() - Method in class org.opensextant.extraction.TextMatch
 
isFuture - Variable in class org.opensextant.extractors.xtemporal.DateMatch
Flag caller can use to classify if date is future relative to a given date, by default TODAY
isFuture(long) - Method in class org.opensextant.extractors.xtemporal.XTemporal
Given the set MAX_DATE_CUTOFF_YEAR, determine if the date epoch is earlier than this.
isFuture(Date) - Method in class org.opensextant.extractors.xtemporal.XTemporal
Checks if is future.
isGeoinferenced() - Method in class org.opensextant.data.social.Tweet
indicate if tweet has any Geo resolution at all, wether it was a given Lat/Lon or derived.
isGeolocated() - Method in class org.opensextant.data.social.Tweet
indicate if tweet is geolocated by an accurate coordinate.
isImage(String) - Static method in class org.opensextant.util.FileUtility
Using Commons getExtension(), determine if the filename represents an image media type.
isJapanese(Character.UnicodeBlock) - Static method in class org.opensextant.util.TextUtils
Checks if char block is uniquely Japanese.
isJSONGzip(String) - Static method in class org.opensextant.util.FileUtility
Tell if the file is JSON/Gzip
isKorean(Character.UnicodeBlock) - Static method in class org.opensextant.util.TextUtils
Likely to be uniquely Korean if the character block is in Hangul.
isLand(String) - Static method in class org.opensextant.util.GeonamesUtility
 
isLatin(String) - Static method in class org.opensextant.util.TextUtils
Checks if non-ASCII and non-LATIN characters are present.
isLeftMatch(TextEntity) - Method in class org.opensextant.extraction.TextEntity
 
isLocation(TextMatch) - Static method in class org.opensextant.processing.ResultsUtility
Testers for TextMatch: isLocation macro.
isLower - Variable in class org.opensextant.data.TextInput
writable flags that represent some basic assement of your input text.
isLower() - Method in class org.opensextant.extraction.TextEntity
test If text (that has a case sense) is ALL lower case
isLower(String) - Static method in class org.opensextant.util.TextUtils
 
isLowerCaseDocument(int[]) - Static method in class org.opensextant.util.TextUtils
This measures the amount of upper case See Upper Case.
isMixedCase() - Method in class org.opensextant.extraction.TextEntity
test if text is mixed case.
isName() - Method in class org.opensextant.data.Place
test if Place is a "Name" -- not a code/abbrev or other.
isName(char) - Static method in class org.opensextant.util.GeonamesUtility
Check if name type is an Abbreviation
isNationalCapital() - Method in class org.opensextant.data.Place
Is this Place a National Capital?
isNationalCapital(String) - Static method in class org.opensextant.util.GeonamesUtility
Is this Place a National Capital?
isNationalCapital(Place) - Static method in class org.opensextant.util.GeonamesUtility
wrapper for isNationalCaptial( feat code )
isNumeric(String) - Static method in class org.opensextant.util.TextUtils
Determine if a string is numeric in nature, not necessarily a parsable number.
ISO_COUNTRY - Static variable in class org.opensextant.output.OpenSextantSchema
ISO Country code
iso_date_formatter - Static variable in class org.opensextant.data.social.Tweet
 
isOverlap(TextEntity) - Method in class org.opensextant.extraction.TextEntity
 
isPlace() - Method in interface org.opensextant.data.Geocoding
 
isPlace() - Method in class org.opensextant.data.Place
 
isPlace() - Method in class org.opensextant.extractors.xcoord.GeocoordMatch
Checks if is place.
isPlainText(String) - Static method in class org.opensextant.util.FileUtility
Test is a path or file extension ends with .txt NPE if null is passed in.
isPoliticalEntity(String) - Static method in class org.opensextant.util.GeonamesUtility
Test if a feature is a political entity ~ country, territory, sovereign land
isPoliticalEntity(Place) - Static method in class org.opensextant.util.GeonamesUtility
Test is Place feature is coded as PCL* (PCL, PCLIX, PCLH, PCLD, PCLF, PCLS, etc)
isPopulated() - Method in class org.opensextant.data.Place
if feature class for this location is 'P' for populated place.
isPopulated(String) - Static method in class org.opensextant.util.GeonamesUtility
 
isPostal() - Method in class org.opensextant.data.Place
Determines if this feature instance is a postal zone, coded "A/POST"
isPostal(String) - Static method in class org.opensextant.util.GeonamesUtility
 
isPostal(Place) - Static method in class org.opensextant.util.GeonamesUtility
 
isPrimaryLanguage(String) - Method in class org.opensextant.data.Country
 
isPrimaryLanguage(String, String) - Method in class org.opensextant.util.GeonamesUtility
If lang is primary lang.
isRightMatch(TextEntity) - Method in class org.opensextant.extraction.TextEntity
 
isRomanceLanguage(String) - Static method in class org.opensextant.util.TextUtils
Romance languages = SPA + POR + ITA + FRA + ROM Extend definition as needed.
isSame(String) - Method in class org.opensextant.extraction.TextMatch
Case-insensitive comparison to another string
isSame(Place) - Method in class org.opensextant.data.Place
 
isSameMatch(TextEntity) - Method in class org.opensextant.extraction.TextEntity
 
isSameNorm(TextMatch) - Method in class org.opensextant.extraction.TextMatch
Compare the normalized string for this match to that of another.
isShortName() - Method in class org.opensextant.data.Place
Alias for "isAbbreviation() || isCode()"
isSpoken(String) - Method in class org.opensextant.data.Country
 
isSpot() - Method in class org.opensextant.data.Place
 
isSpot(String) - Static method in class org.opensextant.util.GeonamesUtility
 
isSpreadsheet(String) - Static method in class org.opensextant.util.FileUtility
Simple check if a file is typed as a Spreadsheet Tab-delimited .txt files or .dat files may be valid spreadsheets, however this method does not look inside files.
isSupported(String) - Static method in class org.opensextant.output.FormatterFactory
Check if this is a known format
isTerritory - Variable in class org.opensextant.data.Country
 
isUpper - Variable in class org.opensextant.data.TextInput
 
isUpper() - Method in class org.opensextant.extraction.TextEntity
test If text (that has a case sense) is ALL upper case
isUpper(String) - Static method in class org.opensextant.util.TextUtils
For measuring the upper-case-ness of short texts.
isUpperAdmin() - Method in class org.opensextant.data.Place
macro for detecting ADM1 or ADM2
isUpperAdminLevel(String) - Static method in class org.opensextant.util.GeonamesUtility
Macro for reasoning with upper common levels of boundaries - province, districts.
isUpperCaseDocument(int[]) - Static method in class org.opensextant.util.TextUtils
First measureCase(Text) to acquire counts, then call this routine for a heuristic that suggests the text is mainly upper case.
isUppercaseName() - Method in class org.opensextant.data.Place
 
isValidEastingNorthing(String, boolean) - Static method in class org.opensextant.extractors.xcoord.MGRSParser
A hueuristic from looking at real data, real text artifacts - typos, line endings, whitespace wrapping, etc.
isValidNonZeroCoordinate(double, double) - Static method in class org.opensextant.util.GeodeticUtility
A common check required by practical applications -- 0,0 is not interesting, so this is a simple java-based check.
isValue(String) - Static method in class org.opensextant.data.social.Tweet
 
isValue(String) - Static method in class org.opensextant.data.social.TweetUtility
 
isValue(String) - Static method in class org.opensextant.util.GeonamesUtility
 
isValue(Collection<?>) - Method in class org.opensextant.annotations.DeepEyeData
 
isValue(List<?>) - Static method in class org.opensextant.data.social.Tweet
 
isValue(Map<?, ?>) - Method in class org.opensextant.annotations.DeepEyeData
 
isValue(JsonObject) - Static method in class org.opensextant.data.social.Tweet
 
isVideo(String) - Static method in class org.opensextant.util.FileUtility
Checks file extension of given filepath to see if the format is a known video type.
isWebURL(String) - Static method in class org.opensextant.util.FileUtility
Check if path or URL is a webpage.
isWindowsSystem() - Static method in class org.opensextant.util.FileUtility
A way of determining OS Beware, OS X has Darwin in its full OS name.
isWithin(TextEntity) - Method in class org.opensextant.extraction.TextEntity
 
isWithinChars(TextEntity, int) - Method in class org.opensextant.extraction.TextEntity
Proximity test between this text span and another This is A; B is input.
isZero() - Method in class org.opensextant.extractors.xcoord.GeocoordMatch
Convenience method for determining if XY = 0,0.
isZeroCoord(double, double) - Static method in class org.opensextant.util.GeodeticUtility
Just tests if location is not 0,0 ...
italianLang - Static variable in class org.opensextant.util.TextUtils
 

J

japaneseLang - Static variable in class org.opensextant.util.TextUtils
 
JAVA_0_DATE_YEAR - Static variable in class org.opensextant.extractors.xtemporal.XTemporal
The Constant JAVA_0_DATE_YEAR.
JSONListener - Interface in org.opensextant.data.social
 

K

key - Variable in class org.opensextant.data.GeoBase
identifier ID
KMLFormatter - Class in org.opensextant.output
A formatter for KML output that uses GISCore API.
KMLFormatter() - Constructor for class org.opensextant.output.KMLFormatter
 
KNOWN_NAME_COLLISIONS - Static variable in class org.opensextant.util.GeonamesUtility
Experimental.
koreanLang - Static variable in class org.opensextant.util.TextUtils
 

L

label - Variable in class org.opensextant.data.Country.TZ
 
lang - Variable in class org.opensextant.data.social.Message
Language of the message; not usually given unless inferred by someone or something.
LangDetect - Class in org.opensextant.extractors.langid
Wrapper around cybozu labs langdetect.
LangDetect() - Constructor for class org.opensextant.extractors.langid.LangDetect
Default use requires you unpack LangDetect profiles here: /langdetect-profiles
LangDetect(int) - Constructor for class org.opensextant.extractors.langid.LangDetect
If you anticipate working with short text - queries, tweets, excerpts, etc.
LangDetect(int, String) - Constructor for class org.opensextant.extractors.langid.LangDetect
 
LangDetect(String) - Constructor for class org.opensextant.extractors.langid.LangDetect
 
langid - Variable in class org.opensextant.data.TextInput
 
langid - Variable in class org.opensextant.extractors.langid.LangID
 
LangID - Class in org.opensextant.extractors.langid
 
LangID(String, double, boolean) - Constructor for class org.opensextant.extractors.langid.LangID
 
Language - Class in org.opensextant.data
Simple mapping of ISO 639 id to display name for languages
Language(String, String) - Constructor for class org.opensextant.data.Language
 
Language(String, String, String) - Constructor for class org.opensextant.data.Language
A normalize view of a Language - ISO 639-2 and -1 codes and display name codes are lower cased.
LANGUAGE_ID_GROUP_CJK - Static variable in class org.opensextant.extractors.langid.LangDetect
 
LANGUAGE_ID_GROUP_ENGLISH - Static variable in class org.opensextant.extractors.langid.LangDetect
 
LANGUAGE_ID_GROUP_UNKNOWN - Static variable in class org.opensextant.extractors.langid.LangDetect
 
languagesInCountry(String) - Method in class org.opensextant.util.GeonamesUtility
 
LAT - Static variable in class org.opensextant.output.OpenSextantSchema
 
LAT_DEGREE_PRECISION - Static variable in class org.opensextant.extractors.xcoord.PrecisionScales
 
LAT_MAX - Static variable in class org.opensextant.extractors.xcoord.DMSOrdinate
 
LAT_MAX - Static variable in class org.opensextant.util.GeodeticUtility
 
lat_text - Variable in class org.opensextant.extractors.xcoord.GeocoordMatch
The lat text.
latitude - Variable in class org.opensextant.data.GeoBase
 
LatLon - Interface in org.opensextant.data
 
level - Variable in enum org.opensextant.extractors.xtemporal.DateMatch.TimeResolution
 
list(JsonArray) - Static method in class org.opensextant.annotations.DeepEyeData
utility -- get list from jsonarray.
list(JsonObject) - Static method in class org.opensextant.annotations.DeepEyeData
 
loadCountryLanguages() - Method in class org.opensextant.util.GeonamesUtility
Parse metadata from geonames.org (file in CLASSPATH @ /geonames.org/countryInfo.txt) and populate existing Country objects with language metadata.
loadDict(InputStream, boolean) - Static method in class org.opensextant.util.FileUtility
The do all method.
loadDictionary(File, boolean) - Static method in class org.opensextant.util.FileUtility
Load a word list from a file path.
loadDictionary(String, boolean) - Static method in class org.opensextant.util.FileUtility
A generic word list loader.
loadDictionary(URL, boolean) - Static method in class org.opensextant.util.FileUtility
A generic word list loader.
loadMajorCities(InputStream) - Static method in class org.opensextant.util.GeonamesUtility
Load the Geonames.org majorcities data file.
loadMajorCities(String) - Static method in class org.opensextant.util.GeonamesUtility
Geonames.org data set: citiesN.txt
loadUSStateMetadata() - Method in class org.opensextant.util.GeonamesUtility
TODO: This is mildly informed by geonames.org, however even there we are still missing a mapping between ADM1 FIPS/ISO codes for a state and the Postal codes/abbreviations.
loadWorldAdmin1Metadata() - Method in class org.opensextant.util.GeonamesUtility
Source: geonames.org ADM1 codes/names in anglo/ASCII form.
LOCATION_FLD - Static variable in class org.opensextant.data.social.Tweet
 
locationCC - Variable in class org.opensextant.data.social.Message
Location Country Code if inferred or given.
log - Variable in class org.opensextant.extractors.flexpat.AbstractFlexPat
 
log - Variable in class org.opensextant.extractors.flexpat.RegexPatternManager
 
log - Variable in class org.opensextant.output.AbstractFormatter
 
log - Variable in class org.opensextant.output.AbstractGenericFormatter
 
log - Variable in class org.opensextant.output.GISDataModel
 
log - Variable in class org.opensextant.processing.XtractorGroup
API: child implementations should recreate their own logger.
LON - Static variable in class org.opensextant.output.OpenSextantSchema
 
LON_MAX - Static variable in class org.opensextant.extractors.xcoord.DMSOrdinate
 
LON_MAX - Static variable in class org.opensextant.util.GeodeticUtility
 
lon_text - Variable in class org.opensextant.extractors.xcoord.GeocoordMatch
The lon text.
longitude - Variable in class org.opensextant.data.GeoBase
 
LOWER_CASE - Static variable in class org.opensextant.extractors.poli.PoliMatch
 

M

MACAddress - Class in org.opensextant.extractors.poli.data
 
MACAddress() - Constructor for class org.opensextant.extractors.poli.data.MACAddress
 
makeDirectory(File) - Static method in class org.opensextant.util.FileUtility
Utility for making dirs
makeDirectory(String) - Static method in class org.opensextant.util.FileUtility
Utility for making dirs
map(JsonObject) - Static method in class org.opensextant.annotations.DeepEyeData
 
mapMajorCityIDs(List<Place>) - Static method in class org.opensextant.util.GeonamesUtility
Convenience: prepare a map for lookup by ID.
mapPopulationByLocation(List<Place>) - Static method in class org.opensextant.util.GeonamesUtility
See mapPopulationByLocation(list, int).
mapPopulationByLocation(List<Place>, int) - Static method in class org.opensextant.util.GeonamesUtility
This organizes population data by geohash.
match - Variable in class org.opensextant.extractors.flexpat.PatternTestCase
 
match_class - Variable in class org.opensextant.extractors.flexpat.RegexPattern
 
match_classname - Variable in class org.opensextant.extractors.flexpat.RegexPattern
Name of a Java class that extends TextMatch and implements user's normalization routines.
match_DateTime(boolean) - Method in class org.opensextant.extractors.xtemporal.XTemporal
enable date time patterns
match_DayMonYear(boolean) - Method in class org.opensextant.extractors.xtemporal.XTemporal
enable day mon year.
match_DD(boolean) - Method in class org.opensextant.extractors.xcoord.XCoord
Enable matching of DD patterns
match_DM(boolean) - Method in class org.opensextant.extractors.xcoord.XCoord
Enable matching of DM patterns
match_DMS(boolean) - Method in class org.opensextant.extractors.xcoord.XCoord
Enable matching of DMS patterns
match_groups - Variable in class org.opensextant.extractors.poli.PoliMatch
 
match_id - Variable in class org.opensextant.extraction.TextEntity
 
MATCH_METHOD - Static variable in class org.opensextant.output.OpenSextantSchema
The method used to match the data in MATCH TEXT
match_MGRS(boolean) - Method in class org.opensextant.extractors.xcoord.XCoord
Enable matching of MGRS patterns
match_MonDayYear(boolean) - Method in class org.opensextant.extractors.xtemporal.XTemporal
enable mon day year patterns.
MATCH_TEXT - Static variable in class org.opensextant.output.OpenSextantSchema
Match Text captures the raw text matched by the tagger.
match_UTM(boolean) - Method in class org.opensextant.extractors.xcoord.XCoord
Enable matching of UTM patterns
match_width - Variable in class org.opensextant.extractors.flexpat.AbstractFlexPat
CHARS.
MatcherUtils - Class in org.opensextant.extraction
 
MatcherUtils() - Constructor for class org.opensextant.extraction.MatcherUtils
 
matches - Variable in class org.opensextant.extraction.ExtractionResult
 
matches - Variable in class org.opensextant.extractors.flexpat.TextMatchResult
 
MatchFilter - Class in org.opensextant.extraction
The Class MatchFilter.
MatchFilter() - Constructor for class org.opensextant.extraction.MatchFilter
free-form filter
MatchFilter(String) - Constructor for class org.opensextant.extraction.MatchFilter
Instantiates a new match filter.
MatchFilter(URL) - Constructor for class org.opensextant.extraction.MatchFilter
 
MatchInterpreter - Interface in org.opensextant.output
 
MatchSchema - Interface in org.opensextant.data
Improving control over Xponents schema fields and common, constant values.
MAX_ERROR_COUNT - Static variable in class org.opensextant.data.social.TweetLoader
The internals of parsing a JSON file of tweets.
MAX_YEARS_AGO - Static variable in class org.opensextant.extractors.xcoord.MGRSFilter
The max years ago.
md5_id(byte[]) - Static method in class org.opensextant.util.TextUtils
Deprecated.
not MD5 specific. Use #b2hex() instead
MDY_FAMILY - Static variable in class org.opensextant.extractors.xtemporal.XTConstants
 
measureCase(String) - Static method in class org.opensextant.util.TextUtils
Measure character count, upper, lower, non-Character, whitespace
measureCJKText(String) - Static method in class org.opensextant.util.TextUtils
Returns a ratio of Chinese/Japanese/Korean characters: CJK chars / ALL TODO: needs testing; not sure if this is sustainable if block; or if it is comprehensive.
Mention() - Constructor for class org.opensextant.data.social.Tweet.Mention
 
mentionAuthorID - Variable in class org.opensextant.data.social.Tweet.Mention
 
mentionAuthorProfileID - Variable in class org.opensextant.data.social.Tweet.Mention
 
mentionIDs - Variable in class org.opensextant.data.social.Tweet
 
mentions - Variable in class org.opensextant.data.social.Tweet
 
message - Variable in class org.opensextant.extractors.flexpat.TextMatchResult
 
Message - Class in org.opensextant.data.social
Micro Message is the basis for any sort microblog message, SMS, or tweet or other social media post.
Message() - Constructor for class org.opensextant.data.social.Message
Trivial constructor.
Message(String, String, Date) - Constructor for class org.opensextant.data.social.Message
A simple message with an ID, text and a timestamp.
MESSAGE_MIMETYPE - Static variable in class org.opensextant.util.FileUtility
 
MessageParseException - Exception in org.opensextant.data.social
 
MessageParseException(String) - Constructor for exception org.opensextant.data.social.MessageParseException
 
MessageParseException(String, Exception) - Constructor for exception org.opensextant.data.social.MessageParseException
 
method - Variable in class org.opensextant.data.Place
 
MGRS_FILTERS_ON - Static variable in class org.opensextant.extractors.xcoord.XConstants
RUNTIME FLAGS: filter MGRS coordinates -- date patterns and NUM PER NUM patterns e.g., the ratio "4 per 4000" is not MGRS
MGRS_offset_precision_list - Static variable in class org.opensextant.extractors.xcoord.PrecisionScales
 
MGRS_PATTERN - Static variable in class org.opensextant.extractors.xcoord.XConstants
 
MGRS_precision_list - Static variable in class org.opensextant.extractors.xcoord.PrecisionScales
Precision appears in pairs, as we tolerate some typo/errors in MGRs.
MGRS_STRICT_ON - Static variable in class org.opensextant.extractors.xcoord.XConstants
Strict MGRS parsing does not allow typos or mismatched Northing/Easting and possibly other parsing problems.
MGRS2DEC_digits - Static variable in class org.opensextant.extractors.xcoord.PrecisionScales
 
MGRSFilter - Class in org.opensextant.extractors.xcoord
MGRS Filters include ignoring these patterns: 1234 123456 12345678 1234567890 Recent calendar dates of the form ddMMMyyyy, "14DEC1990" (MGRS: 14D EC 19 90 Recent calendar dates with time, ddMMHHmm, "14DEC1200" Noon on 14DEC.
MGRSFilter() - Constructor for class org.opensextant.extractors.xcoord.MGRSFilter
Instantiates a new MGRS filter.
MGRSParser - Class in org.opensextant.extractors.xcoord
 
MGRSParser() - Constructor for class org.opensextant.extractors.xcoord.MGRSParser
 
MIN - Enum constant in enum org.opensextant.extractors.xcoord.DMSOrdinate.Resolution
 
MIN_LANG_DETECT_PROBABILITY - Static variable in class org.opensextant.extractors.langid.LangDetect
 
MIN_LENGTH_UNK_TEXT_THRESHOLD - Static variable in class org.opensextant.extractors.langid.LangDetect
A simple threshold for demarcating when we might infer simple language ID with minimal content.
MINUTE - Enum constant in enum org.opensextant.extractors.xtemporal.DateMatch.TimeResolution
 
Money - Class in org.opensextant.extractors.poli.data
 
Money() - Constructor for class org.opensextant.extractors.poli.data.Money
 
Money(String) - Constructor for class org.opensextant.extractors.poli.data.Money
 
Money(Map<String, String>, String) - Constructor for class org.opensextant.extractors.poli.data.Money
 
MONTH - Enum constant in enum org.opensextant.extractors.xtemporal.DateMatch.TimeResolution
 

N

name - Variable in class org.opensextant.annotations.Annotation
 
name - Variable in class org.opensextant.data.GeoBase
any name associated with this place
name - Variable in class org.opensextant.data.Taxon
Node name: citrus.tropical
name_type - Variable in class org.opensextant.data.Place
 
NAME_TYPE - Static variable in class org.opensextant.util.GeonamesUtility
 
NEG_HEMI - Static variable in class org.opensextant.extractors.xcoord.DMSOrdinate
 
NEGATIVE - Static variable in class org.opensextant.extractors.xcoord.DMSOrdinate
 
newAttributes() - Method in class org.opensextant.annotations.DeepEyeData
 
NL - Static variable in class org.opensextant.util.TextUtils
 
NO_DAY - Static variable in class org.opensextant.extractors.xtemporal.DateNormalization
 
NO_DOC_ID - Static variable in interface org.opensextant.extraction.Extractor
optional constant - a universal doc ID holder
NO_HEMISPHERE - Static variable in class org.opensextant.extractors.xcoord.DMSOrdinate
 
NO_HEMISPHERE_VALUE - Static variable in class org.opensextant.extractors.xcoord.DMSOrdinate
 
NO_MONTH - Static variable in class org.opensextant.extractors.xtemporal.DateNormalization
 
NO_YEAR - Static variable in class org.opensextant.extractors.xtemporal.DateNormalization
 
NONE - Enum constant in enum org.opensextant.extractors.xtemporal.DateMatch.TimeResolution
 
normal_case - Variable in class org.opensextant.extractors.poli.PoliMatch
 
NormalizationException - Exception in org.opensextant.extraction
 
NormalizationException(String) - Constructor for exception org.opensextant.extraction.NormalizationException
 
NormalizationException(String, Exception) - Constructor for exception org.opensextant.extraction.NormalizationException
 
normalize() - Method in class org.opensextant.extractors.poli.data.EmailAddress
 
normalize() - Method in class org.opensextant.extractors.poli.data.MACAddress
 
normalize() - Method in class org.opensextant.extractors.poli.data.Money
 
normalize() - Method in class org.opensextant.extractors.poli.data.TelephoneNumber
 
normalize() - Method in class org.opensextant.extractors.poli.PoliMatch
Trim whitespace and convert found text to alpha case configured for this object.
normalize_coordinate(GeocoordMatch, Map<String, TextEntity>) - Static method in class org.opensextant.extractors.xcoord.GeocoordNormalization
The match object is normalized, setting the coord_text and other data from parsing "text" and knowing which pattern family was matched.
normalize_day(Map<String, String>) - Static method in class org.opensextant.extractors.xtemporal.DateNormalization
Normalize_day.
normalize_month(Map<String, String>) - Static method in class org.opensextant.extractors.xtemporal.DateNormalization
Normalize_month.
normalize_month_name(Map<String, String>) - Static method in class org.opensextant.extractors.xtemporal.DateNormalization
Normalize_month_name.
normalize_tz(Map<String, String>) - Static method in class org.opensextant.extractors.xtemporal.DateNormalization
Z or Zulu is not always recognized as UTC / GMT+0000.
normalize_year(Map<String, String>) - Static method in class org.opensextant.extractors.xtemporal.DateNormalization
Normalize_year.
normalizeAbbreviation(String) - Static method in class org.opensextant.util.TextUtils
Intended only as a filter for punctuation within a word.
normalizeAdminCode(String) - Static method in class org.opensextant.util.GeonamesUtility
Convert and ADM1 or ADM2 id to a normalized form.
normalizeCountryName(String) - Static method in class org.opensextant.util.GeonamesUtility
This may help revert to a more readable country name, e.g., if you are given upper case name and you want some version of it as a proper name But no need to use this if you have good reference data.
normalizeDate(Map<String, String>, DateMatch) - Static method in class org.opensextant.extractors.xtemporal.DateNormalization
For now this reports only DATE and standard TIME fields.
normalizeFilenameChar(char) - Static method in class org.opensextant.util.FileUtility
Tests for valid filename chars for simple normalization A-Z, a-z, _-, 0-9,
normalizeTextEntity(String) - Static method in class org.opensextant.util.TextUtils
Normalization: Clean the ends, Remove Line-endings from middle of entity.
normalizeTime(Map<String, String>, String) - Static method in class org.opensextant.extractors.xtemporal.DateNormalization
Given a field hh, mm, or ss, get field from map and normalize/validate the value.
normalizeUnicode(String) - Static method in class org.opensextant.util.TextUtils
Normalize to "Normalization Form Canonical Decomposition" (NFD) REF: http: //stackoverflow.com/questions/3610013/file-listfiles-mangles-unicode- names-with-jdk-6-unicode-normalization-issues This supports proper file name retrieval from file system, among other things.
NORTH - Static variable in class org.opensextant.extractors.xcoord.DMSOrdinate
 
NOT_AVAILABLE - Static variable in class org.opensextant.util.FileUtility
 
notes - Variable in class org.opensextant.annotations.Record
Notes are any text messages you wish to attach to a record DeepEye is not responsible for how such a buffer is maintained.
NUM_SEP - Static variable in class org.opensextant.annotations.AnnotationHelper
The Constant NUM_SEP.

O

offset - Variable in class org.opensextant.annotations.Annotation
optional offset
OFFSET_HALF - Static variable in class org.opensextant.data.social.Message
 
offsetDeg - Variable in class org.opensextant.extractors.xcoord.DMSOrdinate
Where does the Degree value begin in text?
offsetHemi - Variable in class org.opensextant.extractors.xcoord.DMSOrdinate
 
offsetOrdinate - Variable in class org.opensextant.extractors.xcoord.DMSOrdinate
 
OFFSETS_FLD - Static variable in class org.opensextant.annotations.Annotation
 
offsetSeparator - Variable in class org.opensextant.extractors.xcoord.GeocoordMatch
The offset separator.
ONE_YEAR_MS - Static variable in class org.opensextant.extractors.xtemporal.XTemporal
The Constant ONE_YEAR_MS.
OpenSextantSchema - Class in org.opensextant.output
GISCore-based output schema.
OpenSextantSchema() - Constructor for class org.opensextant.output.OpenSextantSchema
 
optString(JsonObject, String) - Static method in class org.opensextant.data.social.Tweet
"", null, or "null" checking.
org.opensextant - package org.opensextant
 
org.opensextant.annotations - package org.opensextant.annotations
DeepEye is an approach for simplifying typical NLP annotation exchanges.
org.opensextant.data - package org.opensextant.data
Xponents Data Model
org.opensextant.data.social - package org.opensextant.data.social
Social Media Representation
org.opensextant.extraction - package org.opensextant.extraction
Extraction Fundamentals
org.opensextant.extractors.flexpat - package org.opensextant.extractors.flexpat
FlexPat -- A Pattern Definition & Testing Library
org.opensextant.extractors.langid - package org.opensextant.extractors.langid
Language Detection
org.opensextant.extractors.poli - package org.opensextant.extractors.poli
Patterns Of Life (PoLi)
org.opensextant.extractors.poli.data - package org.opensextant.extractors.poli.data
PoLi Patterns: Cyber, Telelphony, Currency examples
org.opensextant.extractors.xcoord - package org.opensextant.extractors.xcoord
XCoord: Geographic Coordinate Extraction
org.opensextant.extractors.xtemporal - package org.opensextant.extractors.xtemporal
XTemporal: Date and Time Pattern Extraction
org.opensextant.output - package org.opensextant.output
Xponents Output Formatting using GISCore
org.opensextant.processing - package org.opensextant.processing
Processing Basics: Parameters, Results Handlers, Pipelining
org.opensextant.util - package org.opensextant.util
Utilities for Extraction
os - Variable in class org.opensextant.output.GISDataFormatter
 
output_coordinate_duplicates - Variable in class org.opensextant.processing.Parameters
Is the concept of duplicate filtering more general than for just coords?
output_filtered - Variable in class org.opensextant.processing.Parameters
By default Country Names will not be included in GIS products They should appear in CSV, though.
output_geohash - Variable in class org.opensextant.processing.Parameters
Default is to not generate Geohash
outputDir - Variable in class org.opensextant.processing.Parameters
 
outputExtension - Variable in class org.opensextant.output.AbstractFormatter
File extension for callers to know.
outputExtension - Variable in class org.opensextant.output.AbstractGenericFormatter
File extension for callers to know.
outputFile - Variable in class org.opensextant.processing.Parameters
 
outputParams - Variable in class org.opensextant.output.AbstractFormatter
 
outputParams - Variable in class org.opensextant.output.AbstractGenericFormatter
The output params.
outputType - Variable in class org.opensextant.output.AbstractFormatter
 
outputType - Variable in class org.opensextant.output.AbstractGenericFormatter
reflected by extension; an enum in OpenSextant
overwrite - Variable in class org.opensextant.output.AbstractFormatter
 
overwrite - Variable in class org.opensextant.output.AbstractGenericFormatter
The overwrite.
ownsTerritory(String) - Method in class org.opensextant.data.Country
Territory ownership is defined only by the data fed to this API; We do not make any political statements here.

P

Parameters - Class in org.opensextant.processing
 
Parameters() - Constructor for class org.opensextant.processing.Parameters
 
parseDate(String) - Static method in class org.opensextant.util.TextUtils
A limited-scope date parsing: Parse properly formatted strings for example, ISO date/time strings stored in one of our Solr indices.
parseDate(Map<?, ?>) - Method in class org.opensextant.data.social.Tweet
 
parseHashTags(String) - Static method in class org.opensextant.util.TextUtils
Parse the typical Twitter hashtag variants.
parseHashTags(String, boolean) - Static method in class org.opensextant.util.TextUtils
Takes a string and returns all the hashtags in it.
parseId(String) - Static method in class org.opensextant.data.social.Tweet
Most commonly needed to parse TweetID from a GnipID
parseIds(Map<?, ?>) - Method in class org.opensextant.data.social.Tweet
Find a best ID from many possible places where ID, id, id_str, etc, reside.
parseInt(String) - Static method in class org.opensextant.extractors.xcoord.MGRSParser
 
parseLanguage(String) - Method in class org.opensextant.data.social.Tweet
 
parseLatLon(Object, Object) - Static method in class org.opensextant.util.GeodeticUtility
Parse coordinate from object
parseLatLon(String) - Static method in class org.opensextant.util.GeodeticUtility
The most simplistic parsing and validation of "lat lon" or "lat, lon" any amount of whitespace is allowed, provided the lat lon order is there.
parseMentions(String) - Static method in class org.opensextant.data.social.Tweet
From a tweet, get list of "@id"
parseMentions(JsonObject) - Method in class org.opensextant.data.social.Tweet
Gets a entities.user_mentions from a normal Tweet.
parseMGRS(String, String, Map<String, String>) - Static method in class org.opensextant.extractors.xcoord.MGRSParser
Given the match parse MGRS as best as can be done.
parseNaturalLanguage(String) - Static method in class org.opensextant.util.TextUtils
see default implementation below
parseNaturalLanguage(String, boolean, boolean, boolean, boolean) - Static method in class org.opensextant.util.TextUtils
Given tweet text or any [social media] text remove entities or other markers: - URLs are removed - entities are stripped of "@" - hashtags are stripped of "#" - HTML: & is converted to an ampersand - HTML: escaped angle brackets are replaced with { and } for gt and lt, respectively - HTML: remaining special chars are converted back to unicode; remaining ampersand is replaced with "+" Whitespaces (space, newlines, tabs, etc.) are reduced.
parser_rule - Variable in class org.opensextant.extractors.flexpat.RegexPattern
 
parseText(Map<?, ?>) - Method in class org.opensextant.data.social.Tweet
 
parseURLs(List<?>) - Method in class org.opensextant.data.social.Tweet
supports gnip.urls or topsy.urls fields
parseUTM(String, Map<String, String>) - Static method in class org.opensextant.extractors.xcoord.UTMParser
TODO: Needs lots of false positive filtering mainly using context.
pass - Variable in class org.opensextant.extractors.flexpat.TextMatchResult
 
pass(GeocoordMatch) - Method in class org.opensextant.extractors.xcoord.DMSFilter
pass a match
pass(GeocoordMatch) - Method in interface org.opensextant.extractors.xcoord.GeocoordMatchFilter
pass a match
pass(GeocoordMatch) - Method in class org.opensextant.extractors.xcoord.MGRSFilter
pass a match.
pattern_family_state - Variable in class org.opensextant.extractors.xtemporal.PatternManager
 
pattern_id - Variable in class org.opensextant.extraction.TextMatch
the ID of the pattern that extracted this
patternFields - Variable in class org.opensextant.extractors.xtemporal.DateMatch
fields in pattern match
patternFile - Variable in class org.opensextant.extractors.flexpat.RegexPatternManager
 
PatternManager - Class in org.opensextant.extractors.xcoord
This is the culmination of various coordinate extraction efforts in python and Java.
PatternManager - Class in org.opensextant.extractors.xtemporal
 
PatternManager(InputStream, String) - Constructor for class org.opensextant.extractors.xcoord.PatternManager
 
PatternManager(InputStream, String) - Constructor for class org.opensextant.extractors.xtemporal.PatternManager
Pass in InputStream to provide yourself the most flexibility.
patterns - Variable in class org.opensextant.extractors.flexpat.AbstractFlexPat
 
patterns - Variable in class org.opensextant.extractors.flexpat.RegexPatternManager
 
patterns_file - Variable in class org.opensextant.extractors.flexpat.AbstractFlexPat
 
patterns_list - Variable in class org.opensextant.extractors.flexpat.RegexPatternManager
 
PatternsOfLife - Class in org.opensextant.extractors.poli
 
PatternsOfLife() - Constructor for class org.opensextant.extractors.poli.PatternsOfLife
Default constructor, debugging off.
PatternsOfLife(boolean) - Constructor for class org.opensextant.extractors.poli.PatternsOfLife
 
PatternTestCase - Class in org.opensextant.extractors.flexpat
 
PatternTestCase(String, String, String) - Constructor for class org.opensextant.extractors.flexpat.PatternTestCase
 
phoneticReduction(String) - Static method in class org.opensextant.util.TextUtils
Create a non-diacritic, ASCII version of the input string.
phoneticReduction(String, boolean) - Static method in class org.opensextant.util.TextUtils
 
PKG - Static variable in class org.opensextant.output.FormatterFactory
 
Place - Class in org.opensextant.data
Place class represents all the metadata about a location.
Place() - Constructor for class org.opensextant.data.Place
 
Place(double, double) - Constructor for class org.opensextant.data.Place
 
Place(String, String) - Constructor for class org.opensextant.data.Place
Creates a new instance of GeoBase
PLACE_NAME - Static variable in class org.opensextant.output.OpenSextantSchema
the name in the Gazetteer entry; which aligns with the MATCH TEXT
polarity - Variable in class org.opensextant.extractors.xcoord.Hemisphere
Hemisphere polarity - N, E = +1; S, W = -1.
PoliMatch - Class in org.opensextant.extractors.poli
 
PoliMatch() - Constructor for class org.opensextant.extractors.poli.PoliMatch
No-argument constructor is needed because class in invoked through introspection (via name of class)
PoliMatch(int, int) - Constructor for class org.opensextant.extractors.poli.PoliMatch
 
PoliMatch(String) - Constructor for class org.opensextant.extractors.poli.PoliMatch
 
PoliMatch(Map<String, String>, String) - Constructor for class org.opensextant.extractors.poli.PoliMatch
 
PoliPatternManager - Class in org.opensextant.extractors.poli
 
PoliPatternManager(InputStream, String) - Constructor for class org.opensextant.extractors.poli.PoliPatternManager
 
portugueseLang - Static variable in class org.opensextant.util.TextUtils
 
POS_HEMI - Static variable in class org.opensextant.extractors.xcoord.DMSOrdinate
 
POSITIVE - Static variable in class org.opensextant.extractors.xcoord.DMSOrdinate
 
postChar - Variable in class org.opensextant.extraction.TextEntity
char immediately after span
preChar - Variable in class org.opensextant.extraction.TextEntity
char immediately before span
precision - Variable in class org.opensextant.extractors.xcoord.GeocoordMatch
inherent precision of the coordinate matched.
precision - Variable in class org.opensextant.extractors.xcoord.GeocoordPrecision
+/- # of Meters of error
PRECISION - Static variable in class org.opensextant.output.OpenSextantSchema
Number of meters of error in coordinate of location.
PrecisionScales - Class in org.opensextant.extractors.xcoord
 
PrecisionScales() - Constructor for class org.opensextant.extractors.xcoord.PrecisionScales
 
preferJSON() - Method in interface org.opensextant.data.social.JSONListener
implementation should advertise if it prefers JSON or String.
preferredGeography - Variable in class org.opensextant.processing.Parameters
A way of relaying arbitrary geographic filters to an extraction routine indicating that useful answers for disambiguation for tie-breakers come from these cues.
primary - Variable in class org.opensextant.extractors.langid.LangID
 
primaryLangID(String) - Method in class org.opensextant.util.GeonamesUtility
When lang ID will do.
primaryLanguage(String) - Method in class org.opensextant.util.GeonamesUtility
Primary language for a given country.
probability - Variable in class org.opensextant.extractors.langid.LangID
 
procdate - Variable in class org.opensextant.annotations.Record
a processing date/time key that has as much resolution as you need This is a string because the lexical sort is likely easier to manage than using actual date/time field with date/time math.
process(TextInput) - Method in class org.opensextant.processing.XtractorGroup
Process one input.
processAndFormat(TextInput) - Method in class org.opensextant.processing.XtractorGroup
Processes input content against all extractors and all formatters This does not throw exceptions, as some processing may fail, while others succeed.
ProcessingException - Exception in org.opensextant.processing
 
ProcessingException(Exception) - Constructor for exception org.opensextant.processing.ProcessingException
 
ProcessingException(String) - Constructor for exception org.opensextant.processing.ProcessingException
 
ProcessingException(String, Exception) - Constructor for exception org.opensextant.processing.ProcessingException
 
producer - Variable in class org.opensextant.extraction.TextMatch
A short label or tag representing the matcher, extractor, tagger, etc.
PROVINCE - Static variable in class org.opensextant.output.OpenSextantSchema
 

R

rawbytes - Variable in class org.opensextant.data.social.Message
Character count of the message.
rawOffset - Variable in class org.opensextant.data.Country.TZ
 
readFile(File) - Static method in class org.opensextant.util.FileUtility
 
readFile(File, String) - Static method in class org.opensextant.util.FileUtility
Slurps a text file into a string and returns the string.
readFile(String) - Static method in class org.opensextant.util.FileUtility
 
readGzipFile(String) - Static method in class org.opensextant.util.FileUtility
 
readJSONByLine(File, JSONListener) - Static method in class org.opensextant.data.social.TweetLoader
To read gzip/JSON files one row of JSON at a time.
readObject(String) - Method in interface org.opensextant.data.social.JSONListener
API method to allow implementation to read string, e.g., TW4J factory uses strictly String args in a JSON context.
readObject(JsonObject) - Method in interface org.opensextant.data.social.JSONListener
 
rec_id - Variable in class org.opensextant.annotations.Annotation
 
Record - Class in org.opensextant.annotations
A record is a representation of the raw original.
Record() - Constructor for class org.opensextant.annotations.Record
 
Record(String, String) - Constructor for class org.opensextant.annotations.Record
 
recordFile - Variable in class org.opensextant.extraction.ExtractionResult
Original file for record
recordID - Variable in class org.opensextant.extraction.ExtractionResult
short ID or name of file
recordTextFile - Variable in class org.opensextant.extraction.ExtractionResult
Text version of file used for processing
reduce_line_breaks(String) - Static method in class org.opensextant.util.TextUtils
Replaces all 3 or more blank lines with a single paragraph break (\n\n)
reduceMatches(List<TextMatch>) - Static method in class org.opensextant.extraction.MatcherUtils
Reduce actual valid matches by identifying duplicates or sub-matches.
regex - Variable in class org.opensextant.extractors.flexpat.RegexPattern
 
regex_groups - Variable in class org.opensextant.extractors.flexpat.RegexPattern
 
RegexPattern - Class in org.opensextant.extractors.flexpat
 
RegexPattern(String, String, String) - Constructor for class org.opensextant.extractors.flexpat.RegexPattern
Metadata for a Regular Expression pattern
RegexPatternManager - Class in org.opensextant.extractors.flexpat
This is the culmination of various date/time extraction efforts in python and Java.
RegexPatternManager(InputStream, String) - Constructor for class org.opensextant.extractors.flexpat.RegexPatternManager
 
remarks - Variable in class org.opensextant.extractors.flexpat.PatternTestCase
 
removeAny(String, String) - Static method in class org.opensextant.util.TextUtils
Remove instances of any char in the remove string from buf
removeAnyLeft(String, String) - Static method in class org.opensextant.util.TextUtils
compare to trim( string, chars ), but you can trim any chars Example: - a b c remove "-" from string above.
removeDiacritics(String) - Static method in class org.opensextant.util.TextUtils
Supports Phoneticizer utility from OpenSextant v1.x Remove diacritics from a phrase
removeDirectory(File) - Static method in class org.opensextant.util.FileUtility
Java oddity - recursive removal of a directory
removeEmoticons(String) - Static method in class org.opensextant.util.TextUtils
replace Emoticons with something less nefarious -- UTF-16 characters do not play well with some I/O routines.
removeField(String) - Method in class org.opensextant.output.AbstractGenericFormatter
Removes the field.
removeField(String) - Method in class org.opensextant.output.CSVFormatter
 
removeField(String) - Method in class org.opensextant.output.GISDataFormatter
 
removeField(String) - Method in class org.opensextant.output.GISDataModel
Removes the field.
removeField(String) - Method in interface org.opensextant.output.ResultsFormatter
 
removePunctuation(String) - Static method in class org.opensextant.util.TextUtils
Remove any leading and trailing punctuation and some internal punctuation.
removeSymbols(String) - Static method in class org.opensextant.util.TextUtils
Replace symbology
replaceAny(String, String, String) - Static method in class org.opensextant.util.TextUtils
Replace any of the removal chars with the sub.
replaceDiacritics(String) - Static method in class org.opensextant.util.TextUtils
A thorough replacement of diacritics and Unicode chars to their ASCII equivalents.
replaceDiacritics(String) - Static method in class org.opensextant.util.Unimap
A port of opensextant.unicode in Python lib; A rendering of the Solr lucene character filter/map.
replaceDiacriticsOriginal(String) - Static method in class org.opensextant.util.TextUtils
Deprecated.
See replaceDiacritics as the replacement.
reportMemory() - Static method in class org.opensextant.processing.RuntimeTools
Easily digestible version of memory report.
reset() - Method in class org.opensextant.annotations.AnnotationHelper
Reset() clears the internal cache.
reset() - Method in class org.opensextant.processing.XtractorGroup
DRAFT: still figuring out the rules for 'reset' between processing or inputs.
resetOffsets() - Method in class org.opensextant.annotations.Annotation
This is provided mainly for testing.
resolution - Variable in class org.opensextant.extractors.xtemporal.DateMatch
 
resolve_localities - Variable in class org.opensextant.processing.Parameters
Reverse Geo here is accommodated by resolving Provinces and Localities when a geodetic coordinate is encountered in text.
result_id - Variable in class org.opensextant.extractors.flexpat.TextMatchResult
 
ResultsFormatter - Interface in org.opensextant.output
Interface for classes that generate output from corpora that have been processed by OpenSextant.
ResultsUtility - Class in org.opensextant.processing
Abstract class encapsulating basic results formatter functionality.
ResultsUtility() - Constructor for class org.opensextant.processing.ResultsUtility
 
retweet - Variable in class org.opensextant.data.social.Tweet
 
retweetID - Variable in class org.opensextant.data.social.Tweet
 
romanianLang - Static variable in class org.opensextant.util.TextUtils
 
rootid - Variable in class org.opensextant.data.Taxon
Node root: citrus
RUNTIME_FLAGS - Static variable in class org.opensextant.extractors.xcoord.XCoord
Reserved.
RUNTIME_FLAGS - Static variable in class org.opensextant.processing.Parameters
 
RuntimeTools - Class in org.opensextant.processing
 
RuntimeTools() - Constructor for class org.opensextant.processing.RuntimeTools
 
russianLang - Static variable in class org.opensextant.util.TextUtils
 

S

sameBoundary(Place) - Method in class org.opensextant.data.Place
Check if CC.AA coding of the features is the same.
save(Record) - Method in interface org.opensextant.annotations.DeepEyeStore
Suggested Save operation: check if exists, update if it does, otherwise insert.
schema - Variable in class org.opensextant.output.GISDataModel
 
SCHEMA_OID - Static variable in class org.opensextant.output.OpenSextantSchema
 
score - Variable in class org.opensextant.extractors.langid.LangID
On a scale of 0 to 100, where 100 = 100% confident, how would you score this language identity? It is easier to compare scores 55 == 55, vs.
SEC - Enum constant in enum org.opensextant.extractors.xcoord.DMSOrdinate.Resolution
 
SECOND - Enum constant in enum org.opensextant.extractors.xtemporal.DateMatch.TimeResolution
 
separator - Variable in class org.opensextant.extractors.xcoord.GeocoordMatch
The separator.
separators - Variable in class org.opensextant.extractors.xcoord.GeocoordMatch
The separators.
serialVersionUID - Static variable in exception org.opensextant.ConfigException
 
serialVersionUID - Static variable in exception org.opensextant.extraction.NormalizationException
 
serialVersionUID - Static variable in exception org.opensextant.processing.ProcessingException
 
set_match_id(TextMatch, int) - Method in class org.opensextant.extractors.flexpat.AbstractFlexPat
Optional.
set_normalized_text() - Method in class org.opensextant.extractors.xcoord.DMSOrdinate
Get back a normalized version of what you found.
set_precision(GeocoordMatch) - Static method in class org.opensextant.extractors.xcoord.GeocoordNormalization
set the precision on a match based on the situation (match + pattern).
setAdmin1(String) - Method in class org.opensextant.data.Place
 
setAdmin1Name(String) - Method in class org.opensextant.data.Place
 
setAdmin1PostalCode(String) - Method in class org.opensextant.data.Place
 
setAdmin1PostalCode(String) - Method in class org.opensextant.extractors.xcoord.GeocoordMatch
Sets the admin 1 postal code.
setAdmin2(String) - Method in class org.opensextant.data.Place
 
setAdmin2Name(String) - Method in class org.opensextant.data.Place
 
setAdminName(String) - Method in class org.opensextant.data.Place
 
setBalanced(boolean) - Method in class org.opensextant.extractors.xcoord.GeocoordMatch
Allow pattern rules to determine by any means if match is balanced.
setByteCount(long) - Method in class org.opensextant.extraction.ExtractionMetrics
 
setConfidence(int) - Method in interface org.opensextant.data.Geocoding
Set confidence, a value on a 100 point scale, 0-100.
setConfidence(int) - Method in class org.opensextant.data.Place
 
setConfidence(int) - Method in class org.opensextant.extractors.xcoord.GeocoordMatch
Sets the confidence.
setContext(String) - Method in class org.opensextant.extraction.TextEntity
Set the context buffer from a single window
setContext(String, String) - Method in class org.opensextant.extraction.TextEntity
Set the context with before and after windows
setContextFor(String, TextEntity, int) - Static method in class org.opensextant.processing.ResultsUtility
Given the annotation or match, set the context on the TextEntity object.
setCoordinate(DMSOrdinate, DMSOrdinate) - Method in class org.opensextant.extractors.xcoord.GeocoordMatch
Set the ordinates back on the match; general filters are assessed.
setCountry(Country) - Method in interface org.opensextant.data.Geocoding
 
setCountry(Country) - Method in class org.opensextant.data.Place
Set the country object and the local country ID code.
setCountry(Country) - Method in class org.opensextant.extractors.xcoord.GeocoordMatch
Sets the country.
setCountryCode(String) - Method in interface org.opensextant.data.Geocoding
 
setCountryCode(String) - Method in class org.opensextant.data.Place
Compat: set country_id aka CountryCode
setCountryCode(String) - Method in class org.opensextant.extractors.xcoord.GeocoordMatch
Sets the country code.
setDate(long) - Method in class org.opensextant.data.social.Tweet
Set the date and an the standard "CREATED_AT" date/time format.
setDDPrecision(GeocoordMatch) - Static method in class org.opensextant.extractors.xcoord.PrecisionScales
Sets the precision on a decimal degrees match
setDelimiter(String) - Method in class org.opensextant.output.CSVGenericFormatter
 
setDigits(int) - Method in class org.opensextant.extractors.xcoord.GeocoordPrecision
Augment number of digits in precision -- choose the maximum amount if in coord (a,b) if a has more digits of precision than b, use a's precision.
setDistantPastYear(int) - Static method in class org.opensextant.extractors.xtemporal.XTemporal
* Application thresholds -- chosen by the user.
setDMSPrecision(GeocoordMatch) - Static method in class org.opensextant.extractors.xcoord.PrecisionScales
set precision on a DMS text coordinate -- simply if the Match latitude "has seconds" then its precision is seconds, otherwise if it has minutes, then it is precise to +/- 1 minute, etc.
setFeatureClass(String) - Method in class org.opensextant.data.Place
 
setFeatureCode(String) - Method in class org.opensextant.data.Place
 
setFilteredOut(boolean) - Method in class org.opensextant.extraction.TextMatch
 
setGeohash(String) - Method in class org.opensextant.data.GeoBase
Set and get Geohash -- this is delegated to caller as core processing need not have a geohash generated when lat/lon is set.
setGeoMethod(String) - Method in class org.opensextant.data.social.Tweet
 
setGisDataModel() - Method in class org.opensextant.output.GISDataFormatter
Use Default GIS Data Model based on current state of formatter
setGisDataModel(GISDataModel) - Method in class org.opensextant.output.GISDataFormatter
 
setGroups(Map<String, String>) - Method in class org.opensextant.extractors.poli.PoliMatch
 
setHierarchicalPath(String) - Method in class org.opensextant.data.Place
 
setId(String) - Method in class org.opensextant.data.Taxon
 
setId_bias(int) - Method in class org.opensextant.data.Place
 
setInstanceId(String) - Method in class org.opensextant.data.Place
Use to identify a particular related object ID associated with this location.
setJobName(String) - Method in class org.opensextant.processing.Parameters
 
setKey(String) - Method in class org.opensextant.data.GeoBase
legacy nomenclature.
setLanguage(String) - Method in class org.opensextant.data.social.Tweet
 
setLatitude(double) - Method in class org.opensextant.data.GeoBase
 
setLatitude(double) - Method in interface org.opensextant.data.LatLon
 
setLatitude(double) - Method in class org.opensextant.extractors.xcoord.GeocoordMatch
Sets the latitude.
setLatitude(String) - Method in class org.opensextant.extractors.xcoord.GeocoordMatch
Sets the latitude.
setLatLon(LatLon) - Method in class org.opensextant.data.GeoBase
Convenience method
setLatLon(LatLon) - Method in class org.opensextant.extractors.xcoord.GeocoordMatch
If you are given a vetted XY, use that.
setLatLon(Place, JsonObject, boolean) - Static method in class org.opensextant.data.social.Tweet
Order of coordinates is for geo = (LON, LAT) in twitter objects.
setLength(int) - Method in class org.opensextant.annotations.Annotation
 
setLongitude(double) - Method in class org.opensextant.data.GeoBase
 
setLongitude(double) - Method in interface org.opensextant.data.LatLon
 
setLongitude(double) - Method in class org.opensextant.extractors.xcoord.GeocoordMatch
Sets the longitude.
setLongitude(String) - Method in class org.opensextant.extractors.xcoord.GeocoordMatch
Sets the longitude.
setMatchInterpeter(MatchInterpreter) - Method in class org.opensextant.output.AbstractFormatter
Override means for how geocoding is determined per row.
setMatchWidth(int) - Method in class org.opensextant.extractors.flexpat.AbstractFlexPat
Match Width is the text buffer before and after a TextMatch.
setMethod(String) - Method in interface org.opensextant.data.Geocoding
 
setMethod(String) - Method in class org.opensextant.data.Place
 
setMethod(String) - Method in class org.opensextant.extractors.xcoord.GeocoordMatch
This reuses TextMatch.pattern_id attr; Use get/setMethod() or pattern_id as needed.
setMGRSPrecision(GeocoordMatch) - Static method in class org.opensextant.extractors.xcoord.PrecisionScales
 
setName(String) - Method in class org.opensextant.data.GeoBase
 
setName(String) - Method in class org.opensextant.data.Taxon
 
setName_bias(int) - Method in class org.opensextant.data.Place
 
setName_type(char) - Method in class org.opensextant.data.Place
 
setOutputDir(String) - Method in class org.opensextant.output.AbstractFormatter
 
setOutputDir(String) - Method in class org.opensextant.output.AbstractGenericFormatter
Sets the output dir.
setOutputDir(String) - Method in interface org.opensextant.output.ResultsFormatter
Set the path to the output directory.
setOutputFilename(String) - Method in class org.opensextant.output.AbstractFormatter
 
setOutputFilename(String) - Method in class org.opensextant.output.AbstractGenericFormatter
Sets the output filename.
setOutputFilename(String) - Method in interface org.opensextant.output.ResultsFormatter
Set the name of the output file.
setParameters(Parameters) - Method in class org.opensextant.output.AbstractFormatter
Note - output parameters use tag_XYZ flags to indicate date to include or exclude
setParameters(Parameters) - Method in class org.opensextant.output.AbstractGenericFormatter
Sets the parameters.
setParameters(Parameters) - Method in interface org.opensextant.output.ResultsFormatter
A more convenient way of passing in a list of parameters.
setPerson(JsonObject) - Method in class org.opensextant.data.social.Tweet
 
setPlace(Place, JsonObject) - Static method in class org.opensextant.data.social.Tweet
 
setPlaceID(String) - Method in class org.opensextant.data.Place
Wrapper around GeoBase.setKey for compat
setPlaceName(String) - Method in interface org.opensextant.data.Geocoding
 
setPlaceName(String) - Method in class org.opensextant.data.Place
 
setPlaceName(String) - Method in class org.opensextant.extractors.xcoord.GeocoordMatch
Sets the place name.
setPlacePostalCode(String) - Method in class org.opensextant.data.Place
 
setPlacePostalCode(String) - Method in class org.opensextant.extractors.xcoord.GeocoordMatch
Sets the place postal code.
setPopulation(int) - Method in class org.opensextant.data.Place
 
setPrecision(int) - Method in interface org.opensextant.data.Geocoding
Precision - radius in meters of possible error
setPrecision(int) - Method in class org.opensextant.data.Place
Xponents version of precision is number of meters of error, approximately.
setPrecision(int) - Method in class org.opensextant.extractors.xcoord.GeocoordMatch
Sets the precision.
setPrePostContextFor(String, TextEntity, int) - Static method in class org.opensextant.processing.ResultsUtility
Given the TextEntity, set the context on that object.
setRelatedPlace(Place) - Method in class org.opensextant.extractors.xcoord.GeocoordMatch
Sets the related place.
setRelativeOffset(int) - Method in class org.opensextant.extractors.xcoord.GeocoordMatch
Note: this should only be called once.
setRelativeOffsets(int) - Method in class org.opensextant.extractors.xcoord.DMSOrdinate
offsets for degree and hemisphere until this point are likely absolute within a document Reset them using this relative offset.
setRemarks(String) - Method in class org.opensextant.extractors.flexpat.PatternTestCase
Set the test remarks and IFF the word "fail" is in the comment, the test is indicated as a true negative.
setRootid(String) - Method in class org.opensextant.data.Taxon
 
setSeparator(Map<String, TextEntity>) - Method in class org.opensextant.extractors.xcoord.GeocoordMatch
Sets the separator.
setSource(String) - Method in class org.opensextant.data.Place
 
setStatusGeo(JsonObject, Tweet) - Static method in class org.opensextant.data.social.Tweet
TODO: investigate how close a user Profile geo compares with Status geo.
setStrictMode(boolean) - Static method in class org.opensextant.extractors.xcoord.XCoord
 
setTerms(String[]) - Method in class org.opensextant.data.Taxon
 
setText(String) - Method in class org.opensextant.data.social.Message
Set the text, if not null, isASCII and rawbytes length are calculated.
setText(String) - Method in class org.opensextant.extraction.TextEntity
sets the value of the TextEntity
setTextNatural(String) - Method in class org.opensextant.data.social.Message
 
setTextOnly(String) - Method in class org.opensextant.extraction.TextEntity
Set just the value, without incurring the cost of other metrics or flags about the text that likely are unchanged.
setToday(Date) - Method in class org.opensextant.extractors.xtemporal.XTemporal
Optionally reset your context...
setType(String) - Method in class org.opensextant.extraction.TextMatch
Allow matchers and taggers to set a type label, e.g., pattern family or other string.
setUniqueName(boolean) - Method in class org.opensextant.data.Country
 
setUser(JsonObject) - Method in class org.opensextant.data.social.Tweet
 
setUserGeo(JsonObject, Tweet) - Static method in class org.opensextant.data.social.Tweet
 
setUTCOffset(int) - Method in class org.opensextant.data.social.Message
Set UTC and TZ after date is set.
setUTMPrecision(GeocoordMatch) - Static method in class org.opensextant.extractors.xcoord.PrecisionScales
 
setWorkingSize(int) - Method in class org.opensextant.extractors.langid.LangDetect
 
ShapefileFormatter - Class in org.opensextant.output
A results formatter for shapefile output.
ShapefileFormatter() - Constructor for class org.opensextant.output.ShapefileFormatter
 
sort(Map<String, LangID>) - Static method in class org.opensextant.extractors.langid.LangDetect
Sort what was found; Returns LangID by highest score to lowest.
source_id - Variable in class org.opensextant.annotations.Annotation
Source ID here is very much optional Currently it is useful for SQLite usage.
source_id - Variable in class org.opensextant.annotations.Record
Source ID
SOURCE_TWITTER - Static variable in class org.opensextant.data.social.Tweet
 
sourceID - Variable in class org.opensextant.data.social.Message
optional Source ID field
SOUTH - Static variable in class org.opensextant.extractors.xcoord.DMSOrdinate
 
SP - Static variable in class org.opensextant.util.TextUtils
 
spanishLang - Static variable in class org.opensextant.util.TextUtils
 
specificity - Variable in class org.opensextant.extractors.xcoord.DMSOrdinate
Specificity is a field resolution, that is how many fields were specified in the match? Ex.
SPREADSHEET_MIMETYPE - Static variable in class org.opensextant.util.FileUtility
 
squeeze_whitespace(String) - Static method in class org.opensextant.util.TextUtils
Minimize whitespace.
start - Variable in class org.opensextant.extraction.TextEntity
char offset of entity; location in document where entity starts.
start(String) - Method in class org.opensextant.output.AbstractFormatter
 
start(String) - Method in class org.opensextant.output.AbstractGenericFormatter
Start.
start(String) - Method in class org.opensextant.output.CSVFormatter
 
start(String) - Method in class org.opensextant.output.CSVGenericFormatter
 
start(String) - Method in class org.opensextant.output.GISDataFormatter
Start output.
start(String) - Method in interface org.opensextant.output.ResultsFormatter
 
START_CHARS - Static variable in class org.opensextant.extraction.MatcherUtils
 
START_OFFSET - Static variable in class org.opensextant.output.OpenSextantSchema
 
state - Variable in class org.opensextant.annotations.Record
State flags indicate what state of processing the record is in or what processing has been applied to it.
stateMask - Variable in class org.opensextant.annotations.Record
 
statusGeo - Variable in class org.opensextant.data.social.Message
the origination of the message -- Country from which the item was sent.
stop(GeocoordMatch) - Method in class org.opensextant.extractors.xcoord.DMSFilter
stop a match if it is a bad DMS pattern, e.g., date or date/time.
stop(GeocoordMatch) - Method in interface org.opensextant.extractors.xcoord.GeocoordMatchFilter
stop a match
stop(GeocoordMatch) - Method in class org.opensextant.extractors.xcoord.MGRSFilter
TODO: Document rules.
string2list(String, String) - Static method in class org.opensextant.util.TextUtils
Get a list of values into a nice, scrubbed array of values, no whitespace.
SUBDEG - Enum constant in enum org.opensextant.extractors.xcoord.DMSOrdinate.Resolution
 
SUBMIN - Enum constant in enum org.opensextant.extractors.xcoord.DMSOrdinate.Resolution
 
SUBSEC - Enum constant in enum org.opensextant.extractors.xcoord.DMSOrdinate.Resolution
 
symbol - Variable in class org.opensextant.extractors.xcoord.Hemisphere
The symbol.

T

TAB - Static variable in class org.opensextant.util.TextUtils
 
tag_all_taxons - Variable in class org.opensextant.processing.Parameters
"all taxons" is an explicit request to tag all keyphrases AKA taxons
tag_codes - Variable in class org.opensextant.processing.Parameters
Tag/Output codes of placenames -- Exclusive of Coutry or other Placenames
tag_coordinates - Variable in class org.opensextant.processing.Parameters
Tag/Output coordinates
tag_countries - Variable in class org.opensextant.processing.Parameters
Tag/Output countries
tag_lowercase - Variable in class org.opensextant.processing.Parameters
 
tag_names - Variable in class org.opensextant.processing.Parameters
Name taxons are a special grouping of tags, including "person_name" and other entries Entity type is usually `person`.
tag_patterns - Variable in class org.opensextant.processing.Parameters
 
tag_places - Variable in class org.opensextant.processing.Parameters
Tag/Output placenames of any time other than countries
tag_postal - Variable in class org.opensextant.processing.Parameters
Tag/Output postal hierarchy, i.e., "City, Province, Postal code"
tag_taxons - Variable in class org.opensextant.processing.Parameters
"taxons" is a catch all for the non-geo items that come out of geotagger, e.g., negated tags such as the surname "Johnson" -- typically a surname and not a city.
tagFilter - Variable in class org.opensextant.extraction.MatchFilter
The tag filter.
tags - Variable in class org.opensextant.annotations.Record
 
tagset - Variable in class org.opensextant.data.Taxon
 
Taxon - Class in org.opensextant.data
A Taxon is an entry in a taxonomy, which could be as simple as a flat word list or something with lots of structure.
Taxon() - Constructor for class org.opensextant.data.Taxon
 
TelephoneNumber - Class in org.opensextant.extractors.poli.data
 
TelephoneNumber() - Constructor for class org.opensextant.extractors.poli.data.TelephoneNumber
 
TelephoneNumber(String) - Constructor for class org.opensextant.extractors.poli.data.TelephoneNumber
 
TelephoneNumber(Map<String, String>, String) - Constructor for class org.opensextant.extractors.poli.data.TelephoneNumber
 
tempDir - Variable in class org.opensextant.processing.Parameters
 
termset - Variable in class org.opensextant.data.Taxon
Terms: "pineapple", "mango", ...
TestCase - Class in org.opensextant.extractors.poli
 
TestCase - Class in org.opensextant.extractors.xtemporal
 
TestCase(String, String, String) - Constructor for class org.opensextant.extractors.poli.TestCase
 
TestCase(String, String, String) - Constructor for class org.opensextant.extractors.xtemporal.TestCase
 
testcases - Variable in class org.opensextant.extractors.flexpat.RegexPatternManager
 
testing - Variable in class org.opensextant.extractors.flexpat.RegexPatternManager
 
text - Variable in class org.opensextant.data.social.Message
The text of the message.
text - Variable in class org.opensextant.extraction.TextEntity
 
text - Variable in class org.opensextant.extractors.flexpat.PatternTestCase
 
text - Variable in class org.opensextant.extractors.xcoord.DMSOrdinate
 
text_id(String) - Static method in class org.opensextant.util.TextUtils
Static method -- use only if you are sure of thread-safety.
TEXT_WIDTH - Static variable in class org.opensextant.output.AbstractFormatter
 
TEXT_WIDTH - Static variable in class org.opensextant.output.AbstractGenericFormatter
Size of text window around matches -- to use as excerpts
TEXT_WIDTH - Static variable in class org.opensextant.processing.ResultsUtility
The default TEXT WIDTH.
TextEntity - Class in org.opensextant.extraction
A very simple struct to hold data useful for post-processing entities once found.
TextEntity(int, int) - Constructor for class org.opensextant.extraction.TextEntity
Simple Span representation.
TextInput - Class in org.opensextant.data
TextInput is a unit of data -- a tuple that represents the text and its language and an identifier for downstream processing, export formatting, databasing results keyed by text identifier, etc.
TextInput(String, String) - Constructor for class org.opensextant.data.TextInput
A simple input.
TextMatch - Class in org.opensextant.extraction
A variation on TextEntity that also records pattern metadata
TextMatch(int, int) - Constructor for class org.opensextant.extraction.TextMatch
 
TextMatchResult - Class in org.opensextant.extractors.flexpat
This result class holds all the results for a given text block.
TextMatchResult() - Constructor for class org.opensextant.extractors.flexpat.TextMatchResult
 
textNatural - Variable in class org.opensextant.data.social.Message
The natural language version of the text.
textnorm - Variable in class org.opensextant.extractors.poli.PoliMatch
 
TEXTPATH - Static variable in class org.opensextant.output.OpenSextantSchema
 
TextUtils - Class in org.opensextant.util
 
TextUtils() - Constructor for class org.opensextant.util.TextUtils
 
thaiLang - Static variable in class org.opensextant.util.TextUtils
 
timestamp_parser - Static variable in class org.opensextant.data.social.Tweet
 
timezone - Variable in class org.opensextant.data.social.Message
Timezone label of the timestamp, e.g., Europe/London, or just London or "Canada and Atlantic (EST)".
today - Variable in class org.opensextant.extractors.xcoord.MGRSFilter
The today.
TODAY - Variable in class org.opensextant.extractors.xtemporal.XTemporal
Application constants -- note the notion of TODAY is relative to the caller's notion of TODAY.
TODAY_EPOCH - Variable in class org.opensextant.extractors.xtemporal.XTemporal
The today epoch.
toDecimal() - Method in class org.opensextant.extractors.xcoord.DMSOrdinate
toDD() Return the decimal value to the extent it makes sense.
toJSON(Tweet) - Static method in class org.opensextant.data.social.TweetUtility
 
toJSON(Tweet, boolean, boolean) - Static method in class org.opensextant.data.social.TweetUtility
Objective of this is to create a FLAT key/value map for a JSON or a simple HashMap to use.
tokens(String) - Static method in class org.opensextant.util.TextUtils
Return just white-space delmited tokens.
tokensLeft(String) - Static method in class org.opensextant.util.TextUtils
See tokensRight()
tokensRight(String) - Static method in class org.opensextant.util.TextUtils
Return tokens on the right most part of a buffer.
toMGRS() - Method in class org.opensextant.extractors.xcoord.GeocoordMatch
Convert the current coordinate to MGRS.
toString() - Method in class org.opensextant.annotations.Annotation
 
toString() - Method in class org.opensextant.annotations.Record
 
toString() - Method in class org.opensextant.data.Country
 
toString() - Method in class org.opensextant.data.Country.TZ
 
toString() - Method in class org.opensextant.data.GeoBase
Generic label -- anything more sophisticated needs attention E.g.
toString() - Method in class org.opensextant.data.Language
 
toString() - Method in class org.opensextant.data.Place
 
toString() - Method in class org.opensextant.data.social.Tweet
 
toString() - Method in class org.opensextant.data.Taxon
 
toString() - Method in class org.opensextant.extraction.ExtractionMetrics
 
toString() - Method in class org.opensextant.extraction.TextEntity
 
toString() - Method in class org.opensextant.extraction.TextMatch
 
toString() - Method in class org.opensextant.extractors.flexpat.RegexPattern
 
toString() - Method in class org.opensextant.extractors.langid.LangID
 
toString() - Method in enum org.opensextant.extractors.xtemporal.DateMatch.TimeResolution
 
toUTCOffsetHours(int) - Static method in class org.opensextant.data.social.Message
UTC offset as hours.
true_positive - Variable in class org.opensextant.extractors.flexpat.PatternTestCase
 
turkishLang - Static variable in class org.opensextant.util.TextUtils
 
Tweet - Class in org.opensextant.data.social
 
Tweet() - Constructor for class org.opensextant.data.social.Tweet
 
Tweet(String) - Constructor for class org.opensextant.data.social.Tweet
 
Tweet(String, String, Date) - Constructor for class org.opensextant.data.social.Tweet
 
Tweet.Mention - Class in org.opensextant.data.social
 
TweetLoader - Class in org.opensextant.data.social
 
TweetLoader() - Constructor for class org.opensextant.data.social.TweetLoader
 
tweetStore - Variable in class org.opensextant.data.social.Tweet
Is TweetStore the main repository of data?
TweetUtility - Class in org.opensextant.data.social
 
TweetUtility() - Constructor for class org.opensextant.data.social.TweetUtility
 
type - Variable in class org.opensextant.extraction.TextMatch
Type, as in Annotation type or code.
TZ(String, double, double, double) - Constructor for class org.opensextant.data.Country.TZ
 
TZ(String, String, String, String) - Constructor for class org.opensextant.data.Country.TZ
Parse error will be thrown on invalid data.

U

uncompress(byte[]) - Static method in class org.opensextant.util.TextUtils
 
uncompress(byte[], String) - Static method in class org.opensextant.util.TextUtils
 
UNFILTERED - Static variable in class org.opensextant.extractors.xcoord.XConstants
 
Unimap - Class in org.opensextant.util
 
Unimap() - Constructor for class org.opensextant.util.Unimap
 
UNK_Country - Static variable in class org.opensextant.util.GeonamesUtility
 
UNK_PATTERN - Static variable in class org.opensextant.extractors.xcoord.XConstants
 
UNK_PATTERN - Static variable in class org.opensextant.extractors.xtemporal.XTConstants
 
unknownLanguages - Variable in class org.opensextant.util.GeonamesUtility
 
UNSET_UTC_OFFSET - Static variable in class org.opensextant.data.social.Message
 
update(Annotation) - Method in interface org.opensextant.annotations.DeepEyeStore
Update an existing annotation.
update(Record) - Method in interface org.opensextant.annotations.DeepEyeStore
The implementation of a record update
updateState(Record) - Method in interface org.opensextant.annotations.DeepEyeStore
Update only the state of a record
UPPER_CASE - Static variable in class org.opensextant.extractors.poli.PoliMatch
 
useFileHyperlink - Variable in class org.opensextant.output.GISDataFormatter
 
useFileHyperlink - Variable in class org.opensextant.output.GISDataModel
 
userLang - Variable in class org.opensextant.data.social.Message
Language declared by the user.
usesDST - Variable in class org.opensextant.data.Country.TZ
 
utcOffset - Variable in class org.opensextant.data.Country.TZ
 
utcOffset - Variable in class org.opensextant.data.social.Message
UTC offset in seconds (as given by Twitter and other sources).
utcOffsetHours - Variable in class org.opensextant.data.social.Message
utcOffset is in seconds, UTC offset hours is in hours.
UTM_NORTH - Static variable in class org.opensextant.extractors.xcoord.UTMParser
 
UTM_PATTERN - Static variable in class org.opensextant.extractors.xcoord.XConstants
 
UTM_precision(String) - Static method in class org.opensextant.extractors.xcoord.PrecisionScales
For now default UTM precision to +/- 100m
UTM_SOUTH - Static variable in class org.opensextant.extractors.xcoord.UTMParser
 
UTMParser - Class in org.opensextant.extractors.xcoord
 
UTMParser() - Constructor for class org.opensextant.extractors.xcoord.UTMParser
 

V

VAL_COORD - Static variable in interface org.opensextant.data.MatchSchema
Core metadata type
VAL_COUNTRY - Static variable in interface org.opensextant.data.MatchSchema
Core metadata type
VAL_PLACE - Static variable in interface org.opensextant.data.MatchSchema
Core metadata type
VAL_POSTAL - Static variable in interface org.opensextant.data.MatchSchema
Core metadata type
VAL_TAXON - Static variable in interface org.opensextant.data.MatchSchema
Core metadata type
validate_pattern(RegexPattern) - Method in class org.opensextant.extractors.flexpat.RegexPatternManager
Implementation has the option to check a pattern; For now invalid patterns are only logged.
validate_pattern(RegexPattern) - Method in class org.opensextant.extractors.poli.PoliPatternManager
Implementation has the option to check a pattern; For now invalid patterns are only logged.
validate_pattern(RegexPattern) - Method in class org.opensextant.extractors.xcoord.PatternManager
Implementation has the option to check a pattern; For now invalid patterns are only logged.
validate_pattern(RegexPattern) - Method in class org.opensextant.extractors.xtemporal.PatternManager
 
validateCoordinate(double, double) - Static method in class org.opensextant.util.GeodeticUtility
TODO: consider using geodesy, however that API has no obvious simple validator.
validateUTCOffset(int) - Static method in class org.opensextant.data.social.Message
 
validTZ(Tweet) - Static method in class org.opensextant.data.social.Message
detect if Tweet has UTC offset or TZ
value - Variable in class org.opensextant.annotations.DeepEyeData
 
value - Variable in class org.opensextant.extractors.poli.data.Money
 
valueOf(String) - Static method in enum org.opensextant.extractors.xcoord.DMSOrdinate.Resolution
Returns the enum constant of this type with the specified name.
valueOf(String) - Static method in enum org.opensextant.extractors.xtemporal.DateMatch.TimeResolution
Returns the enum constant of this type with the specified name.
values() - Static method in enum org.opensextant.extractors.xcoord.DMSOrdinate.Resolution
Returns an array containing the constants of this enum type, in the order they are declared.
values() - Static method in enum org.opensextant.extractors.xtemporal.DateMatch.TimeResolution
Returns an array containing the constants of this enum type, in the order they are declared.
version - Variable in class org.opensextant.extractors.flexpat.RegexPattern
 
VID_MIMETYPE - Static variable in class org.opensextant.util.FileUtility
 
vietnameseLang - Static variable in class org.opensextant.util.TextUtils
 

W

WEBARCHIVE_MIMETYPE - Static variable in class org.opensextant.util.FileUtility
 
WEBPAGE_MIMETYPE - Static variable in class org.opensextant.util.FileUtility
 
WEEK - Enum constant in enum org.opensextant.extractors.xtemporal.DateMatch.TimeResolution
 
WEST - Static variable in class org.opensextant.extractors.xcoord.DMSOrdinate
 
WKTFormatter - Class in org.opensextant.output
A formatter for WKT output.
WKTFormatter() - Constructor for class org.opensextant.output.WKTFormatter
 
writeFile(String, String) - Static method in class org.opensextant.util.FileUtility
Write file, UTF-8 is default charset here.
writeFile(String, String, String, boolean) - Static method in class org.opensextant.util.FileUtility
 
writeGeocodingResult(ExtractionResult) - Method in class org.opensextant.output.AbstractFormatter
Write your geocoding result directly to output Result should carry ExtractionResult.recordFile as a URI for original.
writeGeocodingResult(ExtractionResult) - Method in class org.opensextant.output.CSVFormatter
 
writeGeocodingResult(ExtractionResult) - Method in class org.opensextant.output.GISDataFormatter
Implementation of adding info extraction/geocoding restults to GIS outputs.
writeGzipFile(String, String) - Static method in class org.opensextant.util.FileUtility
 
writeRow(Map<String, Object>) - Method in class org.opensextant.output.AbstractGenericFormatter
Write the data to the output stream.
writeRow(Map<String, Object>) - Method in class org.opensextant.output.CSVGenericFormatter
Write the data to the output stream.

X

XConstants - Class in org.opensextant.extractors.xcoord
 
XConstants() - Constructor for class org.opensextant.extractors.xcoord.XConstants
 
XCoord - Class in org.opensextant.extractors.xcoord
Use this XCoord class for both test and development of patterns, as well as to extract coordinates at runtime.
XCoord() - Constructor for class org.opensextant.extractors.xcoord.XCoord
Default constructor, debugging off.
XCoord(boolean) - Constructor for class org.opensextant.extractors.xcoord.XCoord
Debugging constructor -- if debugmode = True, enable debugging else if log4j debug mode is enabled, respect that.
XTConstants - Class in org.opensextant.extractors.xtemporal
 
XTConstants() - Constructor for class org.opensextant.extractors.xtemporal.XTConstants
 
XTemporal - Class in org.opensextant.extractors.xtemporal
Date/Time pattern extractor -- detects, parses, normalizes dates.
XTemporal() - Constructor for class org.opensextant.extractors.xtemporal.XTemporal
non-debugging ctor;.
XTemporal(boolean) - Constructor for class org.opensextant.extractors.xtemporal.XTemporal
XTemporal ctor
XtractorGroup - Class in org.opensextant.processing
A Group of Xponent Extractors.
XtractorGroup() - Constructor for class org.opensextant.processing.XtractorGroup
 

Y

YEAR - Enum constant in enum org.opensextant.extractors.xtemporal.DateMatch.TimeResolution
 
A B C D E F G H I J K L M N O P R S T U V W X Y 
All Classes and Interfaces|All Packages|Constant Field Values|Serialized Form