Fork me on GitHub

OOP

This is the Stanford document-sentences-tokens json tree decorated with the results of the OOP analysis of the text.

In general, organic annotations will be aggregated up the tree but synthetic annotations will not be generated down the tree. As an example, sentiment organically occurs at the sentence level, thus you will see an sentiment annotation at the document level containing the average of all the sentence annotations, but you will not see a sentiment annotation at the token level. Categorical annotations that organically occur at the token level (usually expressed as Map<String,BigDecimal>) are generally summed at the sentence and document levels.

The list of annotations is customizable via the ParameterStore input, but the default list is as follows:

OOPBiberAnnotation
Biber: Variation across speech and writing
ParagraphIndexAnnotation
Recorded per sentence, index starts at 0
CoreNlpGenderAnnotation
edu/stanford/nlp/models/gender/male_first_names.txt, edu/stanford/nlp/models/gender/female_first_names.txt
OOPGenderAnnotation
io/outofprintmagazine/nlp/models/MaleNames.txt , io/outofprintmagazine/nlp/models/FemaleNames.txt.
OOPPronounAnnotation
PRP minus -self, -selves, -'s
OOPCharCountAnnotation
characterCount
OOPParagraphCountAnnotation
paragraphCount
OOPSentenceCountAnnotation
sentenceCount
OOPSyllableCountAnnotation
syllableCount
OOPTokenCountAnnotation
tokenCount
OOPWordCountAnnotation
wordCount
CoreNlpSentimentAnnotation>
sentiment scaled from 0 to 1
VaderSentimentAnnotation
sentiment scaled from 0 to 1
OOPVerbTenseAnnotation
Verb tenses
OOPPunctuationMarkAnnotation
Question, Exclamation, Colon, Semicolon, Comma, Hypen, Period, Apostrophe, Quotation
OOPAdjectivesAnnotation
JJ, JJR, JJS not in io/outofprintmagazine/nlp/models/Adjectives.txt
OOPPointlessAdjectivesAnnotation
JJ, JJR, JJS in io/outofprintmagazine/nlp/models/Adjectives.txt
OOPAdjectiveCategoriesAnnotation
Good-Better-Best. JJ, JJR, JJS: descriptive, comparative, superlative
OOPAdverbsAnnotation
RB, RBR, RBS not in io/outofprintmagazine/nlp/models/Adverbs.txt
OOPPointlessAdverbsAnnotation
RB, RBR, RBS in io/outofprintmagazine/nlp/models/Adverbs.txt
OOPAdverbCategoriesAnnotation
Fast-Faster-Fasest. RB, RBR, RBS: descriptive, comparative, superlative
OOPPossessivesAnnotation
PRP$, POS, WP$
OOPPrepositionCategoriesAnnotation
IN, CC. subordinating, coordinating
OOPPrepositionsAnnotation
IN or CC
OOPVerbsAnnotation
Transitive Verbs. VB,VBD,VBG,VBN,VBP,VBZ not in io/outofprintmagazine/nlp/models/StativeVerbs.txt
OOPActionlessVerbsAnnotation
Intransitive Verbs. VB,VBD,VBG,VBN,VBP,VBZ in io/outofprintmagazine/nlp/models/StativeVerbs.txt
OOPNounsAnnotation
NN, NNS
OOPTopicsAnnotation
NNP, NNPS
OOPSVOAnnotation
CoreNLP dependency parse subject-verb-object-indirectObject-auxVerb: nsubj, root, obj, iobj, obl
OOPLikeAnnotation
Constituency Parse: PP, like
OOPAsAnnotation
Constituency Parse: SBAR, as
OOPColorsAnnotation
io/outofprintmagazine/nlp/models/Colors.txt
OOPFlavorsAnnotation
io/outofprintmagazine/nlp/models/Flavors.txt
OOPVerblessSentencesAnnotation
VB,VBD,VBG,VBN,VBP,VBZ,MD not in sentence
OOPWordlessWordsAnnotation
Wiktionary lookup is null
OOPWordnetGlossAnnotation
Wordnet gloss if not in io/outofprintmagazine/nlp/models/COCA/en_20k.txt
OOPUncommonWordsAnnotation
not in io/outofprintmagazine/nlp/models/COCA/en_20k.txt
OOPCommonWordsAnnotation
io/outofprintmagazine/nlp/models/COCA/Dolch.txt
OOPFunctionWordsAnnotation
io/outofprintmagazine/nlp/models/FunctionWords.txt
OOPAngliciseAnnotation
io/outofprintmagazine/nlp/models/EN_GB/american_spellings.json
OOPAmericanizeAnnotation
io/outofprintmagazine/nlp/models/EN_GB/british_spellings.json
OOPVerbGroupsAnnotation
Wordnet lexical file name
OOPVerbnetGroupsAnnotation
Verbnet senses
OOPNounGroupsAnnotation
Wordnet lexical file name
OOPWhoAnnotation
Constituency Parse: SBARQ, WHNP || WHADVP, who
OOPWhatAnnotation
Constituency Parse: SBARQ, WHNP || WHADVP, what
OOPWhenAnnotation
Constituency Parse: SBARQ, WHNP || WHADVP, when
OOPWhereAnnotation
Constituency Parse: SBARQ, WHNP || WHADVP, when
OOPWhyAnnotation
Constituency Parse: SBARQ, WHNP || WHADVP, why
OOPHowAnnotation
Constituency Parse: SBARQ, WHNP || WHADVP, how
OOPLocationsAnnotation
CoreNLP NER mentions: LOCATION
OOPPeopleAnnotation
CoreNLP NER mentions: I, PERSON minus PRP, PRP$, stray NNP/NNPS; coref
OOPMyersBriggsAnnotation
Personality types
OOPDatesAnnotation
CoreNLP NER mentions: DATE or TIME
OOPIfAnnotation
Constituency Parse: SBAR, if
OOPBecauseAnnotation
Constituency Parse: PP, because
OOPQuotesAnnotation
CoreNLP: QuoteAnnotator plus NER speaker
OOPWordsAnnotation
word frequency - all tokens
OOPFleschKincaidAnnotation
(206.835 - (1.015*wordCount/sentenceCount) - (84.6*syllableCount/wordCount))/100
OOPVerbHypernymsAnnotation
Wordnet hypernym
OOPNounHypernymsAnnotation
Wordnet most likely hypernym based on hyponym match against all nouns
OOPWikipediaGlossAnnotation
Wikipedia gloss (first sentence of page body) if NN,NNS,NNP,NNPS
OOPWikipediaPageviewTopicsAnnotation
Wikipedia pageviews for the past month for Topics (NNP,NNPS)
OOPWikipediaCategoriesAnnotation
Wikipedia page categories for Topics (NNP,NNPS)
OOPBiberDimensionsAnnotation
Biber: Variation across speech and writing, Dimensions 1-6
OOPSettingsAnnotation
OOPLocationAnnotation with sentiment, quotes, coref, and universal dependencies nsubj, nmod:poss, advmod, amod nsubj, root, obj, iobj, obl.
OOPActorsAnnotation
OOPPersonAnnotation with sentiment, quotes, coref, and universal dependencies nsubj, nmod:poss, advmod, amod nsubj, root, obj, iobj, obl.