License
The code is licensed under the Apache License 2.0. See the LICENSE file.
- DocuScope
- Corpus analysis
- API
- corpus_analysis
- corpus_analysis.convert_corpus(tm_corpus)
- corpus_analysis.frequency_table(tok, n_tokens, count_by='pos')
- corpus_analysis.tags_table(tok, n_tokens, count_by='pos')
- corpus_analysis.tags_dtm(tok, count_by='pos')
- corpus_analysis.ngrams_table(tok, ng_span, n_tokens, count_by='pos')
- corpus_analysis.coll_table(tok, node_word, l_span=4, r_span=4, statistic='pmi', count_by='pos', node_tag=None, tag_ignore=False)
- corpus_analysis.kwic_center_node(tm_corpus, node_word, ignore_case=True, glob=False)
- corpus_analysis.keyness_table(target_counts, ref_counts, correct=False, tags_only=False)
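The `coll_table` signature above defaults to `statistic='pmi'` with a window of four tokens on each side of the node word. As a rough illustration of what a windowed pointwise mutual information score involves, here is a minimal pure-Python sketch. The function `pmi_collocates` is hypothetical and is not part of the package; it only borrows the parameter names `node_word`, `l_span`, and `r_span` from the signature, and it assumes a plain list of string tokens rather than the tagged token objects the library works with. The library's own calculation may differ in detail.

```python
import math
from collections import Counter


def pmi_collocates(tokens, node_word, l_span=4, r_span=4):
    """Hypothetical helper: score words found near node_word using PMI."""
    total = len(tokens)
    freq = Counter(tokens)   # overall frequency of every token
    joint = Counter()        # co-occurrence counts inside the window

    for i, tok in enumerate(tokens):
        if tok != node_word:
            continue
        left = tokens[max(0, i - l_span):i]
        right = tokens[i + 1:i + 1 + r_span]
        joint.update(left + right)

    scores = {}
    for word, observed in joint.items():
        # PMI = log2( observed * N / (freq(node) * freq(word)) )
        scores[word] = math.log2(
            (observed * total) / (freq[node_word] * freq[word])
        )
    return scores


tokens = "the cat sat on the mat the cat ran".split()
scores = pmi_collocates(tokens, "cat", l_span=1, r_span=1)
for word, score in sorted(scores.items(), key=lambda kv: -kv[1]):
    print(f"{word}\t{score:.3f}")
```

Rare words that appear once next to the node score higher than frequent ones, which is the usual behavior of PMI and one reason collocation tools often pair it with a minimum frequency threshold.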