Toggle navigation
quanteda
4.0.2
Quick Start
Quick Start Guide
Guía de Inicio Rápido
快速入门指南
クイック・スタートガイド
त्वरित आरंभ
Reference
Features
Parallel computing
External pointer tokens objects
Performance benchmarks
Package design
Changelog
Examples
Multi-word expressions
Textual data visualization
Latent Semantic Analysis (LSA)
Chinese text analysis
Social media analysis
Replications
Text Analysis with R for Students of Literature
Word embedding (word2vec)
Quantitative Social Science Ch. 5.1
Changelog
3.1
quanteda 3.1
Bug fixes and stability enhancements
Improved and more consistent handling of empty corpus, tokens and dfm objects, to address
#2110
.
rbind.dfm()
now preserves docvars (
#2109
).
Document name for Biden’s 2021 Inaugural Address in
data_corpus_inaugural
is now consistent with all other documents.
Fix
#2127
that caused subsetting to change document names.
Changes and additions
phrase()
now has a
separator
argument (
#2124
).
Deprecations
phrase()
methods for tokens, collocations, and lists are deprecated in favour of
as.phrase()
(
#2129
).
Contents