as.list.dictionary2() to enable more flexible conversion of dictionary objects. (#1661)
size now works with the by argument, to control the size of units sampled from each group.
textstat_simil(), see below.
textstat_simil() now return sparse symmetric matrix objects using classes from the Matrix package. This replaces the former structure based on the dist class. Computation of these classes is now also based on the fast implementation in the proxyC package. When computing similarities, the new min_simil argument allows a user to ignore values below a specified similarity threshold. A new coercion method as.data.frame.textstat_simildist() now exists for converting these returns into a data.frame of pairwise comparisons. Existing methods such as as.list() work as they did before.
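As a rough illustration of the behaviour described above, the sketch below computes document similarities with a min_simil threshold and coerces the result to a data.frame. The toy texts and the threshold of 0.5 are made up for the example. It assumes a quanteda release in which textstat_simil() is exported by quanteda itself, as in the version these notes describe; later releases provide it through the quanteda.textstats package.

```r
library(quanteda)
# Assumption: textstat_simil() is exported by quanteda here; in later
# releases you would also need library(quanteda.textstats).

# Toy documents, invented purely for illustration.
txt <- c(d1 = "apple banana cherry damson",
         d2 = "apple banana cherry elderberry",
         d3 = "xylophone yacht zebra")
dfmat <- dfm(tokens(txt))

# Pairwise cosine similarity between documents; values below the
# threshold are dropped from the sparse result.
simil <- textstat_simil(dfmat, method = "cosine",
                        margin = "documents", min_simil = 0.5)

# Coerce to a data.frame of pairwise comparisons.
simil_df <- as.data.frame(simil)
print(simil_df)
```

Because the result is sparse, a threshold can keep the output small when many document pairs share little or no vocabulary.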
textstat_simil() because these were either not symmetric or not invariant to document or feature ordering. Finally, the selection argument has been deprecated in favour of a new
textstat_readability() now defaults to measure = "Flesch" if no measure is supplied. This makes it consistent with textstat_lexdiv(), which also takes a default measure (“TTR”) if none is supplied. (#1715)
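A small sketch of the two defaults, using an invented sample text. It assumes, as above, a quanteda release that exports the textstat_*() functions directly; later releases provide them through quanteda.textstats.

```r
library(quanteda)
# Assumption: textstat_*() functions are exported by quanteda here; in
# later releases you would also need library(quanteda.textstats).

# Invented sample text, for illustration only.
txt <- c(d1 = "The cat sat on the mat. The dog barked loudly at the cat.")

# No measure supplied: the Flesch score is computed by default.
read <- textstat_readability(txt)
print(names(read))

# No measure supplied: the type-token ratio (TTR) is computed by default.
lexdiv <- textstat_lexdiv(dfm(tokens(txt)))
print(names(lexdiv))
```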
tokens_select() are now NULL, meaning they are not applied if the user does not supply values. Fixes #1713.
kwic.tokens() behaviour now aligned, meaning that dictionaries are correctly faceted by key instead of by value. (#1684)
tokens() verbose output. (#1683)