dfm()
returns a dfm with the identical column order even if tokens_compound()
or tokens_ngrams()
is used in the upstream (#2100).dfm_group()
with NA values in a grouping variable now drops those, similar to the behaviour of tokens_group()
and corpus_group()
(#2134).char_wordstem()
now has a a new argument check_whitespace
, which will not throw an error when lower-casing text containing a whitespace character.