Internal-only function to compute summary statistics on a character object.

summary_character(object, n = 100, tolower = FALSE, ...)

Arguments

object

character vector of texts to be summarized

n

maximum number of texts to describe; defaults to 100

tolower

convert texts to lower case before counting types

...

additional arguments passed through to tokens

Examples

# summarize texts
quanteda:::summary_character(c("Testing this text. Second sentence.", "And this one."))
#>    Text Types Tokens Sentences
#> 1 text1     6      7         2
#> 2 text2     4      4         1
quanteda:::summary_character(data_char_ukimmig2010)
#>           Text Types Tokens Sentences
#> 1          BNP  1125   3280        88
#> 2    Coalition   142    260         4
#> 3 Conservative   251    499        15
#> 4       Greens   322    679        21
#> 5       Labour   298    683        29
#> 6       LibDem   251    483        14
#> 7           PC    77    114         5
#> 8          SNP    88    134         4
#> 9         UKIP   346    723        27
summary_ukimmig2010 <- quanteda:::summary_character(data_char_ukimmig2010)
head(summary_ukimmig2010)
#>           Text Types Tokens Sentences
#> 1          BNP  1125   3280        88
#> 2    Coalition   142    260         4
#> 3 Conservative   251    499        15
#> 4       Greens   322    679        21
#> 5       Labour   298    683        29
#> 6       LibDem   251    483        14
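
The n and tolower arguments, and arguments forwarded to tokens through ..., are not exercised in the examples above. The following is a minimal sketch of how they might be used, assuming the installed quanteda version still ships the internal summary_character with the signature shown in the usage. The txt vector and its document names are made up for illustration, and no output is shown because the counts depend on the tokenizer in use.

```r
library(quanteda)

# Hypothetical input texts, made up for this illustration
txt <- c(d1 = "The cat sat. Then the cat slept.",
         d2 = "A dog barked.",
         d3 = "Birds sing.")

# n: describe at most the first two texts
res_n <- quanteda:::summary_character(txt, n = 2)

# tolower: lower-case the texts before counting types,
# so that "The" and "the" are counted as a single type
res_lower <- quanteda:::summary_character(txt, tolower = TRUE)

# ...: remove_punct is an argument of tokens(), forwarded through ...
res_nopunct <- quanteda:::summary_character(txt, remove_punct = TRUE)
```

Comparing res_lower and res_nopunct against the default call on the same texts is one way to see how each option changes the Types and Tokens columns.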