Summary statistics on a character vector

Internal-only function to compute summary statistics on a character object.

summary_character(object, n = 100, tolower = FALSE, ...)

Arguments

object	corpus to be summarized
n	maximum number of texts to describe, default=100
tolower	convert texts to lower case before counting types
...	additional arguments passed through to `tokens`

Examples

# summarize texts
quanteda:::summary_character(c("Testing this text. Second sentence.", "And this one."))
#>    Text Types Tokens Sentences
#> 1 text1     6      7         2
#> 2 text2     4      4         1
quanteda:::summary_character(data_char_ukimmig2010)
#>           Text Types Tokens Sentences
#> 1          BNP  1125   3280        88
#> 2    Coalition   142    260         4
#> 3 Conservative   251    499        15
#> 4       Greens   322    679        21
#> 5       Labour   298    683        29
#> 6       LibDem   251    483        14
#> 7           PC    77    114         5
#> 8          SNP    88    134         4
#> 9         UKIP   346    723        27
summary_ukimmig2010 <- quanteda:::summary_character(data_char_ukimmig2010)
head(summary_ukimmig2010)
#>           Text Types Tokens Sentences
#> 1          BNP  1125   3280        88
#> 2    Coalition   142    260         4
#> 3 Conservative   251    499        15
#> 4       Greens   322    679        21
#> 5       Labour   298    683        29
#> 6       LibDem   251    483        14

Arguments

Examples

Contents