quanteda is an R package for managing and analyzing textual data developed by Kenneth Benoit and other contributors. Its initial development was supported by the European Research Council grant ERC-2011-StG 283794-QUANTESS.

How to Install

The normal way from CRAN, using your R GUI or

Or for the latest development version:

Because this compiles some C++ and Fortran source code, you will need to have installed the appropriate compilers.

If you are using a Windows platform, this means you will need also to install the Rtools software available from CRAN.

If you are using macOS, you should install the macOS tools, namely the Clang 6.x compiler and the GNU Fortran compiler (as quanteda requires gfortran to build).

System Requirements

quanteda is cross-platform but we recommend MacOS or Linux as an operating system for their better handling of Unicode. RAM depends on the size and the structure of the textual data to analyze. Usually, a text file of 100MB on disk takes 500MB to 1GB on memory as a tokens object (short texts require more memory than long texts when the total numbers of words are the same).

Minimum Recommended
OS Windows/MacOS/Linux MacOS/Linux
CPU 1 core 4 cores or more
RAM 2GB 8GB more more
IDE R Studio

How to Use

See the quick start guide to learn how to use quanteda.

Leaving Feedback

If you like quanteda, please consider leaving feedback or a testimonial here.


Contributions in the form of feedback, comments, code, and bug reports are most welcome. How to contribute: