quanteda is an R package for managing and analyzing textual data, developed by Kenneth Benoit and other contributors. Its initial development was supported by the European Research Council grant ERC-2011-StG 283794-QUANTESS.
The normal way is to install from CRAN, using your R GUI or by running install.packages("quanteda").
Or for the latest development version:
# devtools package required to install quanteda from GitHub
devtools::install_github("quanteda/quanteda")
Because this compiles some C++ and Fortran source code, you will need to have installed the appropriate compilers.
If you are using a Windows platform, this means you will also need to install the Rtools software available from CRAN.
If you are using macOS, you should install the macOS tools, namely the Clang 6.x compiler and the GNU Fortran compiler (as quanteda requires gfortran to build).
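Before attempting an install from source, it can help to check whether the command-line build tools are visible to R. The following is a minimal sketch using only base R. The helper name `missing_build_tools` and its default tool list are illustrative assumptions, not part of quanteda, and a tool being found on the PATH does not guarantee that its version is suitable.

```r
# Hypothetical helper (not part of quanteda): return the names of the
# command-line build tools that cannot be found on the current PATH.
missing_build_tools <- function(tools = c("make", "gfortran")) {
  found <- Sys.which(tools)      # full path, or "" when a tool is not found
  unname(tools[!nzchar(found)])
}

missing <- missing_build_tools()
if (length(missing) > 0) {
  message("Not found on PATH: ", paste(missing, collapse = ", "))
} else {
  message("All listed build tools were found on PATH.")
}
```

If nothing is reported as missing, the `devtools::install_github("quanteda/quanteda")` call above is more likely to compile cleanly. On Windows, the tools provided by Rtools may only be visible once Rtools has been installed and R restarted.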
quanteda is cross-platform, but we recommend macOS or Linux as an operating system for their better handling of Unicode. The amount of RAM required depends on the size and the structure of the textual data to analyze. Usually, a text file of 100MB on disk takes 500MB to 1GB in memory as a tokens object (short texts require more memory than long texts when the total number of words is the same).
| | Minimum | Recommended |
|-----|---------|-----------------|
| CPU | 1 core | 4 cores or more |
| RAM | 2GB | 8GB or more |
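The rule of thumb above (100MB on disk taking 500MB to 1GB in memory) corresponds to a multiplier of roughly 5 to 10. A small sketch of that arithmetic follows. The function name `estimate_tokens_memory_mb` is hypothetical, and the multipliers are only the rough figures quoted above, so real usage will vary with the structure of the texts.

```r
# Hypothetical helper (not part of quanteda): apply the rough 5x-10x
# rule of thumb to an on-disk size given in megabytes.
estimate_tokens_memory_mb <- function(size_on_disk_mb, low = 5, high = 10) {
  c(low = size_on_disk_mb * low, high = size_on_disk_mb * high)
}

estimate_tokens_memory_mb(100)
# low = 500, high = 1000 (in MB)
```

For a real file, the on-disk size in megabytes can be obtained with `file.size(path) / 1024^2`, where `path` is the location of the text file.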
See the quick start guide to learn how to use quanteda.
If you like quanteda, please consider leaving feedback or a testimonial here.
Contributions in the form of feedback, comments, code, and bug reports are most welcome. How to contribute: