FeatureHashing: Creates a Model Matrix via Feature Hashing with a Formula Interface

Feature hashing, also known as the hashing trick, is a method for transforming the features of an instance into a vector, and thus for transforming a real dataset into a matrix. Instead of looking up indices in an associative array, it applies a hash function to the features and uses their hash values directly as indices. The feature hashing method in this package was proposed in Weinberger et al. (2009) <doi:10.48550/arXiv.0902.2206>. The hashing algorithm is the murmurhash3 from the 'digest' package. Please see the README at <https://github.com/wush978/FeatureHashing> for more information.
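
The idea described above can be illustrated with a minimal, language-neutral sketch in Python. It is not the package's implementation: the helper names `hash_index` and `hash_row`, the hash size of 16, and the "name=value" encoding of categorical features are assumptions made for illustration, and MD5 from the standard library stands in for murmurhash3.

```python
import hashlib


def hash_index(feature: str, hash_size: int) -> int:
    """Map a feature string to a column index in [0, hash_size).

    Assumption: MD5 is used only as a stand-in hash function;
    the package itself uses murmurhash3.
    """
    digest = hashlib.md5(feature.encode("utf-8")).digest()
    return int.from_bytes(digest[:4], "little") % hash_size


def hash_row(row: dict, hash_size: int = 16) -> list:
    """Turn one instance (feature name -> value) into a fixed-length vector."""
    vec = [0.0] * hash_size
    for name, value in row.items():
        if isinstance(value, str):
            # Categorical feature: hash "name=value" and add 1,
            # much like a dummy variable in a model matrix.
            vec[hash_index(f"{name}={value}", hash_size)] += 1.0
        else:
            # Numeric feature: hash the name and add the value.
            vec[hash_index(name, hash_size)] += float(value)
    return vec


# A tiny dataset becomes a matrix with a fixed number of columns,
# without ever building a feature-to-index lookup table.
rows = [{"color": "red", "size": 2.5}, {"color": "blue", "size": 1.0}]
matrix = [hash_row(r) for r in rows]
```

Because indices come straight from the hash function, two distinct features can land in the same column (a collision); in this sketch their contributions simply add up. A larger hash size makes collisions less likely at the cost of a wider matrix.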

Version: 0.9.2
Depends: R (≥ 4.0), methods
Imports: Rcpp (≥ 0.11), Matrix, digest (≥ 0.6.8), magrittr (≥ 1.5)
LinkingTo: Rcpp, digest (≥ 0.6.8), BH (≥ 1.54.0-1)
Suggests: RUnit, glmnet, knitr, xgboost, rmarkdown, pROC
Published: 2024-01-10
DOI: 10.32614/CRAN.package.FeatureHashing
Author: Wush Wu [aut, cre], Michael Benesty [aut, ctb]
Maintainer: Wush Wu <wush978 at gmail.com>
BugReports: https://github.com/wush978/FeatureHashing/issues
License: GPL (≥ 3) | file LICENSE
URL: https://github.com/wush978/FeatureHashing
NeedsCompilation: yes
Materials: README ChangeLog
CRAN checks: FeatureHashing results

Documentation:

Reference manual: FeatureHashing.pdf
Vignettes: FeatureHashing, Sentiment Analysis via FeatureHashing

Downloads:

Package source: FeatureHashing_0.9.2.tar.gz
Windows binaries: r-devel: FeatureHashing_0.9.2.zip, r-release: FeatureHashing_0.9.2.zip, r-oldrel: FeatureHashing_0.9.2.zip
macOS binaries: r-release (arm64): FeatureHashing_0.9.2.tgz, r-oldrel (arm64): FeatureHashing_0.9.2.tgz, r-release (x86_64): FeatureHashing_0.9.2.tgz, r-oldrel (x86_64): FeatureHashing_0.9.2.tgz
Old sources: FeatureHashing archive

Linking:

Please use the canonical form https://CRAN.R-project.org/package=FeatureHashing to link to this page.