test_anchor
test_anchor
test_anchor
test_anchor
test_anchor
test_anchor
Added test_anchor
Added more unit tests Fixed a bug in doc_centrality
using the centroid
method
Fixes for changes to the Matrix package Updating documentation and added examples
Fix encoding issue for non-ASCII characters to work with fastmatch
Add functionality - perm_tester
for Monte Carlo Permutation Tests for Model P-Values - rancor_builder
creates random corpus based on provided term probabilities - rancors_builder
creates multiple random corpora]
Include additional tests, updated documentation and vignettes
Working on an encoding error in fastmatch
which shows inconsistent behavior with non-ASCII characters. This dev version provides a temporary fix.
doc_centrality
calculates four graph-based centrality metrics using DTMsdoc_similarty
calculates four document similarity measures using DTMsget_regions
, instead of mlpackseq_builder
creates a token-integer sequence representationdtm_builder
includes an option to return a dense base R matrixdtm_stopper
includes an option to remove based on a terms rank (e.g., top 10), stopping based on count and proportion are now two separate optionsfind_transformation()
to norm, center, and align matricesfind_projection()
finds the projection matrix onto a vectorfind_rejection()
finds the rejection matrix away from a vectordtm_melter()
quickly turns a DTM into a triplet dataframe (doc_id, term, count)get_centroid()
naming (limits to single word for names)dtm_stopper()
to stop words by document or term frequencies
stop_freq
was changed to stop_termfreq
dtm_resampler()
to resample proportion and fixed N lengthsNEWS.md
file to track changes to the package.