manydata 1.1.3
Package
- Updated website address
- Updated authorship
manydata 1.1.0
Package
- Updated GitHub actions to use code coverage secrets
Wrangling
- Added
filter_datacube() for filtering datasets in a
datacube by date
- Added
find_ID() and find_common_ID() for
identifying ID columns in datasets
Evaluation
- Added
find_year() for extracting just the year from a
date (potentially unnecessary if messydates::year() is
available)
- Added
compare_new() and compare_diff() for
comparing what is new or different in one dataset over another
- Added a range of
score_*() functions for scoring
datasets on various criteria, including consistency, completeness,
accuracy, timeliness, and uniqueness of the data
Maintaining
- Added
find_duplicates() for identifying duplicate
observations in datasets
- Added
code_extend_glove() and
code_extend_bert() for extending existing coding to new or
missing data
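As a rough illustration of the duplicate check described above, the base R sketch below flags rows that repeat another row on chosen ID columns. The helper name flag_duplicates() and the toy data are assumptions made for this example; it is not how find_duplicates() is implemented, and that function's actual arguments may differ.

```r
# Hypothetical helper (not part of manydata): flag every row whose ID
# columns also appear in another row, using only base R.
flag_duplicates <- function(data, id_cols) {
  keys <- data[, id_cols, drop = FALSE]
  # duplicated() marks later repeats; scanning from the end marks earlier ones
  duplicated(keys) | duplicated(keys, fromLast = TRUE)
}

# Toy dataset, invented for this example
toy <- data.frame(
  ID    = c("A1", "B2", "A1", "C3"),
  Begin = c("1990", "1995", "1990", "2001"),
  Title = c("First", "Second", "First (copy)", "Third"),
  stringsAsFactors = FALSE
)

dup_flags <- flag_duplicates(toy, c("ID", "Begin"))
toy[dup_flags, ]  # the two rows sharing ID "A1" and Begin "1990"
```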
manydata 1.0.3
Connection
- Added new
getID() helper that obtains the one or two ID
columns that appear as the first one or two columns in a datacube
- compare_overlap() now returns a list of each dataset’s
IDs to avoid issues with {ggVennDiagram}
- plot.compare_overlap() now always returns an upset plot
(closes #292)
- Fixed testing of ggplot objects (closes #308)
- Fixed how
plot.compare_categories() treated identifier
variables (closes #291)
manydata 1.0.2
Connection
- Fixed global variables in several
resolve_*()
functions
manydata 1.0.1
Package
Connection
- resolve_*() functions now have a parameter indicating
whether missing values should be included; unlike base R, by default
missing values are excluded
- Restored
resolve_mean()
- Restored
resolve_median()
- Added
resolve_mode() for retaining the most common
values
- Added
resolve_consensus() for retaining only values
where there are no conflicts
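To make the missing-value default and the two new resolvers more concrete, here is a minimal base R sketch. The function names sketch_mode() and sketch_consensus(), and the na.rm argument name, are assumptions for illustration; the actual resolve_*() signatures may differ.

```r
# Hypothetical stand-ins for the behaviour described above (not manydata code).
# Missing values are dropped by default, unlike base R summaries such as mean().
sketch_mode <- function(x, na.rm = TRUE) {
  if (na.rm) x <- x[!is.na(x)]
  if (length(x) == 0) return(NA)
  values <- unique(x)
  # Count how often each unique value occurs; ties go to the first value seen
  counts <- tabulate(match(x, values), nbins = length(values))
  values[which.max(counts)]
}

sketch_consensus <- function(x, na.rm = TRUE) {
  if (na.rm) x <- x[!is.na(x)]
  values <- unique(x)
  # Retain a value only when every source agrees on it
  if (length(values) == 1) values else NA
}

reported <- c(1871, 1871, NA, 1872)  # invented values from four sources
mean(reported)                       # base R keeps the NA by default
sketch_mode(reported)                # NA dropped first, most common value kept
sketch_consensus(reported)           # sources conflict, so nothing is retained
sketch_consensus(c(1871, NA, 1871))  # agreement once the NA is excluded
```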
manydata 1.0.0
Package
- Updated GitHub checks and release actions
- Fixes to URLs
- Updated website
- Improved ease of operation by making
{cli},
{dplyr}, and {messydates} Depends
- Dropped
{usethis} Suggest
Collection
- Updated
emperors dataset
- Using zero-padded messydates
- Added citation prompts
- Datasets capitalised:
emperors$Wikipedia
emperors$UNRV
emperors$Britannica
- Fixed non-unique IDs bugs
- Fixed inc
Calling
- Added
call_citations() to print citations added as
hidden information
- Fixed finicky
call_sources() bug related to calling
help files
- Improved
call_sources() and
call_citations() to accept datacubes or datasets, as
objects or characters
- Moved
mreport() from messydates
- Added
mreport.list() to make it easier to report on
datacubes
- Added
describe_data() for describing key aspects of
datasets in datacubes
- Fixed
call_releases() to use
messydates::vmin()
Connection
- Improved
pluck()
- Function now wraps
dplyr::pluck() but adds a citation
prompt
- Improved
consolidate()
- Improved usability with
{cli} progress messages and
success alerts
- Improved speed using
{dtplyr} in place of
dplyr::full_join() (closes #288)
- {duckplyr} considered: faster, but couldn’t handle
mdate class
- {collapse} considered: even faster, but inconsistent
output
- Improved compatibility by converting ‘rows’ argument to ‘join’
(breaking)
- “all” becomes “inner”
- “any” becomes “full”
- “favour” becomes “left”
- Fixed behaviour when passed a single dataset
- Prompts users to cite datasets (closes #280)
- Fixed bug in ‘resolve’ argument: a named ‘resolve’ vector no longer
has to be the same length as the variables
- Dropped ‘cols’ argument
- Updated tests for
consolidate() to use new ‘join’
argument
- testthat tests use
{cli} in quiet mode
- Updated
resolve_coalesce() for coalescing (taking first
non-NA value)
- Updated
resolve_random() for returning random values
sampling from those available
- Updated
resolve_min() and resolve_max()
for returning min or max values
- Added
resolve_unite() for returning all possible values
as a set
- Added
resolve_precision() for returning most precise
values available (closes #265)
- Added
precision.numeric() to return most significant
figures
- Added
precision.character() to return most
characters
- Dropped
resolve_median() and
resolve_mean() as uncommon choices
- Dropped
resolve_multiple() in favour of always using
a more flexible for loop
- Dropped
favour() in favour of left joins and
coalesces
- Dropped
coalesce_rows() as no longer necessary
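The new ‘join’ values noted above follow ordinary join terminology. The base R sketch below shows which IDs each join type retains, using merge() on two invented data frames. consolidate() itself is not called, so this only illustrates the terms and not the function's behaviour.

```r
# Old 'rows' values and the 'join' values that replace them (from the notes above)
rows_to_join <- c(all = "inner", any = "full", favour = "left")

# Toy datasets sharing an ID column, invented for this example
first  <- data.frame(ID = c("A", "B", "C"), Begin = c(1990, 1995, 2001),
                     stringsAsFactors = FALSE)
second <- data.frame(ID = c("B", "C", "D"), End = c(2000, 2010, 2020),
                     stringsAsFactors = FALSE)

inner <- merge(first, second, by = "ID")                # IDs found in both
full  <- merge(first, second, by = "ID", all = TRUE)    # IDs found in either
left  <- merge(first, second, by = "ID", all.x = TRUE)  # every ID from 'first'

inner$ID
full$ID
left$ID
```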
manydata 0.9.3
Connection
- Updated
call_sources() to be more flexible when
gathering data from datacube documentation
- Closed #279 by updating documentation across many packages to be
compatible with
call_sources()
- Updated
compare_dimensions() by fixing bugs related to
dates and NA observations
manydata 0.9.2
Package
- Fixed the
emperors data documentation issues related to
lost braces with CRAN submission
manydata 0.9.1
Package
- Updated test expectations to make package compatible with the new
release of
{ggplot2}
Connection
- Closed #266 by adding startup messages to ‘many’ packages
- Closed #267 by adding links to package websites in console
messages
- Closed #282 by updating all references from ‘database’ to
‘datacube’
- Closed #293 by fixing bugs related to missing dates when using
consolidate()
- Closed #294 by updating how
call_sources() identifies
datasets within datacubes
manydata 0.9.0
Package
- Closed #259 by revising CCC package structure and updating the
package cheatsheet
- Updated documentation for ‘emperors’ data to new style to improve
visibility and transparency
- Closed #264 by removing
{tibble} and
{janitor} package imports in DESCRIPTION file
- Closed #276 by reviewing package vignettes
- Closed #277 by updating ‘manydata-defunct’ file
- Closed #284 by removing vignette and updating README to include more
information on how to use the package
- Updated all references and argument from ‘database’ to
‘datacube’
Connection
- Renamed and updated ‘call_’ family of functions
- Closed #250, #251, and #262 by renaming
get_packages()
to call_packages() and updating how the function works and
looks up packages, version updates, and availability
- Closed #269 and # by adding a
call_sources() function
that displays sources and variable changes for datasets in
datacubes
- Closed #271 by updating the
retrieve_ family of
functions to call_ functions
- Closed #283 by renaming
plot_releases() to
call_releases()
- Renamed and updated ‘compare_’ family of functions
- Closed #243 and #257 by creating a
compare_missing()
function to compare missing values in datasets in a ‘many’ datacube
- Closed #249 and #253 by renaming
db_plot() function to
compare_categories() and updating variable categories
- Closed #261 by renaming and updating other
db_
functions to compare_ functions
- Closed #268 by adding
compare_overlap() to help users
investigate overlap for datasets within datacubes
- Closed #285 by adding
compare_dimensions() and
compare_ranges() to compare dimensions and ranges in
datacubes
manydata 0.8.3
Connection
- Made network_map() function defunct
manydata 0.8.2
Connection
- Updated
consolidate() to require two keys when joining
memberships’ databases
- Updated
db_comp() to follow consolidation defaults for
memberships’ databases
- Closed #231 by adding a
retrieve_texts() function to
retrieve treaty texts from other ‘many’ packages
manydata 0.8.1
Package
- Added ‘RDataTmp’ files to Rbuildignore and .gitignore
- Updated
data_evolution() to use inherits()
instead of class() for condition comparison
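A brief sketch of why inherits() is the safer choice in a condition: an object can carry several classes, so comparing class() to a string yields one result per class rather than a single TRUE or FALSE (and recent versions of R raise an error when such a vector is used inside if ()). The object and the describe() helper below are invented for illustration.

```r
# An object with several classes, as tibbles have
obj <- structure(list(), class = c("tbl_df", "tbl", "data.frame"))

class(obj) == "data.frame"   # one comparison per class: FALSE FALSE TRUE
inherits(obj, "data.frame")  # a single TRUE, safe to use inside if ()

# Hypothetical helper showing the pattern in a condition
describe <- function(x) {
  if (inherits(x, "data.frame")) "tabular" else "other"
}
describe(obj)
```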
manydata 0.8.0
Package
- Closed #212 by implementing package caching in GitHub actions
workflows
- Closed #218 by fixing bug with GitHub actions workflows
- Closed #225 by changing the structure of datasets in “many” data
packages
- Closed #240 by updating the package cheatsheet
Connection
- Closed #134 by adding a
data_evolution() function to
the report family of functions that gets original datasets, if
available, or opens the preparation scripts, if not available
- Added ‘db_profile’ family of functions to visualise databases
- Closed #214 by adding
db_plot() function to plot a
profile of the database to facilitate comparison of matched observations
across datasets
- Closed #224 by adding
db_comp() function that creates a
tibble of the database to facilitate comparison of matched observations
across datasets
- Updated
get_packages() function
- Closed #215 by making
get_packages() interactive so
that users can choose which branch to download
- Closed #219 by improving
get_packages() printing
- Updated
get_packages() and plot_releases()
to use {messydates}, instead of {lubridate},
for dates coercion
- Closed #222 by adding
network_map() function for
plotting geographical networks
- Updated
consolidate() function to make it over 20
times faster
- Closed #227 by making
consolidate() ignore text related
variables due to their size
- Closed #230 by making
consolidate() more concise to
avoid running into memory limits
- Closed #228 and #232 by replacing
coalesce_compatible()
with a faster approach to coalescing compatible missing observations that
relies on zoo::na.locf()
- Made
coalesce_compatible() function defunct
manydata 0.7.5
Package
- Removed
{skimr} table from emperors
database documentation
- Updated path for binaries in push release GitHub actions
manydata 0.7.4
Package
- Closed #187 by updating GitHub actions to implement package
caching
- Closed #209 by removing all non-ASCII characters in package
- Closed #210 by removing
{pkgdown} dependency
- Updated
emperors data to contain correct date class
name consistent with {messydates}
manydata 0.7.3
Connection
- Updated how the
get_packages() function identifies
installed packages to avoid using installed.packages()
- Updated documentation for
coalesce_compatible()
function to include the returns
manydata 0.7.2
- Ignored CRAN-SUBMISSION and resubmitted.
manydata 0.7.1
Package
- Updated DESCRIPTION by removing ambiguous word from title
- Updated README by correcting the URL for life cycle badge
Connection
- Updated helper functions for
consolidate() to use
inherits() to identify variable’s class
manydata 0.7.0
Package
- Closed #194 by updating all remaining references from “qID” to
“manyID”
- Updated package website
- Closed #196 by updating elements that configure website to work
properly
- Updated ‘_pkgdown.yml’ file to use Bootstrap 5 template to build
website
Connection
- Updated
consolidate() function
- Closed #191 by making
consolidate() function more
concise and faster by removing redundant code lines
- Fixed dates-related warnings by changing how
{messydates} package is used to resolve dates
- Updated how
consolidate() substitutes missing
observations with first non-missing observation from other datasets
- Closed #201 by fixing how
consolidate() detects
variables to be resolved to avoid ambiguous variable matching
- Closed #202 by allowing for multiple key vectors to be declared as
arguments for
consolidate()
- Closed #199 by adding
favour() (also
favor()) function that re-orders datasets within a
database
manydata 0.6.0
Package
- Closed #189 by renaming package from
{qData} to
{manydata}
- Updated user vignette to include more examples on working with
consolidate()
- Updated package website
- Closed #167 by adding a cheatsheet to README
Connection
- Updated
consolidate() function
- Closed #169 by making default key variable “many_ID” instead of
“qID”
- Closed #183 by adding further methods to resolve conflicts between
observations:
- Added “max” resolve argument which resolves conflicts in favor of
the largest non-NA value
- Added “min” resolve argument which resolves conflicts in favor of
the smallest non-NA value
- Added “mean” resolve argument which resolves conflicts in favor of
the average of the non-NA values
- Added “median” resolve argument which resolves conflicts in favor of
the median of the non-NA values
- Added “random” resolve argument which resolves conflicts in favor of
a random non-NA value
- Closed #185 by allowing users to specify the resolve argument
differently for different variables
- Closed #188 by adding more informative warnings for GitHub download
limits for
get_packages() function
- Added extraction functions to generate edgelists from agreements
membership datasets
- Added
extract_bilaterals() for extracting adjacency
edgelist for bilateral agreements
- Added
extract_multilaterals() for extracting adjacency
edgelist for multilateral agreements
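The per-variable resolve options listed above can be pictured with a small base R sketch. The resolvers list, the conflicts data, and the way the named vector is applied are all assumptions for illustration; consolidate() is not called here and its internals may work differently.

```r
# Hypothetical resolvers keyed by the option names above; each ignores NA
resolvers <- list(
  max    = function(x) max(x, na.rm = TRUE),
  min    = function(x) min(x, na.rm = TRUE),
  mean   = function(x) mean(x, na.rm = TRUE),
  median = function(x) stats::median(x, na.rm = TRUE),
  random = function(x) {
    x <- x[!is.na(x)]
    x[sample.int(length(x), 1)]  # index sampling avoids sample()'s length-1 quirk
  }
)

# Conflicting values for one observation across three datasets (invented)
conflicts <- list(
  Begin   = c(1990, 1992, NA),
  Members = c(12, NA, 15)
)

# A named vector selects a different resolver for each variable
resolve <- c(Begin = "min", Members = "max")

resolved <- vapply(names(conflicts), function(v) {
  resolvers[[resolve[[v]]]](conflicts[[v]])
}, numeric(1))
resolved
```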
qData 0.5.0
Package
- Updated
get_packages() to load downloaded packages
- Closed #160 by updating the report family of functions to print
objects only if not assigned
- Updated user vignette for working with
consolidate()
- Fixed #165 by moving
interleave() to qCreate
- Closed #149 by adding a citation file
- Closed #181 by adding data for running examples and tests
Connection
- Closed #28 by adding
consolidate() for collapsing a set
or database of (q)dataset into a single dataset
- Closed #177 by adding
coalesce_compact() to coalesce
all compatible rows of a data frame
- Closed #180 by adding progress bar for
coalesce_compact()
- Added tests for
consolidate() and
coalesce_compact()
- Renamed function for getting first non-missing value to
coalesce_rows()
- Added tests for
coalesce_rows()
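For readers unfamiliar with coalescing, the idea of taking the first non-missing value can be sketched in base R as follows. The helper first_non_missing() is hypothetical and works position by position across vectors, similar in spirit to dplyr::coalesce(); it is not the implementation of coalesce_rows().

```r
# Hypothetical helper: for each position, keep the first non-missing value
# found when reading the supplied vectors from left to right.
first_non_missing <- function(...) {
  Reduce(function(a, b) ifelse(is.na(a), b, a), list(...))
}

# Invented values reported by three datasets for the same three observations
coalesced <- first_non_missing(
  c(NA, "Rome", NA),
  c("Paris", "Roma", NA),
  c("Paris", NA, "Vienna")
)
coalesced
```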
qData 0.4.0
Package
- Separated
{qData} into two packages,
{qData} for users and {qCreate} for developers
- Moved
setup_package() and related functions to
{qCreate}
- Moved
import_data() and related functions to
{qCreate}
- Moved
export_data() and related functions to
{qCreate}
- Moved
plot_releases() to {migraph}
Connection
- Closed #137 by making
data_contrast() and
data_source() return exportable objects
qData 0.3.5
Connection
- Updated
get_packages() function
- Fixed bug with dates in
get_packages() by changing
dependencies to {lubridate}
- Updated formatting of the returned tibble to display more clearly
the information on qPackages
- Fixed #144 by substituting
NA where a listed qPackage
is not installed locally
qData 0.3.4
Package
- Fixed #128 by suppressing issues with warnings on pkgdown
website
- Updated contributor and user vignettes for new package changes
Collection
- Updated
setup_package() to make it more intuitive and
precise for users
- Closed #127 by updating the form in which author arguments are taken
in
setup_package()
- Added role argument to
setup_package() to make it more
precise
- Closed #139 by reducing file overhead generated by
setup_package()
- Simplified
setup_package() to be less verbose
- Renamed
new_author() function to
add_author() for naming consistency
- Closed #140 by fixing bug with
add_author()
- Fixed #121 by allowing
add_author() to treat multiple
comments
- Updated documentation for
setup_package() and
add_author()
Correction
- Updated
standardise_dates() to simplify function and
testing
- Closed #110 by simplifying helper functions and removing redundancy
in
standardise_dates()
- Closed #108 by simplifying tests and removing redundancy for
standardise_dates()
Connection
- Divided, updated and renamed
report_data()
- Closed #131 by dividing
report_data() into
data_source() and data_contrast()
functions
- Closed #104 by allowing users to select multiple datasets with
data_contrast()
- Closed #105 by allowing for multiple databases to display in a
package with
data_contrast()
- Fixed #111 by removing tests for outdated
report_data()
function
qData 0.3.3
Package
- Closed #122 by setting package logo to the correct aspect ratio
Collection
- Updated
setup_package() to make it easier to set up
collaborative qPackages
- Closed #106 by making
setup_package() accept more than
two authors as arguments
- Closed #119 by updating
setup_package() function so
that it adds LICENSE file to new qPackages
- Closed #90 by adding a
new_author() function that adds
new package contributors to description file
- Closed #115 by updating README templates for new qPackages to
include more sections
Correction
- Closed #116 by adding more tests for new qPackage templates for
standardizing NA formats
Connection
- Updated
get_packages() to make it easier to install
globalgov qPackages
- Fixed #117 by making the
get_packages() work with
package name only for qPackages part of the globalgov organization
- Closed #79 by updating
get_packages() to accept listed
number for package as arguments
- Closed #103 by adding GitHub lookup for
depends()
function
- Updated user and developer vignettes to be compatible with these
changes
qData 0.3.2
Package
- Replaced logo with new ‘black box’ hex
- Added open graph image to GitHub account
- Closed #83 by improving CII best practices score
- Added
depends() for loading and, if necessary,
installing CRAN packages
- Added
plot_releases() function that visualises
historical milestones/releases for GitHub repositories
- Updated
setup_package() function by removing license
lines in the code
- Fixed #89 by adding different description templates depending on the
number of authors declared in
setup_package()
- Added ‘dtc’ contributor type in the DESCRIPTION file
Collection
- Updated
import_data() to accept .RData files
Correction
- Updated and fixed bugs with
standardise_dates()
function and tests
- Closed #73 by updating
standardise_dates() to use
lubridate::as_date() in place of
anytime::anydate() to correctly treat historical BC and AD
dates
- Fixed #95 by simplifying
standardise_dates() to make
sure it works with all types of date variables
- Closed #77 and #93 by devolving
standardise_dates()
into various helper functions
- Updated
export_data() to better document new database
and dataset structure
- Documentation generated by
export_data() references the
newly created report_data() function to generate metadata
dataframes at the desired level (Package, Database, Dataset)
- Closed #76 by adding some smart documentation updates to
export_data()
- Closed #92 by tweaking
export_data() to document
datasets at the database level
- Closed #94 by adding a bibliographic attribute to each dataset with
export_data()
- Fixed #63 and #87 by setting up tests templates for the different
types of database created
- Added
retain() to retain chosen objects in the
environment, removing all other variables
Connection
- Closed #78 by creating
report_data() function that
displays information on the data within a qPackage
qData 0.3.1
Package
- Closed #81 by changing description of the Connecting data
section to include
export_data() specificities
- Closed #85 by deleting the argument in the function
use_ccby_license(name = packageAuthor) in the description
of the setup_package() function
- Corrected the typing error found in “develop” vignette
- Corrected styling issues in texts (deleted double spaces in
files)
- Updated the description of the
export_data() function
to its new version including database name as a string
qData 0.3.0
Package
- Changed package name from qDatr to qData
- Created a hex logo for qData
- Closed #32 by setting up a more informative README document
- Added codefactor badge
- Added CII badge, in progress
- Closed #69 by creating a vignette for users working with qData
- Closed #67 by creating a vignette for developers interested in
contributing to the qData ecosystem
- Added pkgdown workflow for updating the website upon release
Collection
- Updated
import_data() by suggesting upper case letters
for dataset names and lower case letters for database names
- Now complains if pointed to unrecognised data format
Correction
- Closed #70 by updating
standardise_dates() so that
extreme future dates are standardised
- Updated
standardise_dates() to accept multiple date
variables as inputs and to standardise separators
- Updated
standardise_dates() to be more efficient when
dealing with different date formats
- Deprecates
recent() in favour of
standardise_dates()
- Updates
export_data() by making it save datasets as
named list elements in a database
- Updates exported test templates for data to test for datasets within
databases
Connection
- Updates
get_packages() function to:
- Correctly identify latest release and installed versions
- Identify registered contributors
qDatr 0.2.1
Package
- Fixed #61 by adding tests for
setup_package() and
import_data()
- Closed #50 by making sure workflow templates are created in the
proper folder with
setup_package()
Correction
- Closed #62 by creating a function which vectorises dates and cleans
and standardizes diverse date formats,
standardise_dates()
- Closed #55 by deprecating
rearrange() function in
favour of dplyr::relocate()
- Closed #65 by correcting issues with the path call in
export_data() function
- Ensures consistency for path calls across qData functions
Connection
- Closed #64 by creating the
get_packages() function
which displays other packages, and some information about these
packages, in the qData ecosystem. The function serves as a wrapper for
downloading qPackages from GitHub.
qDatr 0.2.0
Package
- Updated pushrelease.yml to upload all assets individually to
release
- ((Tests for following functions on hold…))
Collection
- Renamed
create_qPackage() to
setup_package()
- Now establishes main testing framework
- Opens fewer files but is more verbose
- Closed #51 by trimming dependencies from
qPackage-DESC
template
- Fixed bug in
qPackage-DESC where names weren’t
strings
- Renamed
use_qData_raw() to import_data()
- Closed #49 with path argument or selector
- Added delete_original argument (default = FALSE)
- Closed #58 by extracting file type information from path and adding
it and path to preparation template
Connection
- Renamed
use_qData() to export_data()
- Avoided reliance on
usethis::use_data()
- Fixed bug where it wasn’t selecting object correctly
- Closed #57 by passing on object information into the documentation
template (now
qData-doc.R)
- Adapted test template so that object name is passed forward
qDatr 0.1.0
Package
- Updated CONTRIBUTING
- Added README.Rmd render to pushrelease workflow
Collection
- Closed #47 by adding
qtemplate() function for finding
and rendering templates
- Closed #44 by adding
create_qPackage() that establishes
a {qData} consistent package framework
- Fixed #42 by drawing information from DESCRIPTION where
possible
- Added DESCRIPTION template
- Added README template
- Added COC template
- Added CONTRIBUTING template
- Added PR and issue templates for bugs and features
- Added workflow templates for push-release, pr-checks, and
pr-commands
- Closed #48 by adding
use_qData_raw() for setting up
data cleaning and wrangling
- Added template for importing, cleaning and exporting raw data
Correction
- Closed #24 by adding
repaint() for filling in missing
data by lookup
- Added
recent() for sensible centuries for dates
Connection
- Closed #45 by adding
use_qData() for setting up tests,
documentation, and lazy-loading of cleaned data
- Added template for documenting cleaned data
- Added template for testing cleaned data
qDatr 0.0.4
Collection
- Closed #25 by adding
create_qpackage()
- Fixed ‘pushrelease.yml’ postfix bug
Correction
- Closed #26 by adding
interleave()
- Closed #23 by adding
resequence()
- Closed #30 by adding
recollect()
qDatr 0.0.3
Package
- Closed #18 by setting up
{lintr},
{goodpractice}, and {spelling} in
prchecks.yml
Correction
- Closed #29 by adding
rearrange()
- Closed #31 by adding
reunite()
qDatr 0.0.2
Package
- Added a
NEWS.md file to track changes to the
package.
Correction
- Added
transmutate() for merging variables
qDatr 0.0.1
Package
Correction
- Added
entitle() for standardising treaty titles,
etc.