| NEWS | R Documentation | 
NEWS for EdSurvey
Changes in version 4.0.7
UPDATES
-  readNAEPfunction now supports analyzing the following NAEP data products:- the main NAEP data up to 2022 
- NAEP long-term trend data up to 2023 
- the 2022 Monthly School Survey (MSS) Linking Study 
- the 2021 School and Teacher Questionnaire Special Study. 
 
BUG FIXES
-  [[<-assignment did not previously work with anedsurvey.data.frameobject. This causes issues when defining new columns with a few possible methods that resulted in newly defined columns not being returned bygetData. Thanks to Tao Jiang for identifying this bug.
Changes in version 4.0.6
UPDATES
- allow rounding by default when printing results after the user either set - options(EdSurvey_round_output=TRUE)or calls the- printfunction on an EdSurvey object with the argument- use_es_round=TRUE. See- ?es_roundfor details.
- added support for PISA 2022, this support was contributed and not tested by the EdSurvey team. 
- added support for ICILS 2023, this support was contributed and not tested by the EdSurvey team. Thanks to Jeppe Bundsgaard for the contirbution! 
- in every EdSurvey function - omittedLevelsis deprecated. It is replacted by- dropOmittedLevelswhich works in the same way.
- allow Matrix 1-5-4.1 after testing to allower older R versions on Mac OS. 
MINOR
- in example code, all arguments are now named 
- the vignette now sends users directly to the EdSurvey book [Analyzing NCES Data Using EdSurvey: A User's Guide](https://naep-research.airprojects.org/portals/0/edsurvey_a_users_guide/_book/index.html). 
- replace - 1:length(object)with- seq_along(object)in the entire package
- decreased size of file by removing unused files, such as unused files from old tests 
- the test file size is decreased on many larger files 
Changes in version 4.0.4
UPDATES
- upated to pass CRAN checks, - cbindand- rbindare nolonger explicitly exported. Apparently this changed in R 4.0.0 and we were behind the times. The load message is not shorter because we do not export these functions.
- similarly - subsetis nolonger exported but is instead implemented. Becuase of that there is no documentation for it now, but it works as the original- subsetworks and adds an- insideargument that prevents- substitutefrom being called on the- subsetargument as recomended in Advanced R (Wickham, 2014).
- 'downloadPIRLS' and 'readPIRLS' were updated to include PIRLS 2021 data 
- updated subscale mappings for TIMSS to allow for four-character subscale names to be used with - mml.sdffor all subscales across grade, subject, and year
- exported - readHSTS
BUG FIXES
-  readNAEPused to read in a non-existant weight namedweightin rare instances. This is now fixed.
- fixed bug where - mml.sdfproduced an error if there were not polytomous items associated with the subscale.
Changes in version 4.0.0
UPDATES
- the argument - omittedLevelshas been deprecated and replaced with- dropOmittedLevelsfor clarity.
-  mml.sdfnow allows users to selectcomposite=TRUEfor TIMSS after recoding thetestDatato indicate the test.
-  mml.sdfnow gives clearer warnings when there are no records with assessment data or no students with valid covariate data.
- when 2019 TIMSS data was used with - mml.sdfonly the digital items were included. EdSurvey reads in the eTIMSS data for eTIMSS countries and the paper data for countries that took only paper TIMSS. With this fix- mml.sdfcan be used in countries that used paper TIMSS in 2019.
- fixed a bug which made it impossible to do analysis on a - light.edsurvey.data.frameafter calling- rebindAttributeswhen using TALIS (and any other data with no plausible values) thanks to Anders Astrup Christensen for documenting the bug.
Changes in version 3.1.0
UPDATES
- fix an issue where - lmand- glmwere marked as generic functions to fix CRAN checks.
-  readNAEPcan now read in the NAEP metadata from an xml file or an fr2 file (the former is new). This change should be transparent to the user, but users wishing to specify the use of the XML file can use thexmlPathargument (see?readNAEP).
BUG FIXES
-  achivementLevelsnow returns number of rows on the data asnPSUwhen there are not PSU/stratum vars (e.g., for Germany in the PIACC data).
-  achivementLevelsnow raise an error if any aggregation variable is continuous.
- fixed multicore in - mml.sdf.
Changes in version 3.0.2
UPDATES
-  readNAEPnow will attempt to read IRT parameters from the AM folder's.dctfile if not found in NAEPirtparams package.
-  mml.sdfnow returns the scored data.
-  edsurveyTablenow returns when a line has n=1 respondents on it
-  cor.sdfnow works with logical variables.
-  readNAEPcan now utilize the NAEPEX .XML layout file as an alternative method to the .FR2 layout file by explicitly specifying thexmlPathargument.
BUG FIXES
-  gapnow returns results (and does not give an error) when the standard deviation of a continuous variable (that does not have plausible values) is requested. Thanks to Christian Christrup Kjeldsen for identifying and documenting this bug.
Changes in version 3.0.1
UPDATES
-  lm.sdfnow has averbosefunction, some warnings now just show as messages (whenverboseisTRUE).
- when an item is used in a call, it does not have omitted levels dropped by default. 
BUG FIXES
- should pass tests on all CRAN configurations 
-  mvrlm.sdfhad a bug when there was an item as a predictor that is now solved
- it was not possible to use TIMSS overall scores with - mml.sdfwithout further editing the- testData, this has been fixed
- some unquoted references in calls were not resolving correctly to columns on the data when they should have. This has been fixed. 
Changes in version 3.0.0
UPDATES
- add - mml.sdfto allow direct estimation using the- Direpackage.
- add - drawPVsto enable users to draw plausible values using- Direafter running- mml.sdf.
-  suggestWeightsfunction assists researchers in deciding which weight to use for analyses with ECLS-K:2011 data. See thesuggestWeightsdocumentation and vignette for more information.
-  summary2output now shows zero weights without a colon.
- it is now possible to subset an - edsurvey.data.frameto just one row with square braces.
BUG FIXES
-  readELSwould sometimes re-cache files when they did not need to be. This has now been corrected.
-  readPIAACis now set to validate variables with missing value labels undefined in the codebook. For missing label values it will now return the numeric value as the label instead of an NA value. Thanks to GitHub user webervienna for identifying and reporting this bug.
Changes in version 2.7.1
DATA UPDATES
- Added support for the National Household Education Survey (NHES) set of data. See - downloadNHESfor instructions to locate and obtain the NHES data files, and see- readNHESfor reading the NHES data for analysis.
- Added support for the School Survey on Crime and Safety (SSOCS) set of data. See - downloadSSOCSfor instructions to locate and obtain the SSOCS data files, and see- readSSOCSfor reading the SSOCS data for analysis.
UPDATES
-  append.edsurvey.data.frame.listnow useslabAandlabBin preference to existing labels. It also now accepts label names other thanlabel. When the resulting labels are not unique, it generates new ones. Also, if a singleedsurvey.data.frameis added to anedsurvey.data.frame.list, and the labels for theedsurvey.data.frame.listappear to be automatically generated, then the labels are automatically generated for the output of the call toappend.edsurvey.data.frame.listas well. These changes should make the output ofappend.edsurvey.data.frame.listseem more logical.
-  showCutPointsnow returns one line per set of cut points and does not show the cut point labels.
- updated warnings and errors when an - edsurvey.data.frameis passed to a vareity of functions that requires use of PSU or stratum variables but they are not on the data.
-  cor.sdfnow allows dynamic variables for the variables being correlated (xandy)
-  searchSDFnow generates a tabled output when passed anedsurvey.data.frame.list.
-  readTIMSSverbose output (verbose=TRUE) prints file merge details during data processing to the console. Additionally,readTIMSSwill printdataLeveldetails for both processed and cached data for theedsurvey.data.frame.
STATISTICS
-  SDused to use the population variance estimator and now uses the sample variance estimator. This also affects standard deviations calculated with thegapfunction.
BUG FIXES
-  SDused to setjkSumMultiplierto 1 by default and now uses the value from theedusrvey.data.frame. While this was the documented behavior, it was counterintuitive since all otherEdSurveyfunctions use the value directly from theedusrvey.data.frame.
- when calling - achievementLevelswith both- returnDiscrete = FALSEand- returnCumulative = TRUEit would return an error. It now properly returns.
-  levelsSDFupdated for handling value labels containing a=character in the label. The full label is now properly returned. This corrects erroneous warning messages related to recodes where=character is present. Thank you to Nicole Guarino for identifying this bug.
Changes in version 2.7.0
DATA UPDATES
- Added support for the PISA Young Adult Follow-up Study (YAFS). See - readPISA_YAFSfor more information.
-  readICCSnow woks for 2016 ICCS data.
STATISTICS
- supports linking error in NAEP assessments that have combined PBA and DBA formats, including NAEP 2018 Social Studies (grades 8); 2019 Science (grades 4, 8, and 12); 2019 Mathematics and Reading (grades 4, 8, and 12). See the vignette for details on the implementation at https://www.air.org/sites/default/files/2021-07/EdSurvey-linkingerror.pdf. 
-  mml.sdfa new function that estimates student scale scores using the marginal maximum likelihood regression estimation method (as the program AM does). This is an alternative method to the plausible values approach.
-  mixed.sdfnow gives clearer information about groups and weights, similar to the updatedWeMix::mixfunction.
-  achievementLevelsnow accepts multiple sets of plausible values at once, allowing joint achievement levels to be calculated. Because of this thediscreteandcumulativeresults column names now include the name of the variable. So for a composite score it will now be labeledcomposite_Levelwhere it used to sayLevel. Also thecutpointsobject now specifies the scores the cutpoints regard.
-  percentilenow has a new method “symmetric” that is similar to “simple” but returns the negative result if all of the values are multiplied by negative one. This method is faster than “unbiased” which has a small bias correction.
-  percentileused to report the number of people who were more extreme than any percentile in thensmallcolumn. This was not used and so was removed.
-  percentilenow accepts adofMethodargument to allow either of the methods described inDoFCorrection:WSfor the Welch-Satterthwaite formula orJRfor the Johnson-Rust formula. This defaults toJRbut was previouslyWS.
-  edsurveyTablenow warns and removes rows with non-positive full sample weights or no outcome. The also warnings now include the number of rows removed by the filter. Non-positive and NA weights are removed only if thereturnMeansorreturnSepctareTRUE. Missing outcome scores are removed only ifreturnMeansisTRUE.
- we simplified all the code in the statistical methods, including - rq.sdf,- lm.sdf, and- percentile.
BUG FIXES
- after subsetting an edsurvey.data.frame, the cache (user defined variables) used to throw errors. This now works. 
-  mixed.sdfused to not correctly identify covariances when outputting the variance tables. Thanks to Endraswara Wiryawan for identifying this bug.
-  readTIMSSpreviously attempted to merge on 2019 teachers using the student's grade level. This lead to many teacher variables not being merged on in AAD (Abu Dhabi, UAE), ARE (United Arab Emirates), and NZL (New Zealand). Thanks to Damian Campbell for identifying this bug.
- fixed - readTIMSSAdvbug for 1995 data relating to storing column dimension information in cache.
OTHER
- when - EdSurveyis attached, it nolonger sets the options value of- timeout. Instead it only sets it to the larger of the current value and an hour when downloading files.
Changes in version 2.6.9
DATA UPDATES
-  downloadTIMSSandreadTIMSSwere updated to include 2019.
-  downloadPIAACwas updated so data/codebook downloads perform more robustly on different OS.
-  readPIAACwas updated so the number of replicate weights in the resultingedsurvey.data.frameis accurate to the OECD source data. Before this number was hard coded, resulting in duplicate replicate weights.
-  readPISAadded 2018 financial literacy.
-  downloadICILSnow references instructions for downloading the 2018 ICILS data set, as thereadICILSfunction supported ICILS 2018 as of EdSurvey v2.4.0
- when calling - $on an edsurvey.data.frame with a weight variable, only the vector of full sample weights are returned now. This used to also return a matrix (when relevant) that also included the replicate weights.
-  downloadPISAandreadPISAnow support the PISA 2018 Financial Literacy (FIN) database.
STATISTICS
-  lm.sdf,glm.sdf, andedsurveyTableand now supports surveys where the PSU and stratum variables varried by weight variable (usually longitudinal surveys) for Taylor series variance estimation.
-  mixed.sdfnow has acoefandvcovmethods.
-  rq.sdfnow returns the rho statistic
-  waldTestnow accommodates JK1 denominator degrees of freedom explicitly and allows for cases where the F-test is not valid to return, but only returns the chi-squared test.
-  waldTestnow coversrq.sdfandmixed.sdfmodels.
NEW FEATURES
- the vignette was expanded to include more links. 
- The - ==(equal) and- !=(not equal) comparison operators now work on- edsurvey.data.frameand/or- edsurvey.data.frame.listobjects to compare objects by their attributes (e.g., survey, year, country).
TECHNICAL
-  readNAEPnow allows fr2 paths that are seperate from the usual location and is more robust to variation in capitalization, but requires that a file name be unique, regardless of capitalization.
-  getPSUvarandgetStratumVarare now more robust and return more informative warnings.
- Added - match.argparameter matching for the following functions/arguments for usability:- downloadPISA (database),- readCivEDICCS (dataSet, gradeLvl),- readICILS (dataSet),- readPISA (database, cognitive),- readTALIS (isced, dataLevel),- readTIMSS (gradeLvl),- readTIMSSAdv (subject).
BUG FIXES
-  readPIAACwas fixed so it can now find cached files forusa17andusa12_14. Additionally,match.argchecks were added to better check user inputted data to thecountriesandusaOptionarguments.
- The - readNAEPfunction has updates for the- frPathargument to correctly handle relative paths, path expansion (~), and additional file validation checks.
- The - gapfunction now correctly parses expressions that use a variable in the global environment.
- The - summary2function now gives a clearer error when something that is not the name of a weight column is passed to the- weightVarargument.
Changes in version 2.6.0
DATA UPDATES
- ECLS_B is now a supported dataset with the - readECLS_Bfunction.
-  downloadECLS_KandreadECLS_K2011now download and read in, respectively, the latest ECLS-K:2011 K-5 data.
-  readTALISwas updated to work with TALIS 2018.
-  downloadPISAadded support for 2018 and now downloads 2018 data. OECD compressed the 2018 PISA with an algorithm not implemented in R (DEFLATE64), and so users are instructed to uncompress it outside of R and EdSurvey before using the data.
-  readPIAACwas updated to work with files OECD released in early 2020. This includes adding ausaOptionargument that allows the user to select between the 2012-2014 file and the 2017 file for U.S. data.
-  readICILSwas updated to work with files IEA released early in 2020.
- EdSurvey now supports NAEP assessments with no plausible value variables, such as some National Indian Education Study (NIES) data. 
- EdSurvey now supports TIMSS 2019 pre-release files, including bridge files. 
- For - readHSLSand- readELSwe bug an issue parsing- .savvariable format definitions relating to integer definitions with a display of more than 10 digits. We believe this does not change how EdSurvey reads in any public use files. The issue can arise if the user converts restricted use files with Stat/Transfer.
- Some IEA datasets have convenience variables that EdSurvey incorrectly identified as plausible value variables that are discretized plausible values. EdSurvey already internally generates similar convenience variables based on the existing plausible values and does not use the IEA variables. As of 2.6.0, these columns are not linked as plausible variables and are ignored. Affected datasets include TIMSS, TIMSS Advanced, PIRLS, and ePIRLS. 
STATISTICS
-  cor.sdfnow reports standard errors linearized back to the correlation space in the result valuese. It also reports the confidence interval in the Fisher-Z space asZconfidenceIntervaland a confidence interval mapped back to the correlation space (without linearization) asconfidenceInterval.
-  cor.sdfnow allows users to select the Fisher-Z transformation or no transformation with thetransformationargument. The default of using the Fisher Z for Pearson and no transformation for other correlation types remains the same.
-  cor.sdfnow allows the user to set a value ofjrrIMaxto allow faster computation.
- Updated - oddsRatioto use t-statistic instead of Z, and to identify the confidence interval in the linear space and then convert.
NEW FEATURES
- The - +function now works on- edsurvey.data.frameand/or- edsurvey.data.frame.listobjects to build an- edsurvey.data.frame.list. See- ?edsurvey.data.frame.listfor examples.
- The - %in%function now works on- edsurvey.data.frame.listobjects, identifying if a survey is in an- edsurvey.data.frame.list. See- ?edsurvey.data.frame.listfor examples.
-  percentile,achievementLevels, andedsurveyTablenow warns when anedsurvey.data.frame.listis passed to it with anedsurvey.data.frameon it that cannot return valid results for the requested statistics. The output isNAfor these surveys.
-  gapnow supports linking error in PISA for math, reading, and science when a user setsincludeLinkingErrortoTRUE.
-  contourPlotdeprecated the argumentcolorswhich only set density colors. Instead, the function now usesdensityColorsandpointColors, which set the density and point colors separately. This will cause calls tocontourPlotthat usecolorsto not function with 2.6.0.
- When a - summary2call is made with a- light.edsurvey.data.frame, unfilled levels will not be returned in the table anymore.
- Users can now assign a variable with $, including subsets. Variables on the original data cannot be removed but can be overwritten. All changes are for the session only and do not change the data on the disk. 
TECHNICAL
-  getDatanow closes LaF connections regardless of how it exits (e.g., with an error). This should improve the stability of the package.
- An - edsurvey.data.framenow can specify a- fr2path. This is used only for NAEP surveys and no functionality depends on it, but it may be used in the future. The value of- fr2pathmay always be set to- NA.
-  achievementLevelsnow warns when anedsurvey.data.frame.listis passed to it with anedsurvey.data.frameon it that cannot return valid results for the requested statistics. The output isNAfor these surveys.
BUG FIXES
-  cor.sdfandpercentileincorrectly calculated standard errors when the pre-multiplier for the replicate weights was not one. Users can see that for a survey usinggetAttributes(data, "jkSumMultiplier")for their data.
-  cor.sdfpreviously failed to calculate some correlations when some records had non-zero full sample weights and zero replicate weights. Now results are still calculated.
- With TIMSS data, when a user updated a teacher-level variable, it used to prevent the user from using the - edsurvey.data.framefor non-teacher-level analysis. We fixed this issue. Thanks to Anders Astrup Christensen for identifying this issue and sharing reproducible code that is now incorporated into the package as a test.
Changes in version 2.5.0
NEW DATA
- Added support for the 2007-2008 Beginning Teacher Longitudinal Study (see - readBTLS) for Waves 1 to 5.
DATA UPDATES
-  readPISAuses less memory when reading in PISA 2015 data. This is because it now uses thehavenpackage'sread_savnewskipandn_maxarguments to read in portions of files. This should save time on read in.
- Other PISA files were changed to use base R read functions, this results in a different ordering of columns in 2012 and 2015. 
- Some PISA 2003 variables were read in as an integer and are now in as a real values, mostly hours variables. 
- After the OECD files were updated, - downloadPIAACand- readPIAACnow properly use the 2012 USA file for Cycle 1.
TECHNICAL
- To allow both 4.0 and 3.6 compatibility, an - edsurvey.data.frame.listnow stores- covsas characters in preference to factors.
-  EdSurveyis no longer importsreadr,rvest,stringi,stringr, orxml2. In addition,RColorBrewerwas imported but is now suggested.
-  ContourPlotnow usesheat.colorsby default.
- Several data files were stored in text format in functions and are now stored in - inst/extdataas- rdsfiles.
- The news for 2.4.0 was edited to note that the - gaplinking error in NAEP is only for mathematics and reading.
BUG FIXES
-  lm.sdfnow rounds the printed coefficients matrix in the standard way.
- Fixed a bug in - edsurveyTablethat caused it to return incorrectly labeled output if multiple covariates were tabbed with weights.
- A broken link in the vignette was fixed. 
Changes in version 2.4.0
NEW DATA
- Added support for the 2009 High School Longitudinal Study (see - downloadHSLSand- readHSLS). These do not support Restricted Use Data (RUD).
- Added support for the 2002 Education Longitudinal Study (see - downloadELSand- readELS).
- Added the ECLS Kindergarten Class of 1998-1999 Study. Thes datasets can be downloaded with - downloadECLS_Kand read in with- readECLS_K1998. This was added in 2.3.0 but first added to the NEWS for 2.4.0.
- Added support for ePIRLS (see - read_ePIRLSand- download_ePIRLS). This was added in 2.3.0 but first added to the NEWS for 2.4.0.
DATA UPDATES
- Added support for 2018 for the existing - readICILSfunction. Thanks to Jeppe Bundsgaard of Aarhus University, Danish School of Education, for contributing the code for this.
- OECD moved the PIAAC data to a new location, and the - downloadPIAACfunction now uses the new URL.
- The PISA 2015 data cache could have been formed incorrectly; that is fixed. When PISA 2015 datasets are first used with 2.4 it will take time to re-cache the data. The process now also uses far less memory. 
- The PISA data had incorrect PSU and stratum variables for most years. They are all fixed except for 2000, which we do not believe has a PSU variable on the file. Several strata have only one PSU, so the design still needs to be edited by the user to get correct Taylor series sampling variance estimates. 
UPDATES
- The - mixed.sdffunction now correctly aggregates results and has its methodology documented. It no longer supports binomial models and has several arguments deprecated as a result.
- The - gapfunction argument- varMethodis deprecated. The function uses only jackknife variance estimation.
- The - gapfunction now accounts for linking error between NAEP paper and digitally based assessments for mathematics and reading.
- The - subsetfunction used to fail when a global variable shared a name with a column on the data; it now works.
- The - percentilefunction has been updated to a formula that generates survey percentiles that are robust to transformation. For example, if the values are multiplied by a constant, then the percentiles move by that same constant.
- The - lm.sdffunction header now prints the number of plausible values used (when they are used) as well as the number of plausible values used in the sampling variance (- jrrIMax).
- The - levelsSDFfunction now makes more informative warnings when passed a- light.edsurvey.data.frame.
- The - printfunction for- cor.sdfnow prints a final new line.
- When - searchSDFwas called with more than one search string and the- levelsargument was set to- TRUE, a useless warning was issued. The warning was removed.
- More - EdSurveyfunctions wrap to the width of the console.
Changes in version 2.3.0
NEW FEATURES
- The - rq.sdffunction adds quantile regression to the package. See the- rq.sdfdocumentation for more information.
- The functions - getStratumVarand- getPSUVarwere added to give users stratum and PSU variables for surveys and Taylor series analyses.
- The - summary2function now accepts a vector of variables.
- The - searchSDFfunction now accepts a vector- stringinput so that results can be filtered.
- The formula for degrees of freedom when Taylor series variance estimation is used has been updated. The new formula is derived in the statistics vignette. See https://www.air.org/sites/default/files/EdSurvey-Statistics.pdf. 
NEW DATA
- PISA 2015 is now supported. 
Changes in version 2.2.0
NEW FEATURES
- The - waldTestfunction allows the user to test composite hypotheses–hypotheses with multiple coefficients involved–even when the data include plausible values. Because there is no likelihood test for plausible values nor residuals, the Wald test fills the role of the likelihood ratio test, ANOVA, and F-test.
- The - mvrlmfunction adds multivariate regression (a regression with multiple outcomes) to the package. See the- mvrlmdocumentation for more information.
- Survey weighted mixed models can now be fit with the - mixed.sdffunction. Both linear and logistic models can be fit. These models are limited to 2 levels (one level with random effects).
- Regressions can now output standardized regression coefficients using - summary(myLm, src=TRUE). When the- lm.sdfcall includes- standardizeWithSamplingVar=TRUEthe standardized error of the standardized regression coefficient account for the sampling error and measurement error (when applicable). Otherwise, the standard deviations are assumed to be measured without error.
- Added - summary2function to produce unweighted and weighted descriptive statistics of a variable in- edsurvey.data.frameor- light.edsurvey.data.frame
- Added $ variable access to edsurvey.data.frame. e.g. sdf$dsex 
- Updated covariance matrix estimation in the - vcovfunction for- lm.sdfand- glm.sdfto work when- varEstInputswas not returned
- Added covariance matrix estimation for - lm.sdfwhen Taylor series variance estimation was used.
- Added - rebindAttributesfunction to make dplyr interaction more smooth. See the- rebindAttributesdocumentation for an example.
- When printing an - edsurvey.data.frameit now says the survey, year, subject, and country at the top. The dimensions are moved down.
- In - gap, achievement levels can be specified with partial matches. This helps out when achievement levels have long names.
NEW DATA
- Added the ECLS Kindergarten Class of 2010-2011 Study. Longitudinal datasets can be downloaded with - downloadECLS_Kand read in with- readECLS_K2011.
- PIRLS 2016 is now supported. 
NEW STATISTICS
- Added - returnNumberOfPSUin- achievenemtLevels,- percentile,- lm.sdf, and- gapto report the number of primary sampling units (PSUs) used to calculate a statistic.
- Added - oddsRatiohelper function for logit results to show odds ratios.
- When running a Pearson correlation on a discrete variable, - cor.sdffunction by default condenses the occupied response codes to be consecutive integers. This can now be turned off by setting the- condenseLevelsargument to- FALSEso that the code book levels are used instead.
UPDATES
- The - glm.sdffunction now uses the- glm2package to fit models. This package converges on a broader class of models.
- The - EdSurveypackage nolonger sets the number of threads used by the required- data.tablepackage to one when EdSurvey is being attached. The issue is now fixed in- data.table.
- Across the download functions, when a file is missing the warning text was homogenized. Additionally, all download functions now support a - verboseargument that can be used to make downloads silent.
- The - lm.sdfand- glm.sdffunctions now accept formulas that use the- I()function or other unevaluated expressions to the left of the tilda. Previously only a single variable could be named.
-  recode.sdfnow checks that each recode has only atoandfromin it.
- The - edsurveyTablefunction now works without RHS variable, allowing the formula- y ~ 1to return overall means.
- The - percentilefunction used to produce output even if asked to produce a percentile outside of the valid range (0 to 100). Now it prints out message if users input at least percentile outside of the valid range, and stops if all percentiles are invalid.
- The - getDatafunction now removes rows with omitted levels after being recoded.
- All SPSS (.sav) file reads using the - havenpackage set the- user_na = TRUEflag to ensure no defined missing/omitted values are automatically converted to- NAvalues prematurely.
- For consistency with other download functions, - downloadPISAnow uses a- yearsargument instead of a- yearargument.
BUG FIXES
- Running an - edsurveyTableon an- edsurvey.data.frame.listused to have the potential to create invalid (unprintable) output if the factor levels did not agree on every element of the- edsurvey.data.frame.list. They now return printable output.
- An - edsurveyTablecould produce a standard error when there was data from only one stratum. It now produces an- NAstandard error.
Changes in version 2.0.0
NEW DATA
- Works with the Trends in International Mathematics and Science Study (TIMSS), TIMSS Advanced, the Progress in International Reading Literacy Study (PIRLS), and the International Computer and Information Literacy Study (ICILS), International Civic and Citizenship Education Study (ICCS), the Civic Education Study (CivEd), the Program for International Student Assessment (PISA), the Program for the International Assessment of Adult Competencies (PIAAC), and the Teaching and Learning International Survey (TALIS). 
- International datasets can be downloaded with - downloadTIMSS,- downloadTIMSSAdv,- downloadPIRLS,- downloadICILS,- downloadICCS,- downloadCivEDICCS,- downloadPISA,- downloadPIAAC,- downloadTALIS.
- International datasets can be loaded with - readTIMSS,- readTIMSSAdv,- readPIRLS,- readICILS,- readICCS,- readCivEDICCS,- readPISA,- readPIAAC,- readTALIS.
NEW STATISTICS
- Added - logit.sdfand- probit.sdffunctions with support for survey item responses.
- Added - gapthat compares the average, percentile, achievement level, or percentage of survey responses between two groups that potentially share members.
- Added - percentilethat calculates the percentiles of a numeric variable.
- Added - showCodebookthat retrieves variable names, variable labels, and value labels for an- edsurvey.data.frame,- light.edsurvey.data.frame, or- edsurvey.data.frame.list.
UPDATES
- Redesigned - achievementLevels,- edsurveyTablefunction for significant faster computation with much smaller memory footprint. We also made error messages and outputs more informative.
- More informative error message and output for - cor.sdf,- levelsSDF,- getPlausibleValue,- print.edsurvey.data.frame,- searchSDF,- showPlausibleValues,- showWeights, and- getData.
-  lm.sdfandglm.sdfare now S3 method extended fromstats::lmso users can call the function usinglmandglm.
- Added the - contourPlotfor regressions diagnostic.
- Added the - recode.sdffunction for recoding levels within variables.
- Added the - rename.sdffunction for modifying variable names.
- Added the - append.edsurvey.data.frame.listfunction to return a list of sdfs from either an edsurvey.data.frame.list or a single edsurvey.data.frame.
DOCUMENTATION
- Manual documentation was refreshed. 
- Moved vignettes to AIR webiste at https://www.air.org/project/nces-data-r-project-edsurvey or see links in the vignette inlcuded in this package. 
- Added a new vignette on “Exploratory Data Analysis on NCES Data” provides examples of conducting exploratory data analysis on NAEP data. 
- Added new vignette on “Calculating Adjusted p-Values From EdSurvey Results” to the AIR website describing the basics of adjusting p-Values to account for multiple comparisons. 
- Added new vignette on “Using EdSurvey to Analyze TIMSS Data” to the AIR website describing; an introduction to the methods used in analysis of large-scale educational assessment programs such as Trends in International Mathematics and Science Study (TIMSS) using the EdSurvey package. The vignette covers topics such as preparing the R environment for processing, creating summary tables, running linear regression models, and correlating variables. 
- Added new vignette on “Using EdSurvey for Trend Analysis” to the AIR website describing the methods used in the EdSurvey package to conduct analyses of statistics that change over time in large-scale educational studies. 
- Added new vignette on “Producing LaTeX Tables From edsurveyTable Results With edsurveyTable2pdf” to the AIR website detailing the creation of pdf summary tables from summary results using the edsurveyTable2pdf function. 
- Added new methodology documentation on “Methods Used for Gap Analysis in EdSurvey” to the AIR website convering the methods comparing the gap analysis results of the EdSurvey package to the NAEP Data Explorer. 
- Added new methodology documentation on “Methods Used for Estimating Percentiles in EdSurvey” to the AIR website describing the methods used to estimate percentiles. 
- Added new methodology documentation on “Weighted and Unweighted Correlation Methods for Large-Scale Educational Assessment: wCorr Formulas” to the AIR website detailing the methodology used by the wCorr R package for computing the Pearson, Spearman, polyserial, and polychoric correlations, with and without weights applied. See https://www.air.org/resource/report/weighted-and-unweighted-correlation-methods-large-scale-educational-assessment. 
BUG FIXES
- Fixed connection issue associated with closing connections to an - LaF.
Changes in version 1.0.6
- readNAEP function now works on a case sensitive file system. 
- Vignettes now should appear in numerical order on CRAN. 
- Examples now name most arguments. 
- Vignettes now name most argument. 
- some print functions had number of plausible values added.