Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 4 |
Since 2006 (last 20 years) | 20 |
Descriptor
Computation | 21 |
Error of Measurement | 21 |
Item Response Theory | 7 |
National Competency Tests | 7 |
Statistical Analysis | 7 |
Comparative Analysis | 6 |
Sampling | 6 |
Scores | 6 |
Accuracy | 5 |
Equated Scores | 5 |
Regression (Statistics) | 5 |
Source
ETS Research Report Series | 21 |
Author
Moses, Tim | 4 |
Oranje, Andreas | 4 |
Qian, Jiahe | 4 |
Haberman, Shelby J. | 3 |
Holland, Paul | 2 |
Kim, Sooyeon | 2 |
Lee, Yi-Hsuan | 2 |
Li, Deping | 2 |
Zhang, Jinming | 2 |
Antal, Tamás | 1 |
Braun, Henry | 1 |
Publication Type
Journal Articles | 21 |
Reports - Research | 21 |
Speeches/Meeting Papers | 1 |
Tests/Questionnaires | 1 |
Education Level
Elementary Education | 3 |
Grade 8 | 3 |
Higher Education | 3 |
Junior High Schools | 3 |
Middle Schools | 3 |
Postsecondary Education | 3 |
Secondary Education | 3 |
Grade 4 | 2 |
Intermediate Grades | 2 |
Location
California | 1 |
Nevada | 1 |
New Jersey | 1 |
Laws, Policies, & Programs
No Child Left Behind Act 2001 | 1 |
Assessments and Surveys
National Assessment of Educational Progress | 7 |
Praxis Series | 1 |
SAT (College Admission Test) | 1 |
What Works Clearinghouse Rating
Qian, Jiahe – ETS Research Report Series, 2020
The finite population correction (FPC) factor is often used to adjust variance estimators for survey data sampled from a finite population without replacement. As a replication-based resampling method, the jackknife approach is usually implemented without the FPC factor incorporated in its variance estimates. A paradigm is proposed to compare the…
Descriptors: Computation, Sampling, Data, Statistical Analysis
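As a point of reference for the two estimators being compared (this is not the paper's proposed paradigm), a delete-one jackknife variance of a sample mean can simply be multiplied by the usual FPC factor (N - n)/(N - 1) for simple random sampling without replacement. A minimal sketch with made-up data:

# Illustrative delete-one jackknife variance of a sample mean, with and without
# the finite population correction (FPC) factor; all data here are synthetic.
import numpy as np

def jackknife_variance(sample):
    """Delete-one jackknife variance estimate of the sample mean."""
    n = len(sample)
    loo_means = np.array([np.delete(sample, i).mean() for i in range(n)])
    return (n - 1) / n * np.sum((loo_means - loo_means.mean()) ** 2)

rng = np.random.default_rng(0)
N = 500                                                   # hypothetical finite population size
population = rng.normal(50, 10, size=N)
sample = rng.choice(population, size=50, replace=False)   # SRS without replacement

v_jack = jackknife_variance(sample)
fpc = (N - len(sample)) / (N - 1)                         # common FPC factor for SRSWOR
print("jackknife variance:   ", v_jack)
print("FPC-adjusted variance:", fpc * v_jack)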
Jewsbury, Paul A. – ETS Research Report Series, 2019
When an assessment undergoes changes to the administration or instrument, bridge studies are typically used to try to ensure comparability of scores before and after the change. Among the most common and powerful is the common population linking design, with the use of a linear transformation to link scores to the metric of the original…
Descriptors: Evaluation Research, Scores, Error Patterns, Error of Measurement
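For context, the linear transformation used in a common population linking design can be written as a mean-sigma adjustment that maps a score on the changed instrument back to the original metric. A minimal sketch, with hypothetical common-population moments:

# Minimal mean-sigma linear linking sketch (illustrative only; moments are made up).
def linear_link(x, mean_new, sd_new, mean_old, sd_old):
    """Map a score from the new form's metric to the original form's metric."""
    slope = sd_old / sd_new
    intercept = mean_old - slope * mean_new
    return slope * x + intercept

# Hypothetical moments estimated on the same (common) population before and after the change
print(linear_link(510.0, mean_new=500.0, sd_new=95.0, mean_old=505.0, sd_old=100.0))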
Wang, Lin; Qian, Jiahe; Lee, Yi-Hsuan – ETS Research Report Series, 2018
Educational assessment data are often collected from a set of test centers across various geographic regions, and therefore the data samples contain clusters. Such cluster-based data may result in clustering effects in variance estimation. However, in many grouped jackknife variance estimation applications, jackknife groups are often formed by a…
Descriptors: Item Response Theory, Scaling, Equated Scores, Cluster Grouping
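A hedged sketch of the contrast at issue: a grouped (delete-one-group) jackknife in which the groups coincide with the sampling clusters, versus groups formed by an arbitrary split of the file. With cluster-correlated data the two can give noticeably different variance estimates. The data and groupings below are made up and this is not the report's procedure.

# Illustrative delete-one-group jackknife variance of a mean.
import numpy as np

def grouped_jackknife_variance(values, groups):
    """Delete-one-group jackknife variance estimate of the overall mean."""
    labels = np.unique(groups)
    G = len(labels)
    dog_means = np.array([values[groups != g].mean() for g in labels])
    return (G - 1) / G * np.sum((dog_means - dog_means.mean()) ** 2)

rng = np.random.default_rng(1)
clusters = np.repeat(np.arange(20), 30)                # 20 hypothetical test centers, 30 examinees each
cluster_effects = rng.normal(0, 5, size=20)[clusters]  # induces within-cluster correlation
scores = 250 + cluster_effects + rng.normal(0, 10, size=clusters.size)

arbitrary = np.arange(clusters.size) % 20              # groups formed while ignoring the clustering
print("groups = clusters:", grouped_jackknife_variance(scores, clusters))
print("arbitrary groups: ", grouped_jackknife_variance(scores, arbitrary))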
Kane, Michael T. – ETS Research Report Series, 2017
By aggregating residual gain scores (the differences between each student's current score and a predicted score based on prior performance) for a school or a teacher, value-added models (VAMs) can be used to generate estimates of school or teacher effects. It is known that random errors in the prior scores will introduce bias into predictions of…
Descriptors: Error of Measurement, Value Added Models, Scores, Teacher Effectiveness
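As a simple illustration of the quantity being aggregated (not Kane's analysis), a residual gain score is the difference between a student's current score and the score predicted from prior performance by regression; those residuals can then be averaged by teacher or school. Everything below is synthetic.

# Illustrative residual gain scores averaged by teacher (made-up data, simple OLS prediction).
import numpy as np

rng = np.random.default_rng(2)
n = 300
prior = rng.normal(200, 25, size=n)                     # prior-year scores
teacher = rng.integers(0, 10, size=n)                   # 10 hypothetical teachers
current = 0.8 * prior + 50 + rng.normal(0, 10, size=n)  # current-year scores

# Predict current score from prior score by least squares, then take residuals
slope, intercept = np.polyfit(prior, current, deg=1)
residual_gain = current - (slope * prior + intercept)

for t in range(10):
    print(f"teacher {t}: mean residual gain = {residual_gain[teacher == t].mean():.2f}")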
Kim, Sooyeon; Moses, Tim; Yoo, Hanwook Henry – ETS Research Report Series, 2015
The purpose of this inquiry was to investigate the effectiveness of item response theory (IRT) proficiency estimators in terms of estimation bias and error under multistage testing (MST). We chose a 2-stage MST design in which 1 adaptation to the examinees' ability levels takes place. It includes 4 modules (1 at Stage 1, 3 at Stage 2) and 3 paths…
Descriptors: Item Response Theory, Computation, Statistical Bias, Error of Measurement
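To make the design concrete, a two-stage MST with one Stage 1 routing module and three Stage 2 modules needs a routing rule that sends examinees to an easier or harder second-stage module based on Stage 1 performance. The cutoffs below are entirely hypothetical; the study's routing criteria are not described in this excerpt.

# Hypothetical routing rule for a 2-stage multistage test (1 Stage 1 module, 3 Stage 2 modules).
def route_to_stage2(stage1_number_correct, low_cut=8, high_cut=14):
    """Return the Stage 2 module for a given Stage 1 number-correct score."""
    if stage1_number_correct < low_cut:
        return "easy"
    if stage1_number_correct < high_cut:
        return "medium"
    return "hard"

for nc in (5, 10, 18):
    print(nc, "->", route_to_stage2(nc))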
Moses, Tim; Holland, Paul – ETS Research Report Series, 2008
The purpose of this paper is to extend von Davier, Holland, and Thayer's (2004b) framework of kernel equating so that it can incorporate raw data and traditional equipercentile equating methods. One result of this more general framework is that previous equating methodology research can be viewed more comprehensively. Another result is that the…
Descriptors: Equated Scores, Error of Measurement, Statistical Analysis, Computation
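As background on the traditional methods being folded into the framework, a bare-bones percentile-rank (equipercentile) equating of two discrete score distributions might look like the sketch below. The frequencies are made up, integer score points are assumed, and no presmoothing or continuization is applied; this is not the kernel equating machinery itself.

# Rough percentile-rank (equipercentile) equating sketch, illustrative only.
import numpy as np

def percentile_ranks(freqs):
    """Percentile rank at each integer score (midpoint convention)."""
    p = freqs / freqs.sum()
    cum = np.cumsum(p)
    return 100 * (cum - p / 2)

def equipercentile(freq_x, freq_y):
    """Equate each form X score to the form Y score with the same percentile rank."""
    pr_x = percentile_ranks(freq_x)
    pr_y = percentile_ranks(freq_y)
    scores_y = np.arange(len(freq_y))      # integer score points 0..K assumed
    return np.interp(pr_x, pr_y, scores_y)

freq_x = np.array([2, 5, 12, 20, 25, 20, 10, 4, 2], float)   # made-up raw-score frequencies
freq_y = np.array([1, 4, 10, 18, 26, 22, 12, 5, 2], float)
print(np.round(equipercentile(freq_x, freq_y), 2))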
Oranje, Andreas; Li, Deping; Kandathil, Mathew – ETS Research Report Series, 2009
Several complex sample standard error estimators based on linearization and resampling for the latent regression model of the National Assessment of Educational Progress (NAEP) are studied with respect to design choices such as number of items, number of regressors, and the efficiency of the sample. This paper provides an evaluation of the extent…
Descriptors: Error of Measurement, Computation, Regression (Statistics), National Competency Tests
Descriptors: Error of Measurement, Computation, Regression (Statistics), National Competency Tests
Haberman, Shelby J. – ETS Research Report Series, 2008
The reliability of a scaled score can be computed by use of item response theory. Estimated reliability can be obtained even if the item response model selected is not valid.
Descriptors: Reliability, Scores, Item Response Theory, Computation
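One standard way to express score reliability under an IRT model is Var(E[S | theta]) / (Var(E[S | theta]) + E[Var(S | theta)]). The sketch below evaluates that ratio for a number-correct score under a simple Rasch-type model with made-up item difficulties; it is only an illustration of the decomposition, not Haberman's estimator, and a linear scaling of the score would leave the ratio unchanged.

# Illustrative IRT-based reliability of a number-correct score (synthetic items and abilities).
import numpy as np

rng = np.random.default_rng(3)
b = rng.normal(0, 1, size=30)                  # hypothetical Rasch item difficulties
theta = rng.normal(0, 1, size=5000)            # ability draws

p = 1 / (1 + np.exp(-(theta[:, None] - b)))    # correct-response probabilities
true_score = p.sum(axis=1)                     # E[number correct | theta]
error_var = (p * (1 - p)).sum(axis=1)          # Var(number correct | theta)

var_true = np.var(true_score)
reliability = var_true / (var_true + error_var.mean())
print(round(reliability, 3))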
Li, Deping; Oranje, Andreas; Jiang, Yanlin – ETS Research Report Series, 2007
The hierarchical latent regression model (HLRM) is a flexible framework for estimating group-level proficiency while taking into account the complex sample designs often found in large-scale educational surveys. In a complex assessment design in which information is collected at different levels (such as student, school, and district), the model also…
Descriptors: Hierarchical Linear Modeling, Regression (Statistics), Computation, Comparative Analysis
Kim, Sooyeon; Livingston, Samuel A.; Lewis, Charles – ETS Research Report Series, 2008
This paper describes an empirical evaluation of a Bayesian procedure for equating scores on test forms taken by small numbers of examinees, using collateral information from the equating of other test forms. In this procedure, a separate Bayesian estimate is derived for the equated score at each raw-score level, making it unnecessary to specify a…
Descriptors: Equated Scores, Statistical Analysis, Sample Size, Bayesian Statistics
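A generic way to picture this kind of estimate is a normal-normal (precision-weighted) combination at a single raw-score level, with the prior standing in for collateral information from other test forms. This is only a textbook shrinkage illustration, not the procedure developed in the report, and all numbers are hypothetical.

# Minimal precision-weighted posterior mean at one raw-score level (illustrative only).
def posterior_mean(direct_estimate, direct_se, prior_mean, prior_sd):
    """Shrink a small-sample direct estimate toward a collateral-information prior."""
    w = (1 / direct_se**2) / (1 / direct_se**2 + 1 / prior_sd**2)
    return w * direct_estimate + (1 - w) * prior_mean

# Hypothetical equated score at one raw-score level from a tiny examinee sample
print(posterior_mean(direct_estimate=31.4, direct_se=2.0, prior_mean=29.8, prior_sd=1.0))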
Lee, Yi-Hsuan; Zhang, Jinming – ETS Research Report Series, 2008
The method of maximum-likelihood is typically applied to item response theory (IRT) models when the ability parameter is estimated while conditioning on the true item parameters. In practice, the item parameters are unknown and need to be estimated first from a calibration sample. Lewis (1985) and Zhang and Lu (2007) proposed the expected response…
Descriptors: Item Response Theory, Comparative Analysis, Computation, Ability
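The basic idea of an expected response function can be sketched as an item response function averaged over uncertainty in the item parameters from calibration, rather than evaluated at point estimates. The parameter draws below are made up for illustration; this is not the specific formulation studied in the report.

# Rough sketch of an expected response function (ERF) for a 2PL item.
import numpy as np

def irf_2pl(theta, a, b):
    return 1 / (1 + np.exp(-a * (theta - b)))

rng = np.random.default_rng(4)
a_draws = rng.normal(1.2, 0.10, size=200)    # hypothetical calibration uncertainty in a
b_draws = rng.normal(0.3, 0.15, size=200)    # and in b

theta = 0.0
erf = np.mean(irf_2pl(theta, a_draws, b_draws))          # average IRF over parameter draws
print("ERF at theta=0:", round(erf, 3), "vs point-estimate IRF:", round(irf_2pl(0.0, 1.2, 0.3), 3))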
Moses, Tim; Holland, Paul – ETS Research Report Series, 2007
The purpose of this study was to empirically evaluate the impact of loglinear presmoothing accuracy on equating bias and variability across chained and post-stratification equating methods, kernel and percentile-rank continuization methods, and sample sizes. The results of evaluating presmoothing on equating accuracy generally agreed with those of…
Descriptors: Equated Scores, Statistical Analysis, Accuracy, Sample Size
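Loglinear presmoothing of a raw-score distribution amounts to fitting a Poisson loglinear model with polynomial score terms to the observed frequencies and using the fitted frequencies in place of the observed ones. The degree-3 model and frequencies below are made up and are not the configurations evaluated in the study.

# Illustrative loglinear presmoothing of a score distribution (synthetic frequencies).
import numpy as np
import statsmodels.api as sm

scores = np.arange(21)
freqs = np.array([1, 2, 4, 7, 11, 16, 22, 27, 30, 31, 30,
                  27, 22, 16, 12, 8, 5, 3, 2, 1, 1], float)

# Design matrix: intercept plus score, score^2, score^3 (standardized for numerical stability)
z = (scores - scores.mean()) / scores.std()
X = np.column_stack([np.ones_like(z), z, z**2, z**3])

fit = sm.GLM(freqs, X, family=sm.families.Poisson()).fit()
smoothed = fit.fittedvalues        # presmoothed frequencies; total matches the observed total
print(np.round(smoothed, 2))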
Zhang, Jinming; Lu, Ting – ETS Research Report Series, 2007
In practical applications of item response theory (IRT), item parameters are usually estimated first from a calibration sample. After treating these estimates as fixed and known, ability parameters are then estimated. However, the statistical inferences based on the estimated abilities can be misleading if the uncertainty of the item parameter…
Descriptors: Item Response Theory, Ability, Error of Measurement, Maximum Likelihood Statistics
Antal, Tamás – ETS Research Report Series, 2007
A full account of the latent regression model for the National Assessment of Educational Progress is given. The treatment includes derivation of the EM algorithm, the Newton-Raphson method, and the asymptotic standard errors. The paper also features the use of the adaptive Gauss-Hermite numerical integration method as a basic tool to evaluate…
Descriptors: Regression (Statistics), Item Response Theory, National Competency Tests, Evaluation Methods
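The basic numerical tool mentioned here, Gauss-Hermite quadrature, approximates expectations over a normally distributed latent ability with a small set of nodes and weights. The sketch below is the plain, non-adaptive version with an arbitrary integrand, not the adaptive scheme used in the report.

# Gauss-Hermite quadrature for E[f(theta)] with theta ~ N(mu, sigma^2); illustrative only.
import numpy as np

def gauss_hermite_expectation(f, mu, sigma, n_points=15):
    """Approximate E[f(theta)] for theta ~ N(mu, sigma^2) by Gauss-Hermite quadrature."""
    nodes, weights = np.polynomial.hermite.hermgauss(n_points)
    theta = mu + np.sqrt(2.0) * sigma * nodes          # change of variables to the N(mu, sigma^2) scale
    return np.sum(weights * f(theta)) / np.sqrt(np.pi)

# Example: E[logistic(theta)] for theta ~ N(0.5, 1)
logistic = lambda t: 1 / (1 + np.exp(-t))
print(gauss_hermite_expectation(logistic, mu=0.5, sigma=1.0))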
Braun, Henry; Qian, Jiahe – ETS Research Report Series, 2008
This report describes the derivation and evaluation of a method for comparing the performance standards for public school students set by different states. It is based on an approach proposed by McLaughlin and associates, which constituted an innovative attempt to resolve the confusion and concern that occurs when very different proportions of…
Descriptors: State Standards, Comparative Analysis, Public Schools, National Competency Tests
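One simplified way to express the core equipercentile idea behind such mappings: if a fraction p of a state's students meet the state standard, take the (1 - p) quantile of a common-scale score distribution for comparable students as the standard's equivalent cut score. The sketch below uses made-up student-level data and is not the method derived in the report.

# Illustrative mapping of a state proficiency standard onto a common scale (synthetic data).
import numpy as np

rng = np.random.default_rng(5)
common_scale_scores = rng.normal(250, 35, size=10000)   # hypothetical common-scale scores
p_meeting_state_standard = 0.62                          # hypothetical percent meeting the state standard

equivalent_cut = np.quantile(common_scale_scores, 1 - p_meeting_state_standard)
print(round(equivalent_cut, 1))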