ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	6
Since 2006 (last 20 years)	14

Descriptor

Statistical Analysis	107
True Scores	107
Mathematical Models	37
Error of Measurement	36
Test Reliability	35
Correlation	28
Measurement Techniques	18
Reliability	18
Test Interpretation	17
Analysis of Variance	14
Comparative Analysis	14
Equated Scores	14
Scores	14
Raw Scores	11
Test Theory	11
Criterion Referenced Tests	10
Statistical Significance	10
Test Validity	10
Goodness of Fit	9
Item Response Theory	9
Probability	9
Research Methodology	9
Test Items	9
Testing	9
Analysis of Covariance	8
More ▼

Publication Type

Reports - Research	47
Journal Articles	29
Reports - Evaluative	9
Speeches/Meeting Papers	5
Collected Works - General	1
Guides - Classroom - Teacher	1
Guides - Non-Classroom	1
Information Analyses	1
Reports - Descriptive	1
Reports - General	1

Education Level

Higher Education	3
Postsecondary Education	3
Early Childhood Education	1
Elementary Education	1
Elementary Secondary Education	1
Grade 8	1
Junior High Schools	1
Middle Schools	1
Preschool Education	1
Secondary Education	1

Audience

Researchers	4
Practitioners	2
Administrators	1
Teachers	1

Location

Israel

Laws, Policies, & Programs

Assessments and Surveys

Advanced Placement…	2
Law School Admission Test	2
ACT Assessment	1
College Level Examination…	1
Graduate Management Admission…	1
Illinois Test of…	1
Kit of Reference Tests for…	1
Metropolitan Readiness Tests	1
National Assessment of…	1
SAT (College Admission Test)	1
Test of English as a Foreign…	1
Trends in International…	1
More ▼

What Works Clearinghouse Rating

Showing 1 to 15 of 107 results Save | Export

Observed Scores as Matching Variables in Differential Item Functioning under the One- and Two-Parameter Logistic Models: Population Results. Research Report. ETS RR-19-06

Peer reviewed
PDF on ERIC

Download full text

Guo, Hongwen; Dorans, Neil J. – ETS Research Report Series, 2019

We derive formulas for the differential item functioning (DIF) measures that two routinely used DIF statistics are designed to estimate. The DIF measures that match on observed scores are compared to DIF measures based on an unobserved ability (theta or true score) for items that are described by either the one-parameter logistic (1PL) or…

Descriptors: Scores, Test Bias, Statistical Analysis, Item Response Theory

Asymptotic Standard Errors of Equating Coefficients Using the Characteristic Curve Methods for the Graded Response Model

Peer reviewed

Direct link

Zhang, Zhonghua – Applied Measurement in Education, 2020

The characteristic curve methods have been applied to estimate the equating coefficients in test equating under the graded response model (GRM). However, the approaches for obtaining the standard errors for the estimates of these coefficients have not been developed and examined. In this study, the delta method was applied to derive the…

Descriptors: Error of Measurement, Computation, Equated Scores, True Scores

On True Score Evaluation Using Item Response Theory Modeling

Peer reviewed

Direct link

Raykov, Tenko; Dimitrov, Dimiter M.; Marcoulides, George A.; Harrison, Michael – Educational and Psychological Measurement, 2019

Building on prior research on the relationships between key concepts in item response theory and classical test theory, this note contributes to highlighting their important and useful links. A readily and widely applicable latent variable modeling procedure is discussed that can be used for point and interval estimation of the individual person…

Descriptors: True Scores, Item Response Theory, Test Items, Test Theory

Validating Human and Automated Scoring of Essays against "True" Scores

Peer reviewed

Direct link

Cohen, Yoav; Levi, Effi; Ben-Simon, Anat – Applied Measurement in Education, 2018

In the current study, two pools of 250 essays, all written as a response to the same prompt, were rated by two groups of raters (14 or 15 raters per group), thereby providing an approximation to the essay's true score. An automated essay scoring (AES) system was trained on the datasets and then scored the essays using a cross-validation scheme. By…

Descriptors: Test Validity, Automation, Scoring, Computer Assisted Testing

Measurement Error and Equating Error in Power Analysis

Peer reviewed
PDF on ERIC

Download full text

Phillips, Gary W.; Jiang, Tao – Practical Assessment, Research & Evaluation, 2016

Power analysis is a fundamental prerequisite for conducting scientific research. Without power analysis the researcher has no way of knowing whether the sample size is large enough to detect the effect he or she is looking for. This paper demonstrates how psychometric factors such as measurement error and equating error affect the power of…

Descriptors: Error of Measurement, Statistical Analysis, Equated Scores, Sample Size

Controlling Type I Error Rates in Assessing DIF for Logistic Regression Method Combined with SIBTEST Regression Correction Procedure and DIF-Free-Then-DIF Strategy

Peer reviewed

Direct link

Shih, Ching-Lin; Liu, Tien-Hsiang; Wang, Wen-Chung – Educational and Psychological Measurement, 2014

The simultaneous item bias test (SIBTEST) method regression procedure and the differential item functioning (DIF)-free-then-DIF strategy are applied to the logistic regression (LR) method simultaneously in this study. These procedures are used to adjust the effects of matching true score on observed score and to better control the Type I error…

Descriptors: Test Bias, Regression (Statistics), Test Items, True Scores

The Impact of Test Dimensionality, Common-Item Set Format, and Scale Linking Methods on Mixed-Format Test Equating

Peer reviewed
PDF on ERIC

Download full text

Öztürk-Gübes, Nese; Kelecioglu, Hülya – Educational Sciences: Theory and Practice, 2016

The purpose of this study was to examine the impact of dimensionality, common-item set format, and different scale linking methods on preserving equity property with mixed-format test equating. Item response theory (IRT) true-score equating (TSE) and IRT observed-score equating (OSE) methods were used under common-item nonequivalent groups design.…

Descriptors: Test Format, Item Response Theory, True Scores, Equated Scores

Weighting Test Samples in IRT Linking and Equating: Toward an Improved Sampling Design for Complex Equating. Research Report. ETS RR-13-39

Peer reviewed
PDF on ERIC

Download full text

Qian, Jiahe; Jiang, Yanming; von Davier, Alina A. – ETS Research Report Series, 2013

Several factors could cause variability in item response theory (IRT) linking and equating procedures, such as the variability across examinee samples and/or test items, seasonality, regional differences, native language diversity, gender, and other demographic variables. Hence, the following question arises: Is it possible to select optimal…

Descriptors: Item Response Theory, Test Items, Sampling, True Scores

Confidence Intervals for True Scores Using the Skew-Normal Distribution

Peer reviewed

Direct link

Garcia-Perez, Miguel A. – Journal of Educational and Behavioral Statistics, 2010

A recent comparative analysis of alternative interval estimation approaches and procedures has shown that confidence intervals (CIs) for true raw scores determined with the Score method--which uses the normal approximation to the binomial distribution--have actual coverage probabilities that are closest to their nominal level. It has also recently…

Descriptors: Computation, Statistical Analysis, True Scores, Raw Scores

Rocks: A Concrete Activity That Introduces Normal Distribution, Sampling Error, Central Limit Theorem and True Score Theory

Download full text

Van Duzer, Eric – Online Submission, 2011

This report introduces a short, hands-on activity that addresses a key challenge in teaching quantitative methods to students who lack confidence or experience with statistical analysis. Used near the beginning of the course, this activity helps students develop an intuitive insight regarding a number of abstract concepts which are key to…

Descriptors: Course Content, True Scores, Statistical Analysis, Sampling

Jackknifing Techniques for Evaluation of Equating Accuracy. Research Report. ETS RR-09-39

Peer reviewed
PDF on ERIC

Download full text

Haberman, Shelby J.; Lee, Yi-Hsuan; Qian, Jiahe – ETS Research Report Series, 2009

Grouped jackknifing may be used to evaluate the stability of equating procedures with respect to sampling error and with respect to changes in anchor selection. Properties of grouped jackknifing are reviewed for simple-random and stratified sampling, and its use is described for comparisons of anchor sets. Application is made to examples of item…

Descriptors: Equated Scores, Accuracy, Sampling, Statistical Analysis

An Equipercentile Version of the Levine Linear Observed-Score Equating Function Using the Methods of Kernel Equating. Research Report. ETS RR-07-14

Peer reviewed
PDF on ERIC

Download full text

von Davier, Alina A.; Fournier-Zajac, Stephanie; Holland, Paul W. – ETS Research Report Series, 2007

In the nonequivalent groups with anchor test (NEAT) design, there are several ways to use the information provided by the anchor in the equating process. One of the NEAT-design equating methods is the linear observed-score Levine method (Kolen & Brennan, 2004). It is based on a classical test theory model of the true scores on the test forms…

Descriptors: Equated Scores, Statistical Analysis, Test Items, Test Theory

Reliability and the Nonequivalent Groups with Anchor Test Design. Research Report. ETS RR-07-16

Peer reviewed
PDF on ERIC

Download full text

Moses, Tim; Kim, Sooyeon – ETS Research Report Series, 2007

This study evaluated the impact of unequal reliability on test equating methods in the nonequivalent groups with anchor test (NEAT) design. Classical true score-based models were compared in terms of their assumptions about how reliability impacts test scores. These models were related to treatment of population ability differences by different…

Descriptors: Reliability, Equated Scores, Test Items, Statistical Analysis

The Effect of Correlated Errors of Measurement on Correlations Among Tests: A Correlation for Spearman's Correction for Attenuation

Peer reviewed

Williams, Richard H. – Journal of Experimental Education, 1974

An equation comparable to Spearman's correction for attenuation, which does not depend upon the assumption that error scores are uncorrelated with true scores and with other sets of scores, is derived. (Editor)

Descriptors: Correlation, Error of Measurement, Statistical Analysis, True Scores

What is the True Coefficient of Partial Correlation?

Download full text

Livingston, Samuel A.; Stanley, Julian C. – 1971

Although partial correlation is a correlation of residuals, the correlation of the true-score components of these residuals is not equivalent to the partial correlation of the true scores themselves. The source of this discrepancy is explained and its implications are briefly discussed. (Author)

Descriptors: Correlation, Multiple Regression Analysis, Statistical Analysis, True Scores

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8

Educational and Psychological…	14
Psychometrika	9
ETS Research Report Series	7
Journal of Educational…	5
Applied Measurement in…	3
Journal of Educational…	3
Applied Psychological…	2
Developmental Psychology	2
Test Service Bulletin	2
American Educational Research…	1
Educ Psychol Meas	1
Educational Sciences: Theory…	1
Evaluation Quarterly	1
Journal of Educational and…	1
Journal of Educational and…	1
Journal of Experimental…	1
Measurement and Evaluation in…	1
Multivariate Behavioral…	1
Online Submission	1
Physics Teacher	1
Practical Assessment,…	1
Scandinavian Journal of…	1
More ▼

Lord, Frederic M.	7
Livingston, Samuel A.	5
Brennan, Robert L.	4
Werts, Charles E.	4
Kristof, Walter	3
Mellenbergh, Gideon J.	3
Qian, Jiahe	3
Stanley, Julian C.	3
Wilcox, Rand R.	3
Cliff, Norman	2
Edwards, Keith J.	2
Haberman, Shelby J.	2
Linn, Robert L.	2
Martin, Charles G.	2
Wainer, Howard	2
Werts, C. E.	2
van der Linden, Wim J.	2
von Davier, Alina A.	2
Asher, William	1
Belfry, M. Joan	1
Ben-Simon, Anat	1
Braun, Henry I.	1
Bresler, Samuel	1
Cahan, Sorel	1
More ▼