ERIC - Search Results

Showing 181 to 195 of 416 results

An Empirical Bayes Approach to Subscore Augmentation: How Much Strength Can We Borrow?

Peer reviewed

Edwards, Michael C.; Vevea, Jack L. – Journal of Educational and Behavioral Statistics, 2006

This article examines a subscore augmentation procedure. The approach uses empirical Bayes adjustments and is intended to improve the overall accuracy of measurement when information is scant. Simulations examined the impact of the method on subscale scores in a variety of realistic conditions. The authors focused on two popular scoring methods:…

Descriptors: Geometric Concepts, True Scores, Scoring, Item Response Theory

Making Essay Test Scores Fairer with Statistics. ETS Program Statistics Research Technical Report No. 89-90.

Braun, Henry I.; Wainer, Howard – 1989

A desirable goal would be to develop a methodology for scoring essays so that the final grades are less affected by when or by whom each essay was read. It seems sensible to derive such grades by somehow adjusting the ratings originally given by each reader. This essay describes a solution that relies on statistical adjustment, using the context…

Descriptors: Essay Tests, Estimation (Mathematics), Interrater Reliability, Scoring

A Reassessment of Standard Error of Measurement.

Klaas, Alan C. – 1975

Current usage and theory of standard error of measurement call for one standard error of measurement figure to be used across all levels of scoring. The study revealed that scoring variance across scoring levels is not constant. As scoring ability increases, scoring variance decreases. The assertion that low and high scoring subjects will…

Descriptors: Error of Measurement, Guessing (Tests), Scoring, Statistical Analysis

An Interval Estimate for Statistical Inference about True Scores.

Lord, Frederic M.; Hamilton, Martha S. – 1972

A numerical procedure is outlined for obtaining an interval estimate of true score. The procedure is applied to several sets of test data. (Author)

Descriptors: Bayesian Statistics, Hypothesis Testing, Psychological Testing, Statistical Analysis

Tolerance Intervals for True Scores.

Peer reviewed

Jarjoura, David – Journal of Educational Statistics, 1985

Issues regarding tolerance and confidence intervals are discussed within the context of educational measurement, and conceptual distinctions are drawn between these two types of intervals. Points are raised about the advantages of tolerance intervals when the focus is on a particular observed score rather than a particular examinee. (Author/BW)

Descriptors: Comparative Analysis, Error of Measurement, Mathematical Models, Test Interpretation

Statistical Control of "Impurity" in the Estimation of Test Reliability

Peer reviewed

Lu, K. H. – Educational and Psychological Measurement, 1971

Descriptors: Difficulty Level, Statistical Analysis, Statistical Significance, Test Items

The Stability Coefficient

Peer reviewed

Cureton, Edward E. – Educational and Psychological Measurement, 1971

A derivation of a formula for the stability coefficient is presented and discussed in terms of test reliability over time. (PR)

Descriptors: Error of Measurement, Raw Scores, Statistical Analysis, Test Reliability

Exploring the Feasibility of Collateral Information Test Equating.

Peer reviewed

Hsu, Tse-chi; Wu, Kuo-liang; Yu, Jya-yi Wu; Lee, Ming-yen – International Journal of Testing, 2002

Explored the feasibility of applying a method that incorporates collateral information to equate tests constructed for a college entrance examination by comparing its results with those of item response theory (IRT) true score equating. Simulation results suggest that, overall, equating results based on collateral information are relatively…

Descriptors: College Entrance Examinations, Equated Scores, Item Response Theory, Simulation

Standard Errors of Prediction for the Vineland Adaptive Behavior Scales.

Peer reviewed

Atkinson, Leslie – Journal of School Psychology, 1990

Offers standard errors of prediction and confidence intervals for the Vineland Adaptive Behavior Scales (VABS) that help in deciding whether variation in obtained scores, when the scale is administered to the same person more than once, is a result of measurement error or reflects actual change in the examinee's functional level. Presented values were…

Descriptors: Error of Measurement, Foreign Countries, Raw Scores, Test Interpretation

Performance of SIBTEST When the Percentage of DIF Items Is Large

Peer reviewed

Gierl, Mark J.; Gotzmann, Andrea; Boughton, Keith A. – Applied Measurement in Education, 2004

Differential item functioning (DIF) analyses are used to identify items that operate differently between two groups, after controlling for ability. The Simultaneous Item Bias Test (SIBTEST) is a popular DIF detection method that matches examinees on a true score estimate of ability. However, in some testing situations, like test translation and…

Descriptors: True Scores, Simulation, Test Bias, Student Evaluation

Objective Standard Setting for Judge-Mediated Examinations

Peer reviewed

Stone, Gregory Ethan; Beltyukova, Svetlana; Fox, Christine M. – International Journal of Testing, 2008

Judge-mediated examinations are defined as those for which expert evaluation (using rubrics) is required to determine correctness, completeness, and reasonability of test-taker responses. The use of multifaceted Rasch modeling has led to improvements in the reliability of scoring such examinations. The establishment of criterion-referenced…

Descriptors: Interrater Reliability, High Stakes Tests, Standard Setting, Minimum Competencies

The Effects on Observed- and True-Score Equating Procedures of Matching on a Fallible Criterion: A Simulation with Test Variation.

Eignor, Daniel R.; And Others – 1995

Two recent simulation studies were conducted to aid in the diagnosis and interpretation of equating differences found between random and matched (nonrandom) samples for four commonly used equating procedures: (1) Tucker; (2) Levine equally reliable; (3) Chained equipercentile observed-score; and (4) three-parameter, item response theory true-score…

Descriptors: Criteria, Equated Scores, Item Response Theory, Raw Scores

Method of Moments Estimates for the Four-Parameter Beta Compound Binomial Model and the Calculation of Classification Consistency Indexes.

Hanson, Bradley A. – 1991

This paper presents a detailed derivation of method of moments estimates used in computer programs for the four-parameter beta compound binomial strong true score model. A procedure is presented to deal with the case in which the usual method of moments estimates do not exist or result in invalid parameter estimates. The results presented…

Descriptors: Classification, Computation, Computer Software, Equations (Mathematics)

Congeneric Models and Levine's Linear Equating Procedures.

Brennan, Robert L. – 1990

In 1955, R. Levine introduced two linear equating procedures for the common-item non-equivalent populations design. His procedures make the same assumptions about true scores; they differ in terms of the nature of the equating function used. In this paper, two parameterizations of a classical congeneric model are introduced to model the variables…

Descriptors: Equated Scores, Equations (Mathematics), Mathematical Models, Research Design

An Assessment of the Relationship between the Assumption of Unidimensionality and the Quality of IRT True-Score Equating.

Cook, Linda L.; And Others – 1983

The purpose of this study was to empirically examine the relationship between violations of the assumption of unidimensionality, as assessed by the factor analysis of item parcel data, and the quality of item response theory (IRT) true-score equating, as measured by score scale stability. The verbal section of the Scholastic Aptitude Test (SAT)…

Descriptors: College Entrance Examinations, Equated Scores, Factor Analysis, Latent Trait Theory

