ERIC - Search Results

Publication Date

In 2025	1
Since 2024	4
Since 2021 (last 5 years)	11
Since 2016 (last 10 years)	30
Since 2006 (last 20 years)	95

Descriptor

True Scores	415
Error of Measurement	121
Test Reliability	110
Statistical Analysis	107
Mathematical Models	97
Item Response Theory	87
Correlation	76
Equated Scores	76
Reliability	64
Test Theory	52
Test Items	50
Comparative Analysis	49
Scores	47
Measurement Techniques	45
Estimation (Mathematics)	41
Test Interpretation	39
Raw Scores	35
Equations (Mathematics)	33
Simulation	33
Models	32
Scoring	32
Test Validity	32
Criterion Referenced Tests	31
Test Construction	30
Item Analysis	29
More ▼

Publication Type

Journal Articles	191
Reports - Research	175
Reports - Evaluative	98
Speeches/Meeting Papers	49
Reports - Descriptive	22
Numerical/Quantitative Data	8
Dissertations/Theses -…	6
Opinion Papers	6
Guides - Non-Classroom	4
Reports - General	4
Information Analyses	3
Collected Works - General	2
Book/Product Reviews	1
Guides - Classroom - Teacher	1
Reference Materials -…	1
Tests/Questionnaires	1
More ▼

Education Level

Higher Education	16
Postsecondary Education	10
Elementary Secondary Education	6
Secondary Education	4
High Schools	3
Early Childhood Education	2
Elementary Education	2
Junior High Schools	2
Grade 2	1
Grade 8	1
Middle Schools	1
Preschool Education	1
More ▼

Audience

Researchers	12
Practitioners	2
Administrators	1
Teachers	1

Location

Australia	1
Canada	1
China	1
Colorado	1
Illinois	1
Israel	1
New York	1
Oregon	1
Taiwan	1
Texas	1
United Kingdom (England)	1
United Kingdom (Great Britain)	1
Virgin Islands	1
More ▼

Laws, Policies, & Programs

Elementary and Secondary…

What Works Clearinghouse Rating

True Scores X

Showing 151 to 165 of 415 results Save | Export

Shrinkage Estimation of Linear Combinations of True Scores.

Peer reviewed

Longford, Nicholas T. – Psychometrika, 1997

It is demonstrated that, in the presence of population information, a linear combination of true scores can be estimated more efficiently than by the same linear combination of the observed scores. Three criteria for optimality are discussed, but they yield the same solution, described as a multivariate shrinkage estimator. (Author/SLD)

Descriptors: Error of Measurement, Estimation (Mathematics), Multivariate Analysis, Population Distribution

A Comparison among IRT True- and Observed-Score Equatings and Traditional Equipercentile Equating.

Peer reviewed

Han, Tianqi; And Others – Applied Measurement in Education, 1997

Stability among equating procedures was studied by comparing item response theory (IRT) true-score equating with IRT observed-score equating, IRT true-score equating with equipercentile equating, and IRT observed-score equating with equipercentile equating. On average, IRT true-score equating more frequently produced more stable conversions. (SLD)

Descriptors: Comparative Analysis, Equated Scores, Item Response Theory, Raw Scores

Simulation Results of Effects on Linear and Curvilinear Observed- and True-Score Equating Procedures of Matching on a Fallible Criterion.

Peer reviewed

Eignor, Daniel R.; And Others – Applied Measurement in Education, 1990

Two independent replications of a sequence of simulations were conducted to aid in the diagnosis and interpretation of equating differences found between representative (random) and matched (nonrandom) samples for three commonly used conventional observed-score equating procedures and one item-response-theory-based equating procedure. (SLD)

Descriptors: Equated Scores, Item Response Theory, Sampling, Simulation

Equating Tests under the Graded Response Model.

Peer reviewed

Baker, Frank B. – Applied Psychological Measurement, 1992

The procedure of M.L. Stocking and F.M. Lord (1983) for computing equating coefficients for tests having dichotomously scored items is extended to the case of graded response items. A system of equations for obtaining the equating coefficients under the graded response model is derived. (SLD)

Descriptors: Equated Scores, Equations (Mathematics), Item Response Theory, Mathematical Models

Variability in Reliability Coefficients and the Standard Error of Measurement from School District to District.

Peer reviewed

Feldt, Leonard S.; Qualls, Audrey L. – Applied Measurement in Education, 1999

Examined the stability of the standard error of measurement and the relationship between the reliability coefficient and the variance of both true scores and error scores for 170 school districts in a state. As expected, reliability coefficients varied as a function of group variability, but the variation in split-half coefficients from school to…

Descriptors: Elementary Secondary Education, Error of Measurement, Reliability, School Districts

Evaluating the Effects of Multidimensionality on IRT True-Score Equating.

Peer reviewed

Bolt, Daniel M. – Applied Measurement in Education, 1999

Examined whether the item response theory (IRT) true-score equating method is more adversely affected by the presence of multidimensionality than two conventional equating methods, linear and equipercentile equating. Results of two simulation studies suggest that the IRT method performs as well as the conventional methods when the correlation…

Descriptors: Correlation, Equated Scores, Item Response Theory, Simulation

A Comparison of IRT Equating and Beta 4 Equating

Peer reviewed

Direct link

Kim, Dong-In; Brennan, Robert; Kolen, Michael – Journal of Educational Measurement, 2005

Four equating methods (3PL true score equating, 3PL observed score equating, beta 4 true score equating, and beta 4 observed score equating) were compared using four equating criteria: first-order equity (FOE), second-order equity (SOE), conditional-mean-squared-error (CMSE) difference, and the equi-percentile equating property. True score…

Descriptors: True Scores, Psychometrics, Equated Scores, Item Response Theory

On the Reliability of Categorically Scored Examinations

Peer reviewed

Direct link

Kupermintz, Haggai – Journal of Educational Measurement, 2004

A decision-theoretic approach to the question of reliability in categorically scored examinations is explored. The concepts of true scores and errors are discussed as they deviate from conventional psychometric definitions and measurement error in categorical scores is cast in terms of misclassifications. A reliability measure based on…

Descriptors: Test Reliability, Error of Measurement, Psychometrics, Test Theory

Estimating Consistency and Accuracy Indices for Multiple Classifications

Peer reviewed

Direct link

Lee, Won-Chan; Hanson, Bradley A.; Brennan, Robert L. – Applied Psychological Measurement, 2002

This article describes procedures for estimating various indices of classification consistency and accuracy for multiple category classifications using data from a single test administration. The estimates of the classification consistency and accuracy indices are compared under three different psychometric models: the two-parameter beta binomial,…

Descriptors: Classification, True Scores, Psychometrics, Item Response Theory

Interval Estimation for True Raw and Scale Scores under the Binomial Error Model

Peer reviewed

Direct link

Lee, Won-Chan; Brennan, Robert L.; Kolen, Michael J. – Journal of Educational and Behavioral Statistics, 2006

Assuming errors of measurement are distributed binomially, this article reviews various procedures for constructing an interval for an individual's true number-correct score; presents two general interval estimation procedures for an individual's true scale score (i.e., normal approximation and endpoints conversion methods); compares various…

Descriptors: Probability, Intervals, Guidelines, Computer Simulation

The Relative Efficiency of Two Tests as a Function of Ability Level.

Download full text

Lord, Frederic M. – 1973

A new formula is developed for the relative efficiency of two tests measuring the same trait. The formula expresses relative efficiency solely in terms of the standard errors of measurement and, surprisingly, the frequency distributions of true scores. Approximate methods for estimating relative efficiency may make this function routinely…

Descriptors: Error of Measurement, Research Reports, Statistical Analysis, Test Interpretation

Mathematical Considerations About the Effects of Guessing on Test Variance.

Koplyay, Janos B.; And Others – 1972

The relationship between true ability (operationally defined as the number of items for which the examinee actually knew the correct answer) and the effects of guessing upon observed test variance was investigated. Three basic hypotheses were treated mathematically: there is no functional relationship between true ability and guessing success;…

Descriptors: Guessing (Tests), Predictor Variables, Probability, Scoring

Further Studies of Linear Prediction Following Matrix Sampling.

Download full text

Kleinke, David J. – 1973

In a post mortem study, it is demonstrated that linear prediction is as effective as computing a negative hyper-geometric distribution for estimating test norms following matrix sampling from a total test with a highly skewed score distribution, provided the same prediction coefficient is used for all examinee groups. It is also demonstrated…

Descriptors: Item Sampling, Norms, Predictive Measurement, Research Reports

Coefficients for Tests from a Decision Theoretic Point of View

Peer reviewed

Vander Linden, Wim J.; Mellenbergh, Gideon J. – Applied Psychological Measurement, 1978

A general coefficient for tests, delta, is derived from a decision theoretic point of view. The situations are considered in which a true score is estimated by a function of the observed score, observed scores are split into more than two categories, and observed scores are split into only two categories. (Author/CTM)

Descriptors: Criterion Referenced Tests, Decision Making, Mathematical Models, Raw Scores

A Study of the Accuracy of Subkoviak's Single-Administration Estimate of the Coefficient of Agreement Using Two True-Score Estimates

Peer reviewed

Algina, James; Noe, Michael J. – Journal of Educational Measurement, 1978

A computer simulation study was conducted to investigate Subkoviak's index of reliability for criterion-referenced tests, called the coefficient of agreement. Results indicate that the index can be adequately estimated. (JKS)

Descriptors: Criterion Referenced Tests, Mastery Tests, Measurement, Test Reliability

« Previous Page | Next Page »

Pages: 1 | ... | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | ... | 28

Educational and Psychological…	44
Journal of Educational…	40
Psychometrika	40
Applied Psychological…	23
ETS Research Report Series	15
Applied Measurement in…	12
Journal of Educational…	11
Journal of Experimental…	8
Journal of Educational and…	7
ProQuest LLC	6
Multivariate Behavioral…	5
Educational Measurement:…	4
Educational Testing Service	3
International Journal of…	3
Online Submission	3
Assessment	2
Developmental Psychology	2
International Educational…	2
Journal of School Psychology	2
Journal of Vocational Behavior	2
Practical Assessment,…	2
Scandinavian Journal of…	2
Test Service Bulletin	2
Advances in Health Sciences…	1
Alberta Journal of…	1
More ▼

Wilcox, Rand R.	14
Livingston, Samuel A.	12
Lord, Frederic M.	12
Brennan, Robert L.	10
Lee, Won-Chan	8
Kolen, Michael J.	7
Dimitrov, Dimiter M.	6
Haberman, Shelby J.	6
Mellenbergh, Gideon J.	6
Werts, Charles E.	6
von Davier, Alina A.	6
Cliff, Norman	5
Hanson, Bradley A.	5
Werts, C. E.	5
Eignor, Daniel R.	4
Harris, Chester W.	4
Linn, Robert L.	4
Qian, Jiahe	4
Zimmerman, Donald W.	4
Cureton, Edward E.	3
Feldt, Leonard S.	3
Huynh, Huynh	3
Jackson, Paul H.	3
Kolen, Michael	3
More ▼

SAT (College Admission Test)	7
Law School Admission Test	6
Iowa Tests of Basic Skills	5
Advanced Placement…	4
Test of English as a Foreign…	4
ACT Assessment	3
College Level Examination…	2
Comprehensive Tests of Basic…	2
Graduate Record Examinations	2
Iowa Tests of Educational…	2
National Assessment of…	2
College Board Achievement…	1
Differential Aptitude Test	1
Dynamic Indicators of Basic…	1
Early Childhood Environment…	1
General Aptitude Test Battery	1
Goodenough Harris Drawing Test	1
Graduate Management Admission…	1
Illinois Test of…	1
Kit of Reference Tests for…	1
Medical College Admission Test	1
Metropolitan Readiness Tests	1
National Longitudinal Study…	1
North Carolina End of Course…	1
Praxis Series	1
More ▼