ERIC - Search Results

Publication Date

In 2025	1
Since 2024	4
Since 2021 (last 5 years)	11
Since 2016 (last 10 years)	30
Since 2006 (last 20 years)	95

Descriptor

True Scores	415
Error of Measurement	121
Test Reliability	110
Statistical Analysis	107
Mathematical Models	97
Item Response Theory	87
Correlation	76
Equated Scores	76
Reliability	64
Test Theory	52
Test Items	50
Comparative Analysis	49
Scores	47
Measurement Techniques	45
Estimation (Mathematics)	41
Test Interpretation	39
Raw Scores	35
Equations (Mathematics)	33
Simulation	33
Models	32
Scoring	32
Test Validity	32
Criterion Referenced Tests	31
Test Construction	30
Item Analysis	29
More ▼

Publication Type

Journal Articles	191
Reports - Research	175
Reports - Evaluative	98
Speeches/Meeting Papers	49
Reports - Descriptive	22
Numerical/Quantitative Data	8
Dissertations/Theses -…	6
Opinion Papers	6
Guides - Non-Classroom	4
Reports - General	4
Information Analyses	3
Collected Works - General	2
Book/Product Reviews	1
Guides - Classroom - Teacher	1
Reference Materials -…	1
Tests/Questionnaires	1
More ▼

Education Level

Higher Education	16
Postsecondary Education	10
Elementary Secondary Education	6
Secondary Education	4
High Schools	3
Early Childhood Education	2
Elementary Education	2
Junior High Schools	2
Grade 2	1
Grade 8	1
Middle Schools	1
Preschool Education	1
More ▼

Audience

Researchers	12
Practitioners	2
Administrators	1
Teachers	1

Location

Australia	1
Canada	1
China	1
Colorado	1
Illinois	1
Israel	1
New York	1
Oregon	1
Taiwan	1
Texas	1
United Kingdom (England)	1
United Kingdom (Great Britain)	1
Virgin Islands	1
More ▼

Laws, Policies, & Programs

Elementary and Secondary…

What Works Clearinghouse Rating

True Scores X

Showing 106 to 120 of 415 results Save | Export

Estimation of Reliability and True Score Variance From a Split of a Test Into Three Arbitrary Parts

Peer reviewed

Kristof, Walter – Psychometrika, 1974

Descriptors: Models, Statistical Analysis, Test Reliability, Testing

A Relationship Between Harris Factors and Guttman's Sixth Lower Bound To Reliability

Peer reviewed

Nicewander, W. Alan – Psychometrika, 1975

Shows that the Harris factors of R have psychometric properties similar to those discussed by Kaiser and Caffrey (1965) and Bentler (1968). Specifically it is shown that the Harris factors of R maximize a lower-bound to the reliability of a composite measure derived by Guttman (1945). (Author/RC)

Descriptors: Correlation, Factor Analysis, Matrices, Prediction

Some Results Relating to Test Equating Under Relaxed Test Form Equivalence

Peer reviewed

Marks, Edmond; Lindsay, Carl A. – Journal of Educational Measurement, 1972

Examines the effects of four parameters on the accuracy of test equating under a relaxed definition of test form equivalence. The four parameters studied were sample size, test form length, test form reliability, and the correlation between true scores of the test forms to be equated. (CK)

Descriptors: Scores, Test Interpretation, Test Reliability, Test Results

True Score Theory: A Paradox

Peer reviewed

Ramsay, J. O. – Educational and Psychological Measurement, 1971

The consequences of the assumption that the expected score is equal to the true score are shown and alternatives discussed. (MS)

Descriptors: Psychological Testing, Statistical Analysis, Test Reliability, Testing

A Note on Gaylord's "Estimating Test Reliability from the Item-Test Correlations"

Peer reviewed

Bowers, John – Educational and Psychological Measurement, 1971

Descriptors: Error of Measurement, Mathematical Models, Test Reliability, True Scores

On the Base-Free Measure of Change Proposed by Tucker, Damarin and Messick.

Peer reviewed

Bond, Lloyd – Psychometrika, 1979

Tucker, Damarin, and Messick proposed a "base-free" measure of change which involves the computation of residual scores that are uncorrelated with true scores on the pretest. The present note discusses this change measure and demonstrates that properties they attribute to a are, in fact, properties of b. (Author/CTM)

Descriptors: Differences, Pretests Posttests, Research Reviews (Publications), Scores

Maximally Reliable Composites for Unidimensional Measures.

Peer reviewed

Conger, Anthony J. – Educational and Psychological Measurement, 1980

Reliability maximizing weights are related to theoretically specified true score scaling weights to show a constant relationship that is invariant under separate linear tranformations on each variable in the system. Test theoretic relations should be derived for the most general model available and not for unnecessarily constrained models.…

Descriptors: Mathematical Formulas, Scaling, Test Reliability, Test Theory

A Note on Decision Theoretic Coefficients for Tests.

Peer reviewed

Wilcox, Rand R. – Applied Psychological Measurement, 1979

Using a new coefficient, a rescaling of the Bayes risk is examined and a modification of this coefficient is described which yields an index that always has a value between zero and one. (Author/MH)

Descriptors: Bayesian Statistics, Measurement Techniques, Scoring, Technical Reports

Reliability and True-Score Measures of Binary Items as a Function of Their Rasch Difficulty Parameter.

Peer reviewed

Dimitrov, Dimiter M. – Journal of Applied Measurement, 2003

Proposes formulas for expected true-score measures and reliability of binary items as a function of their Rasch difficulty when the trait (ability) distribution is normal or logistic. Provides an illustrative example for using the proposed formulas. (SLD)

Descriptors: Ability, Difficulty Level, Item Response Theory, Reliability

Longitudinal Models of Reliability and Validity: A Latent Curve Approach.

Peer reviewed

Tisak, John; Tisak, Marie S. – Applied Psychological Measurement, 1996

Dynamic generalizations of reliability and validity that will incorporate longitudinal or developmental models, using latent curve analysis, are discussed. A latent curve model formulated to depict change is incorporated into the classical definitions of reliability and validity. The approach is illustrated with sociological and psychological…

Descriptors: Definitions, Development, Longitudinal Studies, Models

Ordinal Consistency and Ordinal True Scores.

Peer reviewed

Cliff, Norman – Psychometrika, 1989

This paper argues that: test data are ordinal; latent trait scores are only determined ordinally; and test data are used largely for ordinal purposes. A set of ordinal assumptions is presented, including an ordinal version of local independence. It is concluded that a purely ordinal test theory is possible. (TJH)

Descriptors: Equations (Mathematics), Latent Trait Theory, Regression (Statistics), True Scores

The Problem of Negative Reliabilities.

Peer reviewed

Krus, David J.; Helmstadter, Gerald C. – Educational and Psychological Measurement, 1993

Negative coefficients of reliability, sometimes returned by the standard formula for estimation of the internal-consistency reliability, are neither theoretically nor numerically correct. Alternative strategies for test development in this special case are suggested. (Author)

Descriptors: Estimation (Mathematics), Reliability, Test Construction, Test Use

Improved Type I Error Control and Reduced Estimation Bias for DIF Detection Using SIBTEST.

Peer reviewed

Jiang, Hai; Stout, William – Journal of Educational and Behavioral Statistics, 1998

Proposes a new regression correction for the SIBTEST statistical tests (R. Shealy and W. Stout, 1993) that essentially uses a two-segment piecewise linear regression of the true on observed matching subtest scores. A simulation study illustrates the approach. (SLD)

Descriptors: Estimation (Mathematics), Item Bias, Regression (Statistics), Simulation

Objective Standard Setting for Judge-Mediated Examinations

Peer reviewed

Direct link

Stone, Gregory Ethan; Beltyukova, Svetlana; Fox, Christine M. – International Journal of Testing, 2008

Judge-mediated examinations are defined as those for which expert evaluation (using rubrics) is required to determine correctness, completeness, and reasonability of test-taker responses. The use of multifaceted Rasch modeling has led to improvements in the reliability of scoring such examinations. The establishment of criterion-referenced…

Descriptors: Interrater Reliability, High Stakes Tests, Standard Setting, Minimum Competencies

Factors Affecting the Sample Invariant Properties of Linear and Curvilinear Observed- and True-Score Equating Procedures.

Download full text

Stocking, Martha L.; And Others – 1988

A sequence of simulations was carried out to aid in the diagnosis and interpretation of equating differences found between random and matched (nonrandom) samples for four commonly used equating procedures: (1) Tucker linear observed-score equating; (2) Levine equally reliable linear observed-score equating; (3) equipercentile curvilinear…

Descriptors: Equated Scores, Item Response Theory, Sample Size, Simulation

« Previous Page | Next Page »

Pages: 1 | ... | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | ... | 28

Educational and Psychological…	44
Journal of Educational…	40
Psychometrika	40
Applied Psychological…	23
ETS Research Report Series	15
Applied Measurement in…	12
Journal of Educational…	11
Journal of Experimental…	8
Journal of Educational and…	7
ProQuest LLC	6
Multivariate Behavioral…	5
Educational Measurement:…	4
Educational Testing Service	3
International Journal of…	3
Online Submission	3
Assessment	2
Developmental Psychology	2
International Educational…	2
Journal of School Psychology	2
Journal of Vocational Behavior	2
Practical Assessment,…	2
Scandinavian Journal of…	2
Test Service Bulletin	2
Advances in Health Sciences…	1
Alberta Journal of…	1
More ▼

Wilcox, Rand R.	14
Livingston, Samuel A.	12
Lord, Frederic M.	12
Brennan, Robert L.	10
Lee, Won-Chan	8
Kolen, Michael J.	7
Dimitrov, Dimiter M.	6
Haberman, Shelby J.	6
Mellenbergh, Gideon J.	6
Werts, Charles E.	6
von Davier, Alina A.	6
Cliff, Norman	5
Hanson, Bradley A.	5
Werts, C. E.	5
Eignor, Daniel R.	4
Harris, Chester W.	4
Linn, Robert L.	4
Qian, Jiahe	4
Zimmerman, Donald W.	4
Cureton, Edward E.	3
Feldt, Leonard S.	3
Huynh, Huynh	3
Jackson, Paul H.	3
Kolen, Michael	3
More ▼

SAT (College Admission Test)	7
Law School Admission Test	6
Iowa Tests of Basic Skills	5
Advanced Placement…	4
Test of English as a Foreign…	4
ACT Assessment	3
College Level Examination…	2
Comprehensive Tests of Basic…	2
Graduate Record Examinations	2
Iowa Tests of Educational…	2
National Assessment of…	2
College Board Achievement…	1
Differential Aptitude Test	1
Dynamic Indicators of Basic…	1
Early Childhood Environment…	1
General Aptitude Test Battery	1
Goodenough Harris Drawing Test	1
Graduate Management Admission…	1
Illinois Test of…	1
Kit of Reference Tests for…	1
Medical College Admission Test	1
Metropolitan Readiness Tests	1
National Longitudinal Study…	1
North Carolina End of Course…	1
Praxis Series	1
More ▼