ERIC - Search Results

Publication Date

In 2026	0
Since 2025	2
Since 2022 (last 5 years)	12
Since 2017 (last 10 years)	26
Since 2007 (last 20 years)	90

Descriptor

True Scores	416
Error of Measurement	121
Test Reliability	110
Statistical Analysis	107
Mathematical Models	97
Item Response Theory	87
Correlation	76
Equated Scores	76
Reliability	64
Test Theory	52
Test Items	51
Comparative Analysis	49
Scores	47
Measurement Techniques	45
Estimation (Mathematics)	41
Test Interpretation	39
Raw Scores	35
Equations (Mathematics)	33
Simulation	33
Models	32
Scoring	32
Test Validity	32
Criterion Referenced Tests	31
Test Construction	30
Item Analysis	29
More ▼

Publication Type

Journal Articles	192
Reports - Research	176
Reports - Evaluative	98
Speeches/Meeting Papers	49
Reports - Descriptive	22
Numerical/Quantitative Data	8
Dissertations/Theses -…	6
Opinion Papers	6
Guides - Non-Classroom	4
Reports - General	4
Information Analyses	3
Collected Works - General	2
Book/Product Reviews	1
Guides - Classroom - Teacher	1
Reference Materials -…	1
Tests/Questionnaires	1
More ▼

Education Level

Higher Education	16
Postsecondary Education	10
Elementary Secondary Education	6
Secondary Education	4
High Schools	3
Early Childhood Education	2
Elementary Education	2
Junior High Schools	2
Grade 2	1
Grade 8	1
Middle Schools	1
Preschool Education	1
More ▼

Audience

Researchers	12
Practitioners	2
Administrators	1
Teachers	1

Location

Australia	1
Canada	1
China	1
Colorado	1
Illinois	1
Israel	1
New York	1
Oregon	1
Taiwan	1
Texas	1
United Kingdom (England)	1
United Kingdom (Great Britain)	1
Virgin Islands	1
More ▼

Laws, Policies, & Programs

Elementary and Secondary…

What Works Clearinghouse Rating

Showing 391 to 405 of 416 results Save | Export

Demonstrating the Utility of a Multilevel Model in the Assessment of Differential Item Functioning.

Download full text

Pommerich, Mary – 1995

When tests contain few items, observed score may not be an accurate reflection of true score, and the Mantel Haenszel (MH) statistic may perform poorly in detecting differential item functioning. Applications of the MH procedure in such situations require an alternate strategy; one such strategy is to include background variables in the matching…

Descriptors: Criteria, Evaluation Methods, Grade 3, Identification

Improving Prediction by Correcting Test Scores for Person Disturbances Using the Rasch Model.

Download full text

Westfall, Philip Jean-Louis; D'Costa, Ayres G. – 1987

This study, based on the Rasch model, used R. M. Smith's (1986) classification of measurement disturbances to assess the Rasch model approach to error control and statistical prediction. Partitioning the error component into a person component, an item-person interaction component, and a random unexplained error component has the net effect of…

Descriptors: Classification, College Entrance Examinations, Error of Measurement, French

The KR-20 Reliability Coefficient as a Special Case of a More General Formula.

Download full text

Smith, Donald M. – 1976

The Kuder Richardson-20 Formula is shown to be a special case, where each examinee is given sufficient time to answer each item, of a more general formula where each examinee may not be allowed the necessary time. The formula is extended to allow two scores, knowledge and speed, to be extracted from each examinees test score. Using a sample of 82…

Descriptors: Career Development, Comparative Analysis, Grade Point Average, Predictive Measurement

Adjustments for Rater Effects in Performance Assessment.

Peer reviewed

Houston, Walter M.; And Others – Applied Psychological Measurement, 1991

The effectiveness of alternative procedures to correct for rater leniency/stringency effects was studied when true scores were known. Ordinary least squares, weighted least squares, and imputation of the missing data consistently outperformed averaging the observed ratings; and the imputation technique was superior to the least squares methods.…

Descriptors: Comparative Analysis, Computer Simulation, Educational Assessment, Equations (Mathematics)

Using Confirmatory Factor Analysis of Multitrait-Multimethod Data To Assess the Psychometrical Equivalence of 4-Point and 6-Point Likert-Type Scales.

Download full text

Chang, Lei – 1993

Equivalence in reliability and validity across 4-point and 6-point scales was assessed by fitting different measurement models through confirmatory factor analysis of a multitrait-multimethod covariance matrix. Responses to nine Likert-type items designed to measure perceived quantitative ability, self-perceived usefulness of quantitative…

Descriptors: Ability, Comparative Testing, Education Majors, Graduate Students

The Best Linear Predictor for True Score from a Direct Estimate and Several Derived Estimates. Research Report. ETS RR-04-35

Peer reviewed
PDF on ERIC

Download full text

Haberman, Shelby J.; Qian, Jiahe – ETS Research Report Series, 2004

Statistical prediction problems often involve both a direct estimate of a true score and covariates of this true score. Given the criterion of mean squared error, this study determines the best linear predictor of the true score given the direct estimate and the covariates. Results yield an extension of Kelley's formula for estimation of the true…

Descriptors: True Scores, Computation, Predictor Variables, Correlation

Population Invariance of Test Equating and Linking: Theory Extension and Applications across Exams. Research Report. ETS RR-06-31

Peer reviewed
PDF on ERIC

Download full text

von Davier, Alina A., Ed.; Liu, Mei, Ed. – ETS Research Report Series, 2006

This report builds on and extends existent research on population invariance to new tests and issues. The authors lay the foundation for a deeper understanding of the use of population invariance measures in a wide variety of practical contexts. The invariance of linear, equipercentile and IRT equating methods are examined using data from five…

Descriptors: Equated Scores, Statistical Analysis, Data Collection, Test Format

The TOEFL Computerized Placement Test: Adaptive Conventional Measurement. TOEFL Research Reports, Report 31.

Download full text

Hicks, Marilyn M. – 1989

Methods of computerized adaptive testing using conventional scoring methods in order to develop a computerized placement test for the Test of English as a Foreign Language (TOEFL) were studied. As a consequence of simulation studies during the first phase of the study, the multilevel testing paradigm was adopted to produce three test levels…

Descriptors: Adaptive Testing, Adults, Algorithms, Computer Assisted Testing

Analysis of Covariance: Is It the Appropriate Model to Study Change?

Download full text

Marston, Paul T., Borich, Gary D. – 1977

The four main approaches to measuring treatment effects in schools; raw gain, residual gain, covariance, and true scores; were compared. A simulation study showed true score analysis produced a large number of Type-I errors. When corrected for this error, this method showed the least power of the four. This outcome was clearly the result of the…

Descriptors: Achievement Gains, Analysis of Covariance, Comparative Analysis, Error of Measurement

The Evaluation of Mastery Test Items. Final Report.

Download full text

Brennan, Robert L. – 1974

The first four chapters of this report primarily provide an extensive, critical review of the literature with regard to selected aspects of the criterion-referenced and mastery testing fields. Major topics treated include: (a) definitions, distinctions, and background, (b) the relevance of classical test theory, (c) validity and procedures for…

Descriptors: Computer Programs, Confidence Testing, Criterion Referenced Tests, Error of Measurement

Racial Differences in Measurement Error in Educational Achievement Models.

Peer reviewed

Wolfle, Lee M.; Robertshaw, Dianne – Journal of Educational Measurement, 1983

Racial differences in the reporting accuracy of parental status characteristics by White and Black high school seniors were investigated using Joreskog's general framework for simultaneous covariance structure analyses of multiple populations. Reliability estimates for Whites were significantly higher than for Blacks due to differences in true…

Descriptors: Academic Achievement, Black Students, Educational Research, Error of Measurement

An Investigation of the Ordinal True Score Test Theory.

Peer reviewed

Donoghue, John R.; Cliff, Norman – Applied Psychological Measurement, 1991

The validity of the assumptions under which the ordinal true score test theory was derived was examined using (1) simulation based on classical test theory; (2) a long empirical test with data from 321 sixth graders; and (3) an extensive simulation with 480 datasets based on the 3-parameter model. (SLD)

Descriptors: Computer Simulation, Elementary Education, Elementary School Students, Equations (Mathematics)

A Search for TRUTH in Student Responses to Selected Survey Items. AIR 1993 Annual Forum Paper.

Download full text

Takalkar, Pradnya; And Others – 1993

This study compared 4,594 student responses from three different surveys of incoming students at the University of South Florida (USF) with data from Florida's State University System (SUS) admissions files to determine what proportion of error occurs in the survey responses. Specifically, the study investigated the amount of measurement error in…

Descriptors: College Admission, College Applicants, College Bound Students, Comparative Analysis

The Rasch Model for Dichotomous Items: Theory, Applications and a Computer Program. No. 63.

Download full text

Gustafsson, Jan-Eric – 1977

The Rasch model for test analysis is described and compared with two-parameter and three-parameter latent-trait models. Conditional maximum likelihood equations for estimating item parameters are derived, and estimates of person parameters are described together with their confidence intervals. Goodness of fit tests are discussed, including a…

Descriptors: Adaptive Testing, Computer Programs, Equated Scores, Error of Measurement

Strategies for Analyzing Data from Intact Groups.

Download full text

Cross, Lawrence H.; Lane, Carolyn E. – 1977

Action research often necessitates the use of intact groups for the comparison of educational treatments or programs. This paper considers several analytical methods that might be used for such situations when pretest scores indicate that these intact groups differ significantly initially. The methods considered include gain score analysis of…

Descriptors: Achievement Gains, Analysis of Covariance, Analysis of Variance, Control Groups

« Previous Page | Next Page »

Pages: 1 | ... | 18 | 19 | 20 | 21 | 22 | 23 | 24 | 25 | 26 | 27 | 28

Educational and Psychological…	44
Journal of Educational…	40
Psychometrika	40
Applied Psychological…	23
ETS Research Report Series	15
Applied Measurement in…	12
Journal of Educational…	11
Journal of Experimental…	8
Journal of Educational and…	7
ProQuest LLC	6
Educational Measurement:…	5
Multivariate Behavioral…	5
Educational Testing Service	3
International Journal of…	3
Online Submission	3
Assessment	2
Developmental Psychology	2
International Educational…	2
Journal of School Psychology	2
Journal of Vocational Behavior	2
Practical Assessment,…	2
Scandinavian Journal of…	2
Test Service Bulletin	2
Advances in Health Sciences…	1
Alberta Journal of…	1
More ▼

Wilcox, Rand R.	14
Livingston, Samuel A.	12
Lord, Frederic M.	12
Brennan, Robert L.	10
Lee, Won-Chan	8
Kolen, Michael J.	7
Dimitrov, Dimiter M.	6
Haberman, Shelby J.	6
Mellenbergh, Gideon J.	6
Werts, Charles E.	6
von Davier, Alina A.	6
Cliff, Norman	5
Hanson, Bradley A.	5
Werts, C. E.	5
Eignor, Daniel R.	4
Harris, Chester W.	4
Linn, Robert L.	4
Qian, Jiahe	4
Zimmerman, Donald W.	4
Cureton, Edward E.	3
Feldt, Leonard S.	3
Huynh, Huynh	3
Jackson, Paul H.	3
Kolen, Michael	3
More ▼

SAT (College Admission Test)	7
Law School Admission Test	6
Iowa Tests of Basic Skills	5
Advanced Placement…	4
Test of English as a Foreign…	4
ACT Assessment	3
College Level Examination…	2
Comprehensive Tests of Basic…	2
Graduate Record Examinations	2
Iowa Tests of Educational…	2
National Assessment of…	2
College Board Achievement…	1
Differential Aptitude Test	1
Dynamic Indicators of Basic…	1
Early Childhood Environment…	1
General Aptitude Test Battery	1
Goodenough Harris Drawing Test	1
Graduate Management Admission…	1
Illinois Test of…	1
Kit of Reference Tests for…	1
Medical College Admission Test	1
Metropolitan Readiness Tests	1
National Longitudinal Study…	1
North Carolina End of Course…	1
Praxis Series	1
More ▼