Publication Date
| In 2026 | 0 |
| Since 2025 | 2 |
| Since 2022 (last 5 years) | 12 |
| Since 2017 (last 10 years) | 26 |
| Since 2007 (last 20 years) | 90 |
Descriptor
| True Scores | 416 |
| Error of Measurement | 121 |
| Test Reliability | 110 |
| Statistical Analysis | 107 |
| Mathematical Models | 97 |
| Item Response Theory | 87 |
| Correlation | 76 |
| Equated Scores | 76 |
| Reliability | 64 |
| Test Theory | 52 |
| Test Items | 51 |
Audience
| Researchers | 12 |
| Practitioners | 2 |
| Administrators | 1 |
| Teachers | 1 |
Location
| Australia | 1 |
| Canada | 1 |
| China | 1 |
| Colorado | 1 |
| Illinois | 1 |
| Israel | 1 |
| New York | 1 |
| Oregon | 1 |
| Taiwan | 1 |
| Texas | 1 |
| United Kingdom (England) | 1 |
Laws, Policies, & Programs
| Elementary and Secondary… | 1 |
Peer reviewed: Cook, William L.; Goldstein, Michael J. – Child Development, 1993
Tested the assumption that familial self-reports are biased by social desirability and other factors, through the use of a latent variables modeling approach that evaluated rater reliability and bias in mother, father, and child ratings of parent-child negativity. Results based on 78 families demonstrated that family member ratings contained a…
Descriptors: Children, Family Relationship, Interrater Reliability, Parent Child Relationship
Peer reviewed: Woodruff, David – Journal of Educational Measurement, 1991
Improvements are made to previous estimates of the conditional standard error of measurement in prediction, the conditional standard error of estimation (CSEE), and the conditional standard error of prediction (CSEP). Better estimates of how test length affects CSEE and CSEP are derived. (SLD)
Descriptors: Equations (Mathematics), Error of Measurement, Estimation (Mathematics), Mathematical Models
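For orientation, the quantities named above have familiar unconditional counterparts in classical test theory; the article works with their conditional analogues, which vary with score level. A sketch, with reliability and observed-score standard deviation as the only inputs:

```latex
% Unconditional classical test theory counterparts, with \rho the test
% reliability and \sigma_X the observed-score standard deviation:
\sigma_{\mathrm{SEM}} = \sigma_X\sqrt{1-\rho}
  \qquad \text{(standard error of measurement)}
\sigma_{\mathrm{SEE}} = \sigma_X\sqrt{\rho\,(1-\rho)}
  \qquad \text{(standard error of estimation of the true score)}
\sigma_{\mathrm{SEP}} = \sigma_X\sqrt{1-\rho^{2}}
  \qquad \text{(standard error of prediction of a parallel-form score)}
```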
Peer reviewed: Millsap, Roger E.; Everson, Howard – Multivariate Behavioral Research, 1991
Use of confirmatory factor analysis (CFA) with nonzero latent means in testing six different measurement models from classical test theory is discussed. Implications of the six models for observed mean and covariance structures are described, and three examples of the use of CFA in testing the models are presented. (SLD)
Descriptors: Comparative Analysis, Equations (Mathematics), Goodness of Fit, Mathematical Models
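As background (generic notation, not the article's), the classical test theory measurement models compared in such a CFA are nested versions of a single-factor model; three of the best-known members of that family are sketched below.

```latex
% Single-factor measurement model for observed measures X_1, ..., X_p with
% common true score T, intercepts \nu_j, loadings \lambda_j, errors E_j:
X_j = \nu_j + \lambda_j T + E_j , \qquad j = 1, \dots, p
% congeneric:                   \lambda_j and Var(E_j) free across measures
% (essentially) tau-equivalent: \lambda_1 = \dots = \lambda_p, Var(E_j) free
% parallel:                     \lambda_1 = \dots = \lambda_p and
%                               Var(E_1) = \dots = Var(E_p)
```

Allowing a nonzero mean for T brings the intercepts and the latent mean into the model, which is what makes the observed means, and not only the covariances, testable.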
Peer reviewed: Cohen, Allan S.; And Others – Applied Psychological Measurement, 1993
Three measures of differential item functioning for the dichotomous response model are extended to include Samejima's graded response model. Two are based on area differences between item true score functions, and one is a chi-square statistic for comparing differences in item parameters. (SLD)
Descriptors: Chi Square, Comparative Analysis, Identification, Item Bias
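A rough sketch of how an area-type index of this kind can be computed under the graded response model, with hypothetical item parameters for a reference and a focal group; the article's own statistics (and its chi-square measure) differ in their details.

```python
import numpy as np

def grm_expected_score(theta, a, b):
    """Expected item score under Samejima's graded response model,
    with categories scored 0, 1, ..., m-1.

    theta : 1-D array of ability values
    a     : discrimination parameter
    b     : increasing sequence of m-1 category boundary parameters

    Uses the identity E[X | theta] = sum_k P(X >= k | theta); each
    cumulative boundary curve is a two-parameter logistic in theta.
    """
    theta = np.asarray(theta, dtype=float)[:, None]   # (n_theta, 1)
    b = np.asarray(b, dtype=float)[None, :]           # (1, m-1)
    p_star = 1.0 / (1.0 + np.exp(-a * (theta - b)))   # cumulative curves
    return p_star.sum(axis=1)

def unsigned_area_dif(a_ref, b_ref, a_foc, b_foc, lo=-4.0, hi=4.0, n=401):
    """Unweighted unsigned area between the reference- and focal-group
    expected score functions, approximated on a theta grid (trapezoid
    rule).  A generic area-type DIF index, not the article's statistic."""
    theta = np.linspace(lo, hi, n)
    diff = np.abs(grm_expected_score(theta, a_ref, b_ref)
                  - grm_expected_score(theta, a_foc, b_foc))
    return float(np.sum((diff[:-1] + diff[1:]) * np.diff(theta)) / 2.0)

# Hypothetical parameters for one five-category item in two groups
print(unsigned_area_dif(1.2, [-1.0, -0.2, 0.6, 1.4],
                        1.0, [-0.8, 0.0, 0.8, 1.6]))
```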
Peer reviewed: Hoijtink, Herbert; Boomsma, Anne – Psychometrika, 1996
The quality of approximations to first- and second-order moments based on latent ability estimates is discussed. The ability estimates are based on the Rasch or the two-parameter logistic model, and true score theory is used to account for the fact that the basic quantities are estimates. (SLD)
Descriptors: Ability, Bayesian Statistics, Estimation (Mathematics), Item Response Theory
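For reference, the two response models named above, for an item i with difficulty b_i (and discrimination a_i in the two-parameter case):

```latex
% Rasch model:
P(X_i = 1 \mid \theta) = \frac{\exp(\theta - b_i)}{1 + \exp(\theta - b_i)}
% Two-parameter logistic (2PL) model:
P(X_i = 1 \mid \theta) = \frac{\exp\{a_i(\theta - b_i)\}}{1 + \exp\{a_i(\theta - b_i)\}}
```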
Penfield, Randall D.; Giacobbi, Peter R., Jr – Measurement in Physical Education and Exercise Science, 2004
Item content-relevance is an important consideration for researchers when developing scales used to measure psychological constructs. Aiken (1980) proposed a statistic, "V," that can be used to summarize item content-relevance ratings obtained from a panel of expert judges. This article proposes the application of the Score confidence interval to…
Descriptors: Intervals, True Scores, Content Validity, Sport Psychology
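The two pieces the abstract names can be sketched directly. Aiken's V for one item is the mean judged relevance rescaled to [0, 1]; a Score (Wilson) interval can then be applied by treating V as a proportion out of n(c - 1) pseudo-trials, where n is the number of judges and c the number of scale points. A minimal sketch under those assumptions; the exact interval the article proposes may differ in detail.

```python
import math

def aikens_v(ratings, lo, hi):
    """Aiken's V content-relevance index for one item.

    ratings : integer ratings from a panel of judges
    lo, hi  : lowest and highest possible scale points
    Returns a value in [0, 1]; higher means the judges rated the item
    as more relevant to the construct."""
    n, k = len(ratings), hi - lo
    return sum(r - lo for r in ratings) / (n * k)

def score_interval_for_v(ratings, lo, hi, z=1.96):
    """Wilson score confidence interval applied to Aiken's V, treating V
    as a proportion based on n * (hi - lo) pseudo-trials.  A sketch of
    the general approach, not necessarily the article's exact formula."""
    n, k = len(ratings), hi - lo
    v = aikens_v(ratings, lo, hi)
    m = n * k                                  # effective number of trials
    centre = (v + z * z / (2 * m)) / (1 + z * z / m)
    half = (z / (1 + z * z / m)) * math.sqrt(v * (1 - v) / m + z * z / (4 * m * m))
    return max(0.0, centre - half), min(1.0, centre + half)

# Hypothetical ratings from 8 judges on a 1-5 relevance scale
ratings = [4, 5, 4, 3, 5, 4, 4, 5]
print(aikens_v(ratings, 1, 5), score_interval_for_v(ratings, 1, 5))
```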
Tong, Ye; Kolen, Michael – Applied Psychological Measurement, 2005
The performance of three equating methods--the presmoothed equipercentile method, the item response theory (IRT) true score method, and the IRT observed score method--were examined based on three equating criteria: the same distributions property, the first-order equity property, and the second-order equity property. The magnitude of the…
Descriptors: True Scores, Criteria, Raw Scores, Item Response Theory
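A minimal sketch of the IRT true score method alone, under the two-parameter logistic model with hypothetical item parameters assumed to be on a common theta scale; the presmoothed equipercentile and IRT observed score methods compared in the article, and the equity-property criteria, are not shown.

```python
import numpy as np

def true_score(theta, a, b):
    """Form true score under the 2PL: sum of item response probabilities."""
    a, b = np.asarray(a, dtype=float), np.asarray(b, dtype=float)
    return float(np.sum(1.0 / (1.0 + np.exp(-a * (theta - b)))))

def irt_true_score_equate(x, a_x, b_x, a_y, b_y, lo=-8.0, hi=8.0, tol=1e-8):
    """Find theta with tau_X(theta) = x by bisection, then return the
    equated score tau_Y(theta).  Assumes 0 < x < number of items on
    form X and that both forms' parameters share a common scale."""
    lo_t, hi_t = lo, hi
    while hi_t - lo_t > tol:
        mid = 0.5 * (lo_t + hi_t)
        if true_score(mid, a_x, b_x) < x:
            lo_t = mid                      # true score too low: move right
        else:
            hi_t = mid
    theta_x = 0.5 * (lo_t + hi_t)
    return true_score(theta_x, a_y, b_y)

# Hypothetical five-item forms X and Y on a common scale
a_x, b_x = [1.0, 1.2, 0.8, 1.1, 0.9], [-1.0, -0.5, 0.0, 0.5, 1.0]
a_y, b_y = [1.1, 0.9, 1.0, 1.3, 0.8], [-0.8, -0.3, 0.2, 0.6, 1.2]
print(irt_true_score_equate(3, a_x, b_x, a_y, b_y))
```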
Yang, Wen-Ling; Houang, Richard T. – 1996
The influence of anchor length on the accuracy of test equating was studied using Tucker's linear method and two Item-Response-Theory (IRT) based methods, focusing on whether equating accuracy improved with more anchor items, whether the anchor effect depended on the equating method used, and the adequacy of the inclusion of the guessing parameter…
Descriptors: Equated Scores, Estimation (Mathematics), Guessing (Tests), Item Response Theory
Hsu, Yaowen; Ackerman, Terry A. – 1994
This paper summarizes an investigation of the format used for equating the 1993 Illinois Goal Assessment Program (IGAP) sixth grade reading test. In 1992, each student took only one test, either a narrative test or an expository test. In 1993, there was only one test, which included both formats. Several possible approaches for linking the 1993…
Descriptors: Context Effect, Elementary School Students, Equated Scores, Grade 6
Bekhuis, Tanja C. H. M. – 1988
An Educational Testing Service (ETS) procedure, based on item response theory, for estimating true scores on tests not taken was evaluated. The reading, vocabulary, and mathematics tests of high school seniors from the National Longitudinal Study (NLS) of 1972 and the High School and Beyond (HSB) seniors of 1980 and 1982 were found to share…
Descriptors: Achievement Tests, Computer Simulation, Estimation (Mathematics), Latent Trait Theory
Livingston, Samuel A. – 1978
The traditional reliability coefficient and standard error of measurement are not adequate measures of reliability for tests used to make pass/fail decisions. Answering the important reliability questions requires estimation of the joint distribution of true and observed scores. Lord's "Method 20" estimates this distribution without the…
Descriptors: Cutting Scores, Decision Making, Efficiency, Error of Measurement
Cheshier, Stephen R. – Engineering Education, 1975
Describes a simplified method for converting raw scores to standard scores and transforming them to "T-scores" for easy comparison of performance. Obtaining letter grades from T-scores is discussed. A reading list is included. (GH)
Descriptors: Achievement Rating, Error of Measurement, Evaluation Methods, Grades (Scholastic)
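A minimal sketch of the kind of conversion described: raw scores are standardized to z-scores and then linearly rescaled to T-scores with mean 50 and standard deviation 10 (the article's own worked procedure and grading rules are not reproduced here).

```python
import statistics

def t_scores(raw_scores):
    """Convert raw scores to standard (z) scores and then to T-scores
    (mean 50, standard deviation 10) for comparing performance across
    tests scored on different raw-score scales."""
    mean = statistics.mean(raw_scores)
    sd = statistics.stdev(raw_scores)          # sample standard deviation
    return [50 + 10 * (x - mean) / sd for x in raw_scores]

# Hypothetical raw scores on one exam
print([round(t, 1) for t in t_scores([62, 71, 55, 80, 67, 74])])
```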
Bliss, Leonard B. – 1981
The aim of this study was to show that the superiority of corrected-for-guessing scores over number right scores as true score estimates depends on the ability of examinees to recognize situations where they can eliminate one or more alternatives as incorrect and to omit items where they would only be guessing randomly. Previous investigations…
Descriptors: Algorithms, Guessing (Tests), Intermediate Grades, Multiple Choice Tests
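For reference, the standard correction-for-guessing (formula) score compared with number right scores in studies of this kind, where R is the number of items answered correctly, W the number answered incorrectly (omitted items are not counted), and k the number of options per item:

```latex
FS = R - \frac{W}{k - 1}
```

Under purely random guessing on a set of items, the deduction W/(k - 1) on average cancels the lucky hits, which is why the study's question of whether examinees can do better than random guessing, or know when to omit, matters.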
Divgi, D. R. – 1978
One aim of criterion-referenced testing is to classify an examinee without reference to a norm group; therefore, any statements about the dependability of such classification ought to be group-independent also. A population-independent index is proposed in terms of the probability of incorrect classification near the cutoff true score. The…
Descriptors: Criterion Referenced Tests, Cutting Scores, Difficulty Level, Error of Measurement
Brennan, Robert L. – 1974
An attempt is made to explore the use of subjective probabilities in the analysis of item data, especially criterion-referenced item data. Two assumptions are implicit: (1) one wants to obtain a maximum amount of information with respect to an item using a minimum number of subjects; and (2) once the item is validated, it may well be administered…
Descriptors: Confidence Testing, Criterion Referenced Tests, Guessing (Tests), Item Analysis
