ERIC - Search Results

Showing all 11 results

Best Linear Prediction of Composite Universe Scores.

Peer reviewed

Jarjoura, David – Psychometrika, 1983

The problem of predicting universe scores for samples of examinees based on their responses to samples of items is treated. The measurement model categorizes items according to the cells of a table of test specifications, and the linear function derived for minimizing error variance in prediction uses responses to these categories. (Author/JKS)

Descriptors: Error of Measurement, Generalizability Theory, Item Sampling, Prediction

Binomial Test Models for Domain-Referenced Testing.

van den Brink, Wulfert – Evaluation in Education: International Progress, 1982

Binomial models for domain-referenced testing are compared, emphasizing the assumptions underlying the beta-binomial model. Advantages and disadvantages are discussed. A proposed item sampling model is presented which takes the effect of guessing into account. (Author/CM)

Descriptors: Comparative Analysis, Criterion Referenced Tests, Item Sampling, Measurement Techniques

An Evaluation of the MMPI-2 and MMPI-A True Response Inconsistency (TRIN) Scales

Peer reviewed

Handel, Richard W.; Arnau, Randolph C.; Archer, Robert P.; Dandy, Kristina L. – Assessment, 2006

The Minnesota Multiphasic Personality Inventory--Adolescent (MMPI-A) and Minnesota Multiphasic Personality Inventory--2 (MMPI-2) True Response Inconsistency (TRIN) scales are measures of acquiescence and nonacquiescence included among the standard validity scales on these instruments. The goals of this study were to evaluate the effectiveness of…

Descriptors: Adolescents, Protocol Analysis, Effect Size, Personality Measures

Estimating the Imputed Social Cost of Errors of Measurement.

Peer reviewed

Lord, Frederic M. – Psychometrika, 1985

Given a loss function, an asymptotic method for optimal test design for a specified target population of examinees is presented. Also, of more practical use, given an existing unidimensional test and target population, a way is presented to find the loss function for which the test is optimal. (NSF)

Descriptors: Error of Measurement, Higher Education, Item Sampling, Latent Trait Theory

Criterion-Referenced Test Interpretations of "Classical" Measurement Theory.

Epstein, Kenneth I.; Knerr, Claramae S. – 1976

The literature on criterion referenced testing is full of discussions concerning whether classical measurement techniques are appropriate, whether variance is necessary, whether new indices of reliability are needed, and the like. What appears to be lacking, however, is a clear and simple discussion of why the problems occur. This paper suggests…

Descriptors: Career Development, Criterion Referenced Tests, Item Analysis, Item Sampling

A Basic Test Theory Generalizable to Tailored Testing. Technical Report No. 1.

Cliff, Norman – 1975

Measures of consistency and completeness of order relations derived from test-type data are proposed. The measures are generalized to apply to incomplete data such as tailored testing. The measures are based on consideration of the items-plus-persons by items-plus-persons matrix as an adjacency matrix in which a 1 means that the row element…

Descriptors: Adaptive Testing, Career Development, Computer Oriented Programs, Individual Differences

Some Item Analysis and Test Theory for a System of Computer-Assisted Test Construction for Individualized Instruction

Peer reviewed

Lord, Frederic M. – Applied Psychological Measurement, 1977

Under given conditions, conventional testing and computer-generated repeatable testing (CGRT) are equally effective for estimating examinee ability; CGRT is more effective for estimating the mean ability level of a group and less effective for estimating ability differences among individuals. These conclusions are drawn from domain-referenced test…

Descriptors: Career Development, Computer Assisted Testing, Difficulty Level, Group Norms

Introduction to Rasch Measurement: Some Implications for Languages.

Theunissen, Phiel J. J. M. – 1983

Any systematic approach to the assessment of students' ability implies the use of a model. The more explicit the model is, the more its users know about what they are doing and what the consequences are. The Rasch model is a strong model where measurement is a bonus of the model itself. It is based on four ideas: (1) separation of observable…

Descriptors: Ability Grouping, Difficulty Level, Evaluation Criteria, Item Sampling

Riding the Rasch Tiger. Part 1: Laying the Item Bank Foundation (Paul Volker Would Approve).

Forster, Fred – 1987

Studies carried out over a 12-year period addressed fundamental questions on the use of Rasch-based item banks. Large field tests of reading, mathematics, and science items administered in grades 3-8, as well as standardized test results, were used to explore the possible effects of many factors on item calibrations. In general, the results…

Descriptors: Achievement Tests, Difficulty Level, Elementary Education, Item Analysis

The Paradox of Criterion-Referenced Measurement.

Haladyna, Tom – 1976

The existence of criterion-referenced (CR) measurement is questioned in this paper. Despite beliefs that differences exist between two alternative forms of measurement, CR and Norm Referenced (NR), an analysis of philosophical and psychological descriptions of measurement, as well as a growing number of empirical studies, reveal that the common…

Descriptors: Academic Standards, Achievement Tests, Career Development, Comparative Analysis

Testing and the Public Interest.

Educational Testing Service, Princeton, NJ. – 1977

The 1976 Educational Testing Service (ETS) Invitational Conference served as a platform for individuals who have been prominent in educational measurement and research to present their views on issues surrounding the testing controversy. The 1976 ETS "The Testing Scene: Chaos and Controversy," presents a historical review of events surrounding the…

Descriptors: Achievement Tests, Adaptive Testing, Awards, Career Development