ERIC - Search Results

Publication Date

In 2026	0
Since 2025	8
Since 2022 (last 5 years)	36
Since 2017 (last 10 years)	115
Since 2007 (last 20 years)	378

Descriptor

Test Theory	1166
Test Items	262
Test Reliability	252
Test Construction	246
Test Validity	245
Psychometrics	183
Scores	176
Item Response Theory	168
Foreign Countries	160
Item Analysis	141
Statistical Analysis	134
Higher Education	132
Mathematical Models	132
Measurement Techniques	123
Comparative Analysis	121
Correlation	114
Error of Measurement	114
Latent Trait Theory	112
Test Interpretation	112
Testing	111
Evaluation Methods	106
Models	98
Testing Problems	93
Elementary Secondary Education	90
Difficulty Level	85
More ▼

Education Level

Higher Education	96
Postsecondary Education	66
Secondary Education	50
Elementary Education	40
Elementary Secondary Education	29
Middle Schools	27
High Schools	24
Junior High Schools	22
Grade 8	18
Grade 7	14
Grade 4	13
Grade 6	11
Adult Education	10
Early Childhood Education	10
Grade 5	10
Intermediate Grades	10
Grade 3	9
Primary Education	6
Grade 2	4
Preschool Education	4
Grade 10	3
Grade 9	3
Kindergarten	3
Grade 1	2
Grade 12	2
More ▼

Audience

Researchers	81
Practitioners	42
Teachers	22
Students	6
Administrators	5
Policymakers	4
Counselors	2

Location

United States	17
United Kingdom (England)	15
Canada	14
Australia	13
Turkey	12
Sweden	8
United Kingdom	8
Netherlands	7
Texas	7
New York	6
Taiwan	6
United Kingdom (Great Britain)	6
Florida	5
Japan	5
Spain	5
Tennessee	5
United Kingdom (Wales)	5
California	4
Colorado	4
Israel	4
Chile	3
China	3
Germany	3
Illinois	3
Indonesia	3
More ▼

Laws, Policies, & Programs

No Child Left Behind Act 2001	4
Elementary and Secondary…	3
Individuals with Disabilities…	3

What Works Clearinghouse Rating

Showing 496 to 510 of 1,166 results Save | Export

Error of Measurement and Statistical Inference: Some Anomalies.

Peer reviewed

Williams, Richard H.; Zimmerman, Donald W. – Journal of Experimental Education, 1980

It is suggested that error of measurement cannot be routinely incorporated into the "error term" in statistical tests, and that the reliability of test scores does not have the simple relationship to statistical inference that one might expect. (Author/GK)

Descriptors: Error of Measurement, Hypothesis Testing, Mathematical Formulas, Test Reliability

A Primer of Testing.

Peer reviewed

Green, Bert F. – American Psychologist, 1981

Discusses classical test theory, including test construction, administration, and use. Covers basic statistical concepts in measurement, reliability, and validity; principles of sound test construction and item analysis; test administration and scoring; procedures for transforming raw test data into scaled scores; and future prospects in test…

Descriptors: Scores, Statistics, Test Construction, Test Interpretation

"Perceptual Speed" and Cognitive Controls: Tasks in Reconstructing Group Test Theory and Practice within and across Cultures.

Peer reviewed

Irvine, S.H.; Reuning, H. – Journal of Cross-Cultural Psychology, 1981

Under experimental conditions, Canadian students were subjected to group tests on simple cognitive tasks constructed on the basis of previous theoretical work on perceptual speed. Cross-cultural replications suggest that such tests can be validated within and across cultures. (Author/MJL)

Descriptors: Adolescents, Cognitive Tests, Cross Cultural Studies, Perception

An Empirical Test of Roskam's Conjecture about the Interpretation of an ICC Parameter in Personality Inventories.

Peer reviewed

Zumbo, Bruno D.; Pope, Gregory A.; Watson, Jackie E.; Hubley, Anita M. – Educational and Psychological Measurement, 1997

E. Roskam's (1985) conjecture that steeper item characteristic curve (ICC) "a" parameters (slopes) (and higher item total correlations in classical test theory) would be found with more concretely worded test items was tested with results from 925 young adults on the Eysenck Personality Questionnaire (H. Eysenck and S. Eysenck, 1975).…

Descriptors: Correlation, Personality Assessment, Personality Measures, Test Interpretation

Estimating the Reliability of Criterion-Referenced Tests before Administration.

Peer reviewed

Chase, Clint – Mid-Western Educational Researcher, 1996

Classical procedures for calculating the two indices of decision consistency (P and Kappa) for criterion-referenced tests require two testings on each child. Huynh, Peng, and Subkoviak have presented one-testing procedures for these indices. These indices can be estimated without any test administration using Ebel's estimates of the mean, standard…

Descriptors: Criterion Referenced Tests, Educational Research, Educational Testing, Estimation (Mathematics)

Item Response Theory Models and Spurious Interaction Effects in Factorial ANOVA Designs.

Peer reviewed

Embretson, Susan E. – Applied Psychological Measurement, 1996

Conditions under which interaction effects estimated from classical total scores, rather than item response theory trait scores, can be misleading are discussed with reference to analysis of variance (ANOVA). When no interaction effects exist on the true latent variable, spurious interaction effects can be observed from the total score scale. (SLD)

Descriptors: Analysis of Variance, Interaction, Item Response Theory, Models

Linear Dependence on Gain Scores in Their Components Imposes Constraints on Their Use and Interpretation: Comment on "Are Simple Gain Scores Obsolete?"

Peer reviewed

Humphreys, Lloyd G. – Applied Psychological Measurement, 1996

The reliability of a gain is determined by the reliabilities of the components, the correlation between them, and their standard deviations. Reliability is not inherently low, but the components of gains in many investigations make low reliability likely and require caution in the use of gain scores. (SLD)

Descriptors: Achievement Gains, Change, Correlation, Error of Measurement

Commentary on the Commentaries of Collins and Humphreys.

Peer reviewed

Williams, Richard H.; Zimmerman, Donald W. – Applied Psychological Measurement, 1996

The critiques by L. Collins and L. Humphreys in this issue illustrate problems with the use of gain scores. Collins' examples show that familiar formulas for the reliability of differences do not reflect the precision of measures of change. Additional examples demonstrate flaws in the conventional approach to reliability. (SLD)

Descriptors: Achievement Gains, Change, Correlation, Error of Measurement

Item Response Theory and Classical Test Theory: An Empirical Comparison of Their Item/Person Statistics.

Peer reviewed

Fan, Xitao – Educational and Psychological Measurement, 1998

This study empirically examined the behaviors of item and person statistics derived from item response theory and classical test theory, focusing on item and person statistics and using a large-scale statewide assessment. Findings show that the person and item statistics from the two measurement frameworks are quite comparable. (SLD)

Descriptors: Item Response Theory, State Programs, Statistical Analysis, Test Items

Construct Validity of Scores/Measures from a Developmental Assessment in Mathematics Using Classical and Many-Facet Rasch Measurement.

Peer reviewed

Banerji, Madhabi – Journal of Applied Measurement, 2000

Validated data from a developmental mathematics assessment using classical and three-faceted Rasch measurement methods. Analysis of field test data for 289 elementary school students suggested that a unidimensional construct was being measured, as defined by Rasch criteria. Discusses limitations in confirming content-related validity of the…

Descriptors: Construct Validity, Content Validity, Elementary Education, Elementary School Students

Classical Test Theory in Historical Perspective.

Peer reviewed

Traub, Ross E. – Educational Measurement: Issues and Practice, 1997

Classical test theory is founded on the proposition that measurement error, a random latent variable, is a component of the observed score random variable. This article traces the history of the development of classical test theory, beginning in the early 20th century. (SLD)

Descriptors: Educational History, Educational Testing, Error of Measurement, Psychometrics

Probability-Based Inference in a Domain of Proportional Reasoning Tasks.

Peer reviewed

Beland, Anne; Mislevy, Robert J. – Journal of Educational Measurement, 1996

This article addresses issues in model building and statistical inference in the context of student modeling. The use of probability-based reasoning to explicate hypothesized and empirical relationships and to structure inference in the context of proportional reasoning tasks is discussed. Ideas are illustrated with an example concerning…

Descriptors: Cognitive Psychology, Models, Networks, Probability

Reliability as a Function of the Number of Item Options Derived from the "Knowledge or Random Guessing" Model

Peer reviewed

Direct link

MacCann, Robert G. – Psychometrika, 2004

For (0, 1) scored multiple-choice tests, a formula giving test reliability as a function of the number of item options is derived, assuming the "knowledge or random guessing model," the parallelism of the new and old tests (apart from the guessing probability), and the assumptions of classical test theory. It is shown that the formula is a more…

Descriptors: Guessing (Tests), Multiple Choice Tests, Test Reliability, Test Theory

Features of Groups and Status Hierarchies in Girls' and Boys' Early Adolescent Peer Networks

Peer reviewed

Direct link

Gest, Scott D.; Davidson, Alice J.; Rulison, Kelly L.; Moody, James; Welsh, Janet A. – New Directions for Child and Adolescent Development, 2007

The near universality of gender segregation in middle childhood and early adolescence has stimulated extensive research on sex differences in peer relationship processes. Recent reviews of the literature suggest that although some claims of two-cultures theory have clear empirical support, such as strong preference for same-sex peers over…

Descriptors: Early Adolescents, Peer Relationship, Friendship, Peer Groups

Quality Assurance of Multiple-Choice Tests

Peer reviewed

Direct link

Bush, Martin E. – Quality Assurance in Education: An International Perspective, 2006

Purpose: To provide educationalists with an understanding of the key quality issues relating to multiple-choice tests, and a set of guidelines for the quality assurance of such tests. Design/methodology/approach: The discussion of quality issues is structured to reflect the order in which those issues naturally arise. It covers the design of…

Descriptors: Multiple Choice Tests, Test Reliability, Educational Quality, Quality Control

« Previous Page | Next Page »

Pages: 1 | ... | 30 | 31 | 32 | 33 | 34 | 35 | 36 | 37 | 38 | ... | 78

Educational and Psychological…	63
Psychometrika	48
Journal of Educational…	35
Applied Psychological…	34
ProQuest LLC	26
Educational Measurement:…	23
Language Testing	15
Measurement:…	15
Journal of Educational…	13
Online Submission	13
Assessment in Education:…	12
International Journal of…	12
International Journal of…	11
Applied Measurement in…	10
Journal of Educational and…	10
Journal of Experimental…	8
Alberta Journal of…	7
ETS Research Report Series	7
Journal of School Psychology	7
Annual Review of Applied…	6
Educational Research and…	6
Intelligence	6
Physical Review Physics…	6
Practical Assessment,…	6
School Psychology Review	6
More ▼

Mislevy, Robert J.	20
Zimmerman, Donald W.	15
van der Linden, Wim J.	15
Sinharay, Sandip	9
Andrich, David	8
Haladyna, Tom	7
Wilcox, Rand R.	7
Williams, Richard H.	7
Yen, Wendy M.	7
Brennan, Robert L.	6
Dorans, Neil J.	6
Haberman, Shelby J.	6
Holland, Paul W.	6
Huynh, Huynh	6
Prather, Edward E.	6
Wainer, Howard	6
Baird, Jo-Anne	5
Cliff, Norman	5
Petscher, Yaacov	5
Roid, Gale	5
Thompson, Bruce	5
Tindal, Gerald	5
Zumbo, Bruno D.	5
Engelhard, George, Jr.	4
More ▼

Journal Articles	733
Reports - Research	619
Reports - Evaluative	215
Speeches/Meeting Papers	187
Reports - Descriptive	120
Opinion Papers	113
Information Analyses	67
Dissertations/Theses -…	26
Guides - Non-Classroom	26
Tests/Questionnaires	26
Numerical/Quantitative Data	22
Books	13
Book/Product Reviews	11
Reference Materials -…	8
Collected Works - General	7
Guides - Classroom - Teacher	7
Collected Works - Proceedings	6
ERIC Publications	6
Guides - Classroom - Learner	6
Reports - General	5
Collected Works - Serials	4
Historical Materials	4
Dissertations/Theses -…	2
ERIC Digests in Full Text	2
Guides - General	2
More ▼

SAT (College Admission Test)	23
National Assessment of…	11
Wechsler Intelligence Scale…	11
Armed Services Vocational…	10
ACT Assessment	9
Graduate Record Examinations	7
Comprehensive Tests of Basic…	6
Program for International…	6
Test of English as a Foreign…	6
Trends in International…	5
California Achievement Tests	4
Kaufman Assessment Battery…	4
Stanford Binet Intelligence…	4
Bayley Scales of Infant…	3
Law School Admission Test	3
Stanford Achievement Tests	3
Strengths and Difficulties…	3
ACTFL Oral Proficiency…	2
Advanced Placement…	2
Alabama High School…	2
Childrens Depression Inventory	2
Eysenck Personality Inventory	2
General Aptitude Test Battery	2
Graduate Management Admission…	2
Learning and Study Strategies…	2
More ▼