Rogosa, David – 2000
In the reporting of individual student results from standardized tests in educational assessments, the percentile rank of the individual student is a major numerical indicator. For example, in the 1998 and 1999 California Standardized Testing and Reporting (STAR) program using the Stanford Achievement Test Series, Ninth Edition, Form T (Stanford…
Descriptors: Comparative Analysis, Elementary Secondary Education, Standardized Tests, Tables (Data)

Adler, Nurit; Guttman, Ruth – Educational and Psychological Measurement, 1982
Thirteen ability tests were administered as defined within a mapping sentence containing four content facets: rule type, expression mode, language of communication and dimensionality of portrayed object. Smallest Space Analysis of intercorrelations among test scores showed the radex structure of the two-dimensional space conformed to the…
Descriptors: Content Analysis, Factor Structure, Intelligence Tests, Scores

Jarjoura, David; Brennan, Robert L. – New Directions for Testing and Measurement, 1983
Multivariate generalizability techniques are used to bridge the gap between psychometric constraints and the tables of specifications needed in test development. Techniques are illustrated with results from the American College Testing Assessment Program. (Author/PN)
Descriptors: Data Analysis, Mathematical Models, Multivariate Analysis, Test Construction

Williams, Richard H.; Zimmerman, Donald W. – Journal of Experimental Education, 1980
It is suggested that error of measurement cannot be routinely incorporated into the "error term" in statistical tests, and that the reliability of test scores does not have the simple relationship to statistical inference that one might expect. (Author/GK)
Descriptors: Error of Measurement, Hypothesis Testing, Mathematical Formulas, Test Reliability

Green, Bert F. – American Psychologist, 1981
Discusses classical test theory, including test construction, administration, and use. Covers basic statistical concepts in measurement, reliability, and validity; principles of sound test construction and item analysis; test administration and scoring; procedures for transforming raw test data into scaled scores; and future prospects in test…
Descriptors: Scores, Statistics, Test Construction, Test Interpretation

Irvine, S.H.; Reuning, H. – Journal of Cross-Cultural Psychology, 1981
Under experimental conditions, Canadian students were subjected to group tests on simple cognitive tasks constructed on the basis of previous theoretical work on perceptual speed. Cross-cultural replications suggest that such tests can be validated within and across cultures. (Author/MJL)
Descriptors: Adolescents, Cognitive Tests, Cross Cultural Studies, Perception

Zumbo, Bruno D.; Pope, Gregory A.; Watson, Jackie E.; Hubley, Anita M. – Educational and Psychological Measurement, 1997
E. Roskam's (1985) conjecture that steeper item characteristic curve (ICC) "a" parameters (slopes) (and higher item total correlations in classical test theory) would be found with more concretely worded test items was tested with results from 925 young adults on the Eysenck Personality Questionnaire (H. Eysenck and S. Eysenck, 1975).…
Descriptors: Correlation, Personality Assessment, Personality Measures, Test Interpretation

Chase, Clint – Mid-Western Educational Researcher, 1996
Classical procedures for calculating the two indices of decision consistency (P and Kappa) for criterion-referenced tests require two testings on each child. Huynh, Peng, and Subkoviak have presented one-testing procedures for these indices. These indices can be estimated without any test administration using Ebel's estimates of the mean, standard…
Descriptors: Criterion Referenced Tests, Educational Research, Educational Testing, Estimation (Mathematics)
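The two indices named above can be computed directly once mastery/nonmastery classifications from two testings are cross-tabulated. The sketch below is a generic illustration of agreement (P) and kappa from a 2x2 table, not the one-testing procedures of Huynh, Peng, and Subkoviak; the cell counts are hypothetical.

```python
def decision_consistency(n11, n10, n01, n00):
    """P (raw agreement) and kappa for mastery classifications from two
    test administrations, given a 2x2 cross-table of counts:
    n11 = master/master, n00 = nonmaster/nonmaster, n10 and n01 = disagreements."""
    n = n11 + n10 + n01 + n00
    p = (n11 + n00) / n                      # proportion classified consistently
    # chance agreement expected from the marginal mastery rates
    p1 = (n11 + n10) / n                     # mastery rate, first testing
    p2 = (n11 + n01) / n                     # mastery rate, second testing
    p_chance = p1 * p2 + (1 - p1) * (1 - p2)
    kappa = (p - p_chance) / (1 - p_chance)  # agreement corrected for chance
    return p, kappa

p, kappa = decision_consistency(40, 10, 10, 40)  # P = 0.8, kappa = 0.6
```

Kappa discounts the agreement that would occur by chance alone, which is why it is reported alongside P for criterion-referenced cut-score decisions.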

Embretson, Susan E. – Applied Psychological Measurement, 1996
Conditions under which interaction effects estimated from classical total scores, rather than item response theory trait scores, can be misleading are discussed with reference to analysis of variance (ANOVA). When no interaction effects exist on the true latent variable, spurious interaction effects can be observed from the total score scale. (SLD)
Descriptors: Analysis of Variance, Interaction, Item Response Theory, Models
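The spurious-interaction phenomenon the abstract describes can be reproduced with a toy calculation (the model, item difficulties, and group means below are my own illustration, not Embretson's): two groups with identical latent gains show unequal expected number-correct gains, because the total-score scale is a nonlinear function of the trait and compresses near the ceiling.

```python
import math

def expected_total(theta, difficulties, a=1.7):
    """Expected number-correct score for trait level theta under a
    one-parameter logistic model (illustrative choice of model)."""
    return sum(1 / (1 + math.exp(-a * (theta - b))) for b in difficulties)

# Hypothetical 20-item test with difficulties clustered around 0
items = [-1 + 0.1 * i for i in range(20)]

# Two groups, each gaining exactly 1.0 on the latent scale
low_pre, low_post = -1.0, 0.0
high_pre, high_post = 1.0, 2.0

# No interaction on the latent scale ...
latent_interaction = (high_post - high_pre) - (low_post - low_pre)  # 0.0

# ... but a nonzero "interaction" on the total-score scale
gain_low = expected_total(low_post, items) - expected_total(low_pre, items)
gain_high = expected_total(high_post, items) - expected_total(high_pre, items)
score_interaction = gain_high - gain_low  # negative: high group "gains less"
```

The high-ability group's expected-score gain is smaller only because its scores sit near the test ceiling, which is exactly the artifact that an ANOVA on total scores can mistake for a real group-by-time interaction.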

Humphreys, Lloyd G. – Applied Psychological Measurement, 1996
The reliability of a gain is determined by the reliabilities of the components, the correlation between them, and their standard deviations. Reliability is not inherently low, but the components of gains in many investigations make low reliability likely and require caution in the use of gain scores. (SLD)
Descriptors: Achievement Gains, Change, Correlation, Error of Measurement
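The determinants listed in the abstract combine in the classical formula for the reliability of a difference score; a minimal sketch (function and variable names are mine, and the numerical inputs are hypothetical):

```python
def gain_reliability(r_xx, r_yy, r_xy, sd_x, sd_y):
    """Reliability of the gain Y - X under classical test theory:
    a function of the component reliabilities (r_xx, r_yy), the
    correlation between components (r_xy), and their standard deviations."""
    num = sd_x ** 2 * r_xx + sd_y ** 2 * r_yy - 2 * sd_x * sd_y * r_xy
    den = sd_x ** 2 + sd_y ** 2 - 2 * sd_x * sd_y * r_xy
    return num / den

# Highly correlated pre/post scores make the gain unreliable ...
r_high_corr = gain_reliability(0.9, 0.9, 0.8, 1.0, 1.0)  # 0.5
# ... while a lower pre/post correlation leaves the gain quite reliable.
r_low_corr = gain_reliability(0.9, 0.9, 0.4, 1.0, 1.0)   # ~0.83
```

The comparison illustrates Humphreys' point: gain reliability is not inherently low, but the typical combination of reliable, highly correlated pre- and post-tests drives it down.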

Williams, Richard H.; Zimmerman, Donald W. – Applied Psychological Measurement, 1996
The critiques by L. Collins and L. Humphreys in this issue illustrate problems with the use of gain scores. Collins' examples show that familiar formulas for the reliability of differences do not reflect the precision of measures of change. Additional examples demonstrate flaws in the conventional approach to reliability. (SLD)
Descriptors: Achievement Gains, Change, Correlation, Error of Measurement

Fan, Xitao – Educational and Psychological Measurement, 1998
This study empirically examined the behaviors of item and person statistics derived from item response theory and classical test theory, using data from a large-scale statewide assessment. Findings show that the person and item statistics from the two measurement frameworks are quite comparable. (SLD)
Descriptors: Item Response Theory, State Programs, Statistical Analysis, Test Items

Banerji, Madhabi – Journal of Applied Measurement, 2000
Validated data from a developmental mathematics assessment using classical and three-faceted Rasch measurement methods. Analysis of field test data for 289 elementary school students suggested that a unidimensional construct was being measured, as defined by Rasch criteria. Discusses limitations in confirming content-related validity of the…
Descriptors: Construct Validity, Content Validity, Elementary Education, Elementary School Students

Traub, Ross E. – Educational Measurement: Issues and Practice, 1997
Classical test theory is founded on the proposition that measurement error, a random latent variable, is a component of the observed score random variable. This article traces the history of the development of classical test theory, beginning in the early 20th century. (SLD)
Descriptors: Educational History, Educational Testing, Error of Measurement, Psychometrics
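The proposition Traub traces, that an observed score is a true score plus random error, yields the standard definition of reliability as the true-score share of observed variance. A minimal sketch of that identity (the example variances are hypothetical):

```python
def reliability(var_true, var_error):
    """Classical test theory: X = T + E with E uncorrelated with T,
    so var(X) = var(T) + var(E) and reliability = var(T) / var(X)."""
    var_observed = var_true + var_error
    return var_true / var_observed

rho = reliability(var_true=9.0, var_error=1.0)  # 0.9
```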

Beland, Anne; Mislevy, Robert J. – Journal of Educational Measurement, 1996
This article addresses issues in model building and statistical inference in the context of student modeling. The use of probability-based reasoning to explicate hypothesized and empirical relationships and to structure inference in the context of proportional reasoning tasks is discussed. Ideas are illustrated with an example concerning…
Descriptors: Cognitive Psychology, Models, Networks, Probability