Showing 436 to 450 of 1,166 results
Peer reviewed
D'Amato, Rik Carl; And Others – Journal of School Psychology, 1988
Investigated the overlap between the Wechsler Intelligence Scale for Children-Revised (WISC-R) and the Halstead-Reitan Neuropsychological Battery (HRNB), in light of their use in diagnosing children's learning problems, using the scores of 1,181 children on both instruments. Results showed that the primary overlap between the measures was attributed to…
Descriptors: Adolescents, Children, Intelligence Tests, Test Items
Peer reviewed
Engelhard, George, Jr. – Educational and Psychological Measurement, 1992
A historical perspective on the concept of invariance in measurement theory is provided, describing sample-invariant item calibration and item-invariant measurement of individuals. Invariance as a key measurement concept is illustrated through the measurement theories of E. L. Thorndike, L. L. Thurstone, and G. Rasch. (SLD)
Descriptors: Behavioral Sciences, Educational History, Measurement Techniques, Psychometrics
Peer reviewed
Raykov, Tenko – Applied Psychological Measurement, 1998
Examines the relationship between Cronbach's coefficient alpha and the reliability of a composite of a prespecified set of interrelated nonhomogeneous components through simulation. Shows that alpha can over- or underestimate scale reliability at the population level. Illustrates the bias in terms of structural parameters. (SLD)
Descriptors: Reliability, Simulation, Statistical Bias, Structural Equation Models
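For readers who want a concrete sense of the effect Raykov describes, the short sketch below computes population coefficient alpha and the reliability of the unit-weighted composite implied by a one-factor congeneric model. The loadings and error (co)variances are invented for illustration and are not taken from the article.

```python
# Illustrative comparison of coefficient alpha with composite reliability under
# a congeneric model (Sigma = ll' + Theta). Parameter values are assumptions.
import numpy as np

def alpha_and_reliability(loadings, error_cov):
    """Population alpha and the proportion of composite variance due to the
    common factor (taken here as the 'true' reliability of the sum score)."""
    lam = np.asarray(loadings, dtype=float)
    theta = np.asarray(error_cov, dtype=float)
    k = lam.size
    sigma = np.outer(lam, lam) + theta           # population covariance matrix
    total_var = sigma.sum()                      # variance of the sum score
    alpha = k / (k - 1) * (1 - np.trace(sigma) / total_var)
    true_rel = lam.sum() ** 2 / total_var
    return alpha, true_rel

# Case 1: unequal loadings, uncorrelated errors -> alpha underestimates.
a1, r1 = alpha_and_reliability([0.9, 0.7, 0.5, 0.3], np.diag([0.4, 0.5, 0.6, 0.7]))

# Case 2: a positive error covariance (e.g., shared method variance)
# -> alpha can overestimate the composite's reliability.
theta2 = np.diag([0.4, 0.5, 0.6, 0.7])
theta2[0, 1] = theta2[1, 0] = 0.25
a2, r2 = alpha_and_reliability([0.9, 0.7, 0.5, 0.3], theta2)

print(f"uncorrelated errors: alpha={a1:.3f}  reliability={r1:.3f}")
print(f"correlated errors:   alpha={a2:.3f}  reliability={r2:.3f}")
```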
Peer reviewed
MacMillan, Peter D. – Journal of Experimental Education, 2000
Compared classical test theory (CTT), generalizability theory (GT), and multifaceted Rasch model (MFRM) approaches to detecting and correcting for rater variability using responses of 4,930 high school students graded by 3 raters on 9 scales. The MFRM approach identified far more raters as different than did the CTT analysis. GT and Rasch…
Descriptors: Generalizability Theory, High School Students, High Schools, Interrater Reliability
Peer reviewed
Brennan, Robert L. – Educational Measurement: Issues and Practice, 1997
The history of generalizability theory (G theory) is told from the perspective of one researcher's experiences, describing psychometric and scientific perspectives that influenced the development of G theory and its adoption. Work that remains to be done in the field is outlined. (SLD)
Descriptors: Educational Testing, Generalizability Theory, Measurement, Psychometrics
Peer reviewed
Wollack, James A. – Applied Psychological Measurement, 1997
Introduces a new Item Response Theory (IRT) based statistic for detecting answer copying. Compares this omega statistic with the best classical test theory-based statistic under various conditions, and finds omega superior based on Type I error rate and power. (SLD)
Descriptors: Cheating, Identification, Item Response Theory, Power (Statistics)
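The sketch below illustrates the general logic of an IRT-based copying index of this kind: standardize the observed number of response matches between a suspected copier and a source against the matches expected under the copier's model. Wollack's omega obtains the option probabilities from the nominal response model; here the probabilities and responses are simply assumed, so this is an illustration of the idea rather than the published statistic.

```python
# Simplified omega-like copying index: (observed matches - expected matches) / SD,
# with expectation and SD taken over the copier's assumed option probabilities.
import numpy as np

def omega_like_index(option_probs, copier_resp, source_resp):
    """option_probs[i, k]: model probability that the copier chooses option k on item i.
    copier_resp, source_resp: chosen option index for each item."""
    items = np.arange(len(copier_resp))
    observed = np.sum(copier_resp == source_resp)
    p_match = option_probs[items, source_resp]   # P(copier picks the source's option)
    expected = p_match.sum()
    sd = np.sqrt((p_match * (1 - p_match)).sum())
    return (observed - expected) / sd            # compared to a standard normal reference

# Invented data: 40 four-option items; the "copier" copies about half the items.
rng = np.random.default_rng(0)
n_items, n_opts = 40, 4
probs = rng.dirichlet(np.ones(n_opts), size=n_items)
source = rng.integers(0, n_opts, size=n_items)
copier = np.where(rng.random(n_items) < 0.5, source,
                  rng.integers(0, n_opts, size=n_items))
print(f"omega-like z statistic: {omega_like_index(probs, copier, source):.2f}")
```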
Peer reviewed
Kupermintz, Haggai – Journal of Educational Measurement, 2004
A decision-theoretic approach to the question of reliability in categorically scored examinations is explored. The concepts of true scores and errors are discussed as they deviate from conventional psychometric definitions, and measurement error in categorical scores is cast in terms of misclassifications. A reliability measure based on…
Descriptors: Test Reliability, Error of Measurement, Psychometrics, Test Theory
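As a simple illustration of treating reliability in terms of misclassification rather than score correlations, the sketch below computes the consistent-classification rate and Cohen's kappa from a two-form cross-classification table. The table is invented, and kappa serves as a generic stand-in rather than the specific measure proposed in the article.

```python
# Classification consistency and chance-corrected agreement for a categorically
# scored examination administered in two parallel forms (invented counts).
import numpy as np

# Rows: classification on form A; columns: classification on form B
# (categories: fail, pass, distinction).
table = np.array([
    [30,  8,  1],
    [ 7, 60,  9],
    [ 1,  6, 28],
], dtype=float)

n = table.sum()
p_obs = np.trace(table) / n                          # proportion classified consistently
p_chance = (table.sum(1) / n) @ (table.sum(0) / n)   # agreement expected by chance
kappa = (p_obs - p_chance) / (1 - p_chance)

print(f"consistent classification: {p_obs:.3f}")
print(f"misclassification rate:    {1 - p_obs:.3f}")
print(f"kappa:                     {kappa:.3f}")
```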
Mislevy, Robert J.; Behrens, John T.; Bennett, Randy E.; Demark, Sarah F.; Frezzo, Dennis C.; Levy, Roy; Robinson, Daniel H.; Rutstein, Daisy Wise; Shute, Valerie J.; Stanley, Ken; Winters, Fielding I. – National Center for Research on Evaluation, Standards, and Student Testing (CRESST), 2007
People use external knowledge representations (EKRs) to identify, depict, transform, store, share, and archive information. Learning how to work with EKRs is central to becoming proficient in virtually every discipline. As such, EKRs play central roles in curriculum, instruction, and assessment. Five key roles of EKRs in educational assessment are…
Descriptors: Educational Assessment, Computer Networks, Test Construction, Computer Assisted Testing
Peer reviewed
Huynh, Huynh – Psychometrika, 1977
A model for the setting of mastery cut scores is presented. The model, based on the beta-binomial test distribution, allows for hand calculation of cut scores. The model provides a simple way to explore the consequences of selecting a particular cut score. (Author/JKS)
Descriptors: Career Development, Cutting Scores, Mastery Tests, Mathematical Models
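The flavor of such hand calculations can be illustrated as follows: given an assumed beta distribution for true proportion-correct scores, tabulate the expected false-pass and false-fail rates for candidate observed-score cut scores. The parameter values below (a 20-item test, a beta(7, 3) true-score distribution, a mastery criterion of 0.60) are assumptions for the example, not values from the article.

```python
# Exploring the consequences of candidate cut scores under a beta-binomial model:
# a beta distribution for true scores combined with binomial observed scores.
import numpy as np
from scipy.stats import beta, binom

n_items = 20
a, b = 7.0, 3.0            # assumed beta distribution of true proportion-correct
pi0 = 0.60                 # true-score level defining "mastery"

# Discretize the true-score scale and weight by the beta density.
p = np.linspace(0.001, 0.999, 2000)
w = beta.pdf(p, a, b)
w /= w.sum()

# pmf_matrix[x, j] = P(observed score = x | true score = p[j])
x = np.arange(n_items + 1)
pmf_matrix = binom.pmf(x[:, None], n_items, p[None, :])

master = p >= pi0
for cut in range(10, 16):
    passes = x >= cut
    false_pass = (pmf_matrix[passes][:, ~master] * w[~master]).sum()
    false_fail = (pmf_matrix[~passes][:, master] * w[master]).sum()
    print(f"cut={cut:2d}  P(false pass)={false_pass:.3f}  P(false fail)={false_fail:.3f}")
```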
Peer reviewed
Brennan, Robert L.; And Others – Applied Psychological Measurement, 1988
Seven papers on technical and practical issues in equating are presented. Problems related to the use of conventional and item response theory equating methods, using pre- and post-smoothing to increase equipercentile equating's precision, and linear equating models for common-item nonequivalent-population design are discussed. (SLD)
Descriptors: Equated Scores, Latent Trait Theory, Research Problems, Scaling
Peer reviewed
Pumfrey, Peter D. – Journal of Research in Reading, 1987
Discusses, for the benefit of research workers and other test users, the ongoing controversy concerning the relative merits of conventional test theory and Rasch scaling in the construction of reading tests. Concludes that a great deal of further research is required to see whether these approaches are educationally valid. (JD)
Descriptors: Reading Research, Reading Tests, Test Construction, Test Format
Peer reviewed
Gardner, Robert C.; Erdle, Stephen – Educational and Psychological Measurement, 1986
This article evaluated criticisms by Stevens and Aleamoni (1986) of an article by Gardner and Erdle (1984) on aggregation using either raw or standard scores. It was demonstrated that their criticisms were unfounded. (Author)
Descriptors: Correlation, Factor Analysis, Raw Scores, Scores
Peer reviewed
Penfield, Douglas A.; Koffler, Stephen L. – Educational and Psychological Measurement, 1986
The development of a nonparametric K-sample test for equality of slopes using Puri's generalized L statistic is presented. The test is recommended when the assumptions underlying the parametric model are violated. This procedure replaces original data with either ranks (for data with heavy tails) or normal scores (for data with light tails).…
Descriptors: Mathematical Models, Nonparametric Statistics, Regression (Statistics), Sampling
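A rough sketch of the transformation step described above follows: the pooled responses are replaced with ranks or van der Waerden normal scores, and slope homogeneity across the K groups is then checked with an ordinary F-test on the transformed data. The F-test is a generic stand-in for Puri's generalized L statistic, and the data are simulated.

```python
# Rank-type replacement of the responses followed by a test that the K
# regression slopes are equal (generic stand-in, not Puri's L statistic).
import numpy as np
from scipy import stats

def rank_scores(y, kind="ranks"):
    """Plain ranks (suggested for heavy-tailed data) or van der Waerden
    normal scores (for light-tailed data)."""
    r = stats.rankdata(y)
    return r if kind == "ranks" else stats.norm.ppf(r / (len(y) + 1))

def slope_homogeneity_test(x_groups, y_groups, kind="ranks"):
    """F-test comparing a common-slope model with group-specific slopes,
    fitted to the transformed pooled responses."""
    g = len(x_groups)
    x = np.concatenate(x_groups)
    y = rank_scores(np.concatenate(y_groups), kind)
    labels = np.concatenate([np.full(len(xg), i) for i, xg in enumerate(x_groups)])

    D = np.column_stack([(labels == i).astype(float) for i in range(g)])
    X_common = np.column_stack([D, x])                  # separate intercepts, one slope
    X_separate = np.column_stack([D, D * x[:, None]])   # separate intercepts and slopes

    rss = lambda X: np.sum((y - X @ np.linalg.lstsq(X, y, rcond=None)[0]) ** 2)
    rss0, rss1 = rss(X_common), rss(X_separate)
    df1, df2 = g - 1, len(y) - 2 * g
    F = ((rss0 - rss1) / df1) / (rss1 / df2)
    return F, stats.f.sf(F, df1, df2)

# Simulated heavy-tailed data: the third group has a steeper slope.
rng = np.random.default_rng(1)
x_groups = [rng.uniform(0, 10, 40) for _ in range(3)]
y_groups = [b * xg + rng.standard_t(df=3, size=40)
            for b, xg in zip([1.0, 1.0, 1.6], x_groups)]
F, p = slope_homogeneity_test(x_groups, y_groups, kind="ranks")
print(f"F = {F:.2f}, p = {p:.4f}")
```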
Peer reviewed
Secolsky, Charles – Journal of Educational Measurement, 1987
For measuring the face validity of a test, Nevo suggested that test takers and nonprofessional users rate items on a five point scale. This article questions the ability of those raters and the credibility of the aggregated judgment as evidence of the validity of the test. (JAZ)
Descriptors: Content Validity, Measurement Techniques, Rating Scales, Test Items
Peer reviewed
Woodruff, David – Journal of Educational Statistics, 1986
The purpose of the present paper is to derive linear equating methods for the common item nonequivalent populations design from explicitly stated congeneric type test score models. The equating methods developed are compared with previously developed methods and applied to five professionally constructed examinations administered to approximately…
Descriptors: Equated Scores, Equations (Mathematics), Mathematical Models, Scores
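For context, the sketch below implements one standard member of this family, the Tucker linear method for the common-item nonequivalent-populations design, rather than the congeneric-model derivation in the paper; all summary statistics are invented for illustration.

```python
# Tucker linear equating for the common-item nonequivalent-populations design:
# synthetic-population moments from anchor-test regressions, then a linear
# conversion of X-scores to the Y scale.
import math

def tucker_linear_equating(stats1, stats2, w1=0.5):
    """stats1: (mean_X, var_X, mean_V, var_V, cov_XV) for the group taking form X;
    stats2: (mean_Y, var_Y, mean_V, var_V, cov_YV) for the group taking form Y.
    Returns (slope, intercept) of the function equating X-scores to the Y scale."""
    mx, vx, mv1, vv1, cxv = stats1
    my, vy, mv2, vv2, cyv = stats2
    w2 = 1.0 - w1
    g1, g2 = cxv / vv1, cyv / vv2          # regression slopes of X and Y on the anchor V
    dmu, dvar = mv1 - mv2, vv1 - vv2

    mus_x = mx - w2 * g1 * dmu             # synthetic-population means and variances
    mus_y = my + w1 * g2 * dmu
    vars_x = vx - w2 * g1**2 * dvar + w1 * w2 * g1**2 * dmu**2
    vars_y = vy + w1 * g2**2 * dvar + w1 * w2 * g2**2 * dmu**2

    slope = math.sqrt(vars_y / vars_x)
    return slope, mus_y - slope * mus_x

# Invented summary statistics (mean, variance, anchor mean, anchor variance, covariance).
slope, intercept = tucker_linear_equating(
    stats1=(42.0, 64.0, 15.0, 9.0, 18.0),
    stats2=(40.0, 60.0, 14.2, 8.5, 16.0),
)
print(f"equated Y-score for x=45: {slope * 45 + intercept:.2f}")
```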