ERIC - Search Results

Publication Date

In 2025	4
Since 2024	8
Since 2021 (last 5 years)	19
Since 2016 (last 10 years)	35
Since 2006 (last 20 years)	57

Descriptor

Test Validity	165
Test Reliability	68
Test Construction	52
Validity	52
Higher Education	36
Test Items	35
Predictive Validity	33
Scores	33
Item Analysis	31
Test Interpretation	30
Test Bias	29
Achievement Tests	28
Multiple Choice Tests	28
Evaluation Methods	26
Comparative Analysis	24
Scoring	23
Item Response Theory	21
Testing Problems	21
Models	20
Test Use	20
College Entrance Examinations	18
Measurement Techniques	18
Correlation	16
Academic Achievement	15
Criterion Referenced Tests	15
More ▼

Source

Journal of Educational…

252

Publication Type

Journal Articles	173
Reports - Research	118
Reports - Evaluative	30
Opinion Papers	14
Reports - Descriptive	10
Information Analyses	7
Speeches/Meeting Papers	7
Book/Product Reviews	1
Reports - General	1
Tests/Questionnaires	1

Education Level

Higher Education	6
Postsecondary Education	6
Secondary Education	4
Middle Schools	3
Elementary Education	2
Elementary Secondary Education	2
Junior High Schools	2
Grade 7	1
Grade 8	1
High Schools	1

Audience

Researchers	7
Practitioners	2

Location

Canada	2
Australia	1
Ireland	1
Israel	1
Jordan	1
United Kingdom	1

Laws, Policies, & Programs

What Works Clearinghouse Rating

Journal of Educational Measurement X

Showing 91 to 105 of 252 results Save | Export

Effects of Different Samples on Item and Test Characteristics of Criterion-Referenced Tests

Peer reviewed

Haladyna, Thomas Michael – Journal of Educational Measurement, 1974

Classical test construction and analysis procedures are applicable and appropriate for use with criterion referenced tests when samples of both mastery and nonmastery examinees are employed. (Author/BB)

Descriptors: Criterion Referenced Tests, Item Analysis, Mastery Tests, Test Construction

The Issue of Item and Test Variance for Criterion-Referenced Tests: A Reply

Peer reviewed

Woodson, M. I. Charles E. – Journal of Educational Measurement, 1974

The basis for selection of the calibration sample determines the kind of scale which will be developed. A random sample from a population of individuals leads to a norm-referenced scale, and a sample representative of abilities of a range of characteristics leads to a criterion-referenced scale. (Author/BB)

Descriptors: Criterion Referenced Tests, Discriminant Analysis, Item Analysis, Test Construction

Wechsler Intelligence Scales for Children--Revised

Peer reviewed

Tittle, Caroll Kehr – Journal of Educational Measurement, 1975

This review looks at these changes and their impact on the quality of the instrument: alteration of age-range from 5-15 to 6-16; development of new norms; improvement of manual in format and function; and a number of old items deleted and new ones added for the subtests. (RC)

Descriptors: Elementary Secondary Education, Guides, Intelligence Tests, Norms

The Use of Latent Partition Analysis to Identify Homogeneity of an Item Population

Peer reviewed

Hartke, Alan R. – Journal of Educational Measurement, 1978

Latent partition analysis is shown to be useful in determining the conceptual homogeneity of an item population. Such item populations are useful for mastery testing. Applications of latent partition analysis in assessing content validity are suggested. (Author/JKS)

Descriptors: Higher Education, Item Analysis, Item Sampling, Mastery Tests

Survey Testing on an Out-Of-Level Basis

Peer reviewed

Ayrer, James E.; McNamara, Thomas C. – Journal of Educational Measurement, 1973

Out-of-level'' testing is the assigning of pupils to levels of a standardized test on the basis of previous test scores rather than their present grade assignment. Test results of 1500 children were reviewed to see if their performance supported the rationale behind the practice. (Author/CB)

Descriptors: Achievement Rating, Elementary School Students, Standardized Tests, Test Interpretation

Another Look at "Cultural Fairness"

Peer reviewed

Darlington, Richard B. – Journal of Educational Measurement, 1971

Four definitions of cultural fairness" are critically examined. Suggestions for dealing with conflicts between the two goals of maximizing a test's validity and minimizing its culture-group discrimination, are presented. Terms in which this judgment should be made, and methods of using its results are described. (LR)

Descriptors: Cultural Background, Cultural Differences, Culture Fair Tests, Test Bias

Toward an Improved Measure of Remote Associational Ability

Peer reviewed

Worthen, Blaine R.; Clark, Philip M. – Journal of Educational Measurement, 1971

Descriptors: Association Measures, College Students, Creativity, Creativity Tests

A Mexican Version of the Peabody Picture Vocabulary Test.

Peer reviewed

Simon, Alan J.; Joiner, Lee M. – Journal of Educational Measurement, 1976

The purpose of this study was to determine whether a Mexican version of the Peabody Picture Vocabulary Test could be improved by directly translating both forms of the American test, then using decision procedures to select the better item of each pair. The reliability of the simple translations suffered. (Author/BW)

Descriptors: Early Childhood Education, Spanish, Test Construction, Test Format

A Model of Rater Behavior in Essay Grading Based on Signal Detection Theory

Peer reviewed

Direct link

DeCarlo, Lawrence T. – Journal of Educational Measurement, 2005

An approach to essay grading based on signal detection theory (SDT) is presented. SDT offers a basis for understanding rater behavior with respect to the scoring of construct responses, in that it provides a theory of psychological processes underlying the raters' behavior. The approach also provides measures of the precision of the raters and the…

Descriptors: Validity, Simulation, Grading, Item Response Theory

Validating A Priori Instructional Hierarchies

Peer reviewed

Airasian, Peter W.; Bart, William M. – Journal of Educational Measurement, 1975

Validation studies of learning hierarchies usually examine whether task relationships posited a priori are confirmed by student learning data. This method was compared with a non-posited task relationship where all possible task relationships were generated and investigated. A learning hierarchy in a seventh grade mathematics study reported by…

Descriptors: Difficulty Level, Intellectual Development, Junior High Schools, Learning Theories

Can Teachers Write Good True-False Test Items?

Peer reviewed

Ebel, Robert L. – Journal of Educational Measurement, 1975

Descriptors: Comparative Analysis, Multiple Choice Tests, Objective Tests, Teachers

A Comprehensive System for Item Analysis in Psychological Scale Construction

Peer reviewed

Schwartz, Steven A. – Journal of Educational Measurement, 1978

A method for the construction of scales which combines the rational (or intuitive) approach with an empirical (item analysis) approach is presented. A step-by-step procedure is provided. (Author/JKS)

Descriptors: Factor Analysis, Item Analysis, Measurement, Psychological Testing

Adaptation of an Intelligence Test from English to French

Peer reviewed

Bhushan, Vidya – Journal of Educational Measurement, 1974

Descriptors: Cultural Differences, French, Intelligence Tests, Languages

Postdiction Study of the Graduate Record Examination and Eight Semesters of College Grades

Peer reviewed

Humphreys, Lloyd G.; Taber, Thomas – Journal of Educational Measurement, 1973

Data from a postdictive study of the tests of the Graduate Record Examination and the eight semesters of undergraduate grade averages, each semester's average being computed independently of the rest, are presented. (Editor)

Descriptors: Aptitude Tests, Class Average, Correlation, Grade Point Average

Validity of the Discrimination Index as a Measure of Item Quality

Peer reviewed

Pyrczak, Fred – Journal of Educational Measurement, 1973

Despite the numerous individual illustrations in the literature showing how the discrimination index may be used to identify items with faults, its overall effectiveness as a measure of item quality, defined in terms of the presence or absence of faults, is not clear. This study investigates its validity. (Author/RK)

Descriptors: Correlation, Discriminant Analysis, Item Banks, Rating Scales

« Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | ... | 17

Bennett, Randy Elliot	4
Wainer, Howard	4
Whitney, Douglas R.	4
Clauser, Brian E.	3
Goldman, Roy D.	3
Hanna, Gerald S.	3
Kane, Michael T.	3
Linn, Robert L.	3
Novick, Melvin R.	3
Ackerman, Terry A.	2
Airasian, Peter W.	2
Algina, James	2
Baldwin, Peter	2
Bejar, Isaac I.	2
Brandenburg, Dale C.	2
Chang, Hua-Hua	2
Ebel, Robert L.	2
Embretson, Susan	2
Farr, Roger	2
Fitzpatrick, Anne R.	2
Frisbie, David A.	2
Haertel, Edward	2
Hakstian, A. Ralph	2
Hambleton, Ronald K.	2
More ▼

SAT (College Admission Test)	11
Comprehensive Tests of Basic…	3
Graduate Record Examinations	3
Stanford Achievement Tests	3
Differential Aptitude Test	2
Iowa Tests of Basic Skills	2
National Assessment of…	2
Peabody Picture Vocabulary…	2
ACT Interest Inventory	1
Advanced Placement…	1
Alabama High School…	1
Classroom Environment Scale	1
College and University…	1
General Aptitude Test Battery	1
Kaufman Assessment Battery…	1
Law School Admission Test	1
Lexile Scale of Reading	1
McCarthy Scales of Childrens…	1
Metropolitan Achievement Tests	1
Metropolitan Readiness Tests	1
My Class Inventory	1
National Teacher Examinations	1
Preschool Inventory	1
Program for International…	1
Remote Associates Test	1
More ▼