ERIC - Search Results

Showing 5,161 to 5,175 of 9,552 results

A Note on the Covariance of the Mantel-Haenszel Log-Odds Ratio Estimator and the Sample Marginal Rates. Program Statistics Research Technical Report No. 89-85.

Holland, Paul W. – 1989

A simple technique, developed by A. Phillips (1987), is used to approximate the covariance between the Mantel-Haenszel log-odds-ratio estimator for a 2 x 2 x k table and the sample marginal proportions. These results are then applied to obtain an approximate variance estimate of an adjusted risk difference based on the Mantel-Haenszel odds-ratio…

Descriptors: Difficulty Level, Estimation (Mathematics), Item Bias, Risk

Quantifying Item Dependency by Fisher's Z.

Shen, Linjun – 1997

Three aspects of the usual approach to assessing local item dependency, Yen's "Q" (H. Huynh, H. Michaels, and S. Ferrara, 1995), deserve further investigation. Pearson correlation coefficients do not distribute normally when the coefficients are large, and thus cannot quantify the dependency well. In the second place, the accuracy of…

Descriptors: Ability, Estimation (Mathematics), Item Response Theory, Reliability

Detection of Aberrant Item Score Patterns: A Review of Recent Developments. Research Report 94-8.

Meijer, Rob R.; Sijtsma, Klaas – 1994

Methods for detecting item score patterns that are unlikely (aberrant) given that a parametric item response theory (IRT) model gives an adequate description of the data or given the responses of the other persons in the group are discussed. The emphasis here is on the latter group of statistics. These statistics can be applied when a…

Descriptors: Foreign Countries, Identification, Item Response Theory, Nonparametric Statistics

Identifying Nonuniform DIF in Polytomously Scored Test Items. ACT Research Report Series 94-1.

Spray, Judith; Miller, Tim – 1994

Computer simulations under three conditions of polytomous differential item functioning (DIF) compared the ability of three different statistical procedures to detect nonuniform DIF. The procedures were a nominal and an ordinal extension of the Mantel-Haenszel statistic, and logistic discriminant function analysis. Results showed that only the…

Descriptors: Computer Simulation, Identification, Item Bias, Sample Size

Rating Scale Analysis: Gauging the Impact of Positively and Negatively Worded Items.

Bergstrom, Betty A.; Lunz, Mary E. – 1998

This paper addresses questions of whether positively- and negatively-worded items measure the same construct and whether the rating scale categories "strongly agree" to "strongly disagree" are used in the same way for both types of items. Item response theory (IRT), specifically the Andrich Rating Scale Model (B. Wright and G.…

Descriptors: Adults, Item Response Theory, Rating Scales, Research Methodology

The Effect of Rounding Aggregated Item Ratings for Constructed Response Items in Mixed-Item Format Tests.

Sykes, Robert C.; Ito, Kyoko – 1998

A common procedure for obtaining multiple readings (ratings) for a constructed response item, especially in high-stakes tests, is to have two readers read the papers independently, with a third reading if the results differ by more than one point. This necessitates a scoring rule that specifies how the ratings will be aggregated into a single item…

Descriptors: Ability, Constructed Response, High Stakes Tests, Judges

A Generalizability Approach To Evaluating the Reliability of Testlet-Based Test Scores.

Lee, Guemin; Frisbie, David A. – 1997

Previous studies have indicated that the reliability of test scores composed of testlets might be overestimated by conventional item-based reliability estimation methods (R. Thorndike, 1953; A. Anastasi, 1988; S. Sireci, D. Thissen, and H. Wainer, 1991; H. Wainer and D. Thissen, 1996). This study used generalizability theory to investigate the…

Descriptors: Estimation (Mathematics), Generalizability Theory, Reliability, Scores

Cognitive-Developmental Hierarchies: A Search for Structure Using Item-Level Data.

Martinez, Michael E.; Simpson, R. Scott – 1999

Item-level statistics from ability and achievement tests have been underutilized as sources of data for building models of cognitive development. How item data can be used to build a cognitive-developmental map of proportional reasoning is demonstrated. The product of the analysis is a cognitive hierarchy with levels corresponding to categories of…

Descriptors: Ability, Achievement Tests, Cognitive Development, Cognitive Tests

The Effect of Sample Size on the Functioning of the Mantel-Haenszel Statistic.

Mazor, Kathleen M.; And Others – 1991

The Mantel-Haenszel (MH) procedure has become one of the most popular procedures for detecting differential item functioning. Valid results with relatively small numbers of examinees represent one of the advantages typically attributed to this procedure. In this study, examinee item responses were simulated to contain differentially functioning…

Descriptors: Difficulty Level, Item Bias, Item Response Theory, Sample Size

Reliability of Speeded Number-Right Multiple-Choice Tests. Research Report. RR-04-15

Peer reviewed
PDF on ERIC

Attali, Yigal – ETS Research Report Series, 2004

Contrary to common belief, reliability estimates of number-right multiple-choice tests are not inflated by speededness. Because examinees guess on questions when they run out of time, the responses to these questions show less consistency with the responses of other questions, and the reliability of the test will be decreased. The surprising…

Descriptors: Multiple Choice Tests, Timed Tests, Test Reliability, Guessing (Tests)

Joint and Conditional Maximum Likelihood Estimation for the Rasch Model for Binary Responses. Research Report. RR-04-20

Peer reviewed
PDF on ERIC

Haberman, Shelby J. – ETS Research Report Series, 2004

The usefulness of joint and conditional maximum-likelihood is considered for the Rasch model under realistic testing conditions in which the number of examinees is very large and the number of items is relatively large. Conditions for consistency and asymptotic normality are explored, effects of model error are investigated, measures of prediction…

Descriptors: Maximum Likelihood Statistics, Computation, Item Response Theory, Testing

The 1988 Tests of General Educational Development: A Preview.

Whitney, Douglas R.; And Others – 1985

This preview of the Tests of General Educational Development (GED) to be introduced in 1988 begins with a brief background of the review process that will result in the GED Test. An overview of committee recommendations then highlights five themes of Test Specifications Committee panel reports: the tests should (1) demand more highly developed…

Descriptors: Adult Education, High School Equivalency Programs, Test Format, Test Items

The Glenwood Assessment of Behavior of the Mentally Retarded: A Well-Factored Scale of Adaptive Behavior.

Larsen, Gary Y. – 1984

The paper describes the reasons for developing a new instrument to measure adaptive behavior of mentally retarded residents at Glenwood State Hospital-School and recounts the processes involved in constructing the new scale. Among complaints about the American Association on Mental Deficiency Adaptive Behavior Scale (ABS) are its inappropriateness…

Descriptors: Adaptive Behavior (of Disabled), Factor Analysis, Mental Retardation, Test Construction

Item Bias and Test Scores.

Scheuneman, Janice Dowd – 1982

The connection between item bias and test scores was investigated using a simulation approach. Two samples of hypothetical examinees were simulated using an item response theory model. The two samples were identical, except that the mean theta value of 1 sample was 5 less than the other. The simulated tests consisted of 50 items with characteristics…

Descriptors: Latent Trait Theory, Research Methodology, Research Problems, Simulation

The Effectiveness of Illustrated Items

Peer reviewed

Washington, William N.; Godfrey, R. Richard – Journal of Educational Measurement, 1974

Item statistics between illustrated and written items drawn from the same content areas were compared using F ratios. The results indicated: that illustrated items performed slightly better than matched written items; and that the best performing category of illustrated items was tables. (Author/BB)

Descriptors: Achievement Tests, Illustrations, Test Construction, Test Items
