ERIC - Search Results

Publication Date

In 2026	0
Since 2025	220
Since 2022 (last 5 years)	1089
Since 2017 (last 10 years)	2599
Since 2007 (last 20 years)	4960

Descriptor

Test Items	9552
Test Construction	2724
Foreign Countries	2185
Item Response Theory	1872
Difficulty Level	1624
Item Analysis	1502
Test Validity	1418
Test Reliability	1189
Multiple Choice Tests	1160
Scores	1137
Computer Assisted Testing	1058
Comparative Analysis	1024
Test Format	956
Higher Education	877
Achievement Tests	855
Statistical Analysis	852
Mathematics Tests	846
Psychometrics	835
Test Bias	772
Models	754
Student Evaluation	736
Language Tests	699
Correlation	695
Evaluation Methods	674
Scoring	633
More ▼

Author

van der Linden, Wim J.	69
Tindal, Gerald	50
Hambleton, Ronald K.	45
Alonzo, Julie	41
Chang, Hua-Hua	40
Plake, Barbara S.	40
Sinharay, Sandip	37
Reckase, Mark D.	36
Wainer, Howard	33
Dorans, Neil J.	32
Gierl, Mark J.	30
Sireci, Stephen G.	28
Wang, Wen-Chung	26
Cohen, Allan S.	25
Meijer, Rob R.	25
Samejima, Fumiko	24
Stocking, Martha L.	24
Anderson, Daniel	23
Zwick, Rebecca	23
Veldkamp, Bernard P.	22
Haladyna, Thomas M.	21
Kim, Seock-Ho	21
Wise, Steven L.	21
Kim, Sooyeon	20
More ▼

Education Level

Higher Education	1316
Postsecondary Education	1066
Secondary Education	928
Elementary Education	716
Middle Schools	421
High Schools	364
Elementary Secondary Education	359
Junior High Schools	321
Grade 8	256
Intermediate Grades	209
Grade 4	183
Early Childhood Education	178
Grade 5	134
Primary Education	126
Grade 7	113
Grade 3	111
Grade 6	107
Grade 9	70
Grade 2	56
Grade 10	53
Grade 12	52
Kindergarten	50
Adult Education	39
Grade 11	38
Grade 1	36
More ▼

Audience

Practitioners	653
Teachers	563
Researchers	250
Students	201
Administrators	81
Policymakers	22
Parents	17
Counselors	8
Community	7
Support Staff	3
Media Staff	1
More ▼

Location

Turkey	226
Canada	223
Australia	155
Germany	116
United States	99
China	90
Florida	86
Indonesia	82
Taiwan	78
United Kingdom	73
California	66
Japan	65
Netherlands	64
Iran	62
United Kingdom (England)	57
South Africa	48
New York	46
Missouri	45
Oklahoma	44
South Korea	44
Malaysia	42
Texas	42
Sweden	38
Israel	37
Singapore	37
More ▼

What Works Clearinghouse Rating

Meets WWC Standards without Reservations	4
Meets WWC Standards with or without Reservations	4
Does not meet standards	1

Showing 5,011 to 5,025 of 9,552 results Save | Export

Item Selection in Adaptive Testing with the Sequential Probability Ratio Test.

Peer reviewed

Eggen, T. J. H. M. – Applied Psychological Measurement, 1999

Evaluates a method for item selection in adaptive testing that is based on Kullback-Leibler information (KLI) (T. Cover and J. Thomas, 1991). Simulation study results show that testing algorithms using KLI-based item selection perform better than or as well as those using Fisher information item selection. (SLD)

Descriptors: Adaptive Testing, Algorithms, Computer Assisted Testing, Selection

Applying the Mantel-Haenszel Procedure to Complex Samples of Items.

Peer reviewed

Allen, Nancy L.; Donoghue, John R. – Journal of Educational Measurement, 1996

Examined the effect of complex sampling of items on the measurement of differential item functioning (DIF) using the Mantel-Haenszel procedure through a Monte Carlo study. Suggests the superiority of the pooled booklet method when items are selected for examinees according to a balanced incomplete block design. Discusses implications for other DIF…

Descriptors: Item Bias, Monte Carlo Methods, Research Design, Sampling

Item Parameter Recovery for the Nominal Response Model.

Peer reviewed

De Ayala, R. J.; Sava-Bolesta, Monica – Applied Psychological Measurement, 1999

Investigated the relationship between sample size, latent trait distribution, and item parameter estimation with the nominal response model through simulation. Results suggest guidelines for reasonable item parameter estimation. (SLD)

Descriptors: Estimation (Mathematics), Item Response Theory, Sample Size, Simulation

Minimizing the Influence of Item Parameter Estimation Errors in Test Development: A Comparison of Three Selection Procedures.

Peer reviewed

Gierl, Mark J.; Henderson, Diane; Jodoin, Michael; Klinger, Don – Journal of Experimental Education, 2001

Examined the influence of item parameter estimation errors across three item selection methods using the two- and three-parameter logistic item response theory (IRT) model. Tests created with the maximum no target and maximum target item selection procedures consistently overestimated the test information function. Tests created using the theta…

Descriptors: Estimation (Mathematics), Item Response Theory, Selection, Test Construction

Assessing Differential Item Functioning among Multiple Groups: A Comparison of Three Mantel-Haenszel Procedures.

Peer reviewed

Penfield, Randall D. – Applied Measurement in Education, 2001

Compared the performance of three methods of assessing differential item functioning (DIF) across demographic groups, using: (1) the Mantel-Haenszel chi-square statistic with no adjustment to the alpha level; (2) the Mantel-Haenszel statistic with a Bonferroni adjusted alpha level; and (3) the generalized Mantel-Haenszel statistic. Simulation…

Descriptors: Chi Square, Demography, Item Bias, Power (Statistics)

A Note on the Estimator of the Alpha Coefficient for Standardized Variables Under Normality

Peer reviewed

Direct link

Hayashi, Kentaro; Kamata, Akihito – Psychometrika, 2005

The asymptotic standard deviation (SD) of the alpha coefficient with standardized variables is derived under normality. The research shows that the SD of the standardized alpha coefficient becomes smaller as the number of examinees and/or items increase. Furthermore, this research shows that the degree of the dependence of the SD on the number of…

Descriptors: Correlation, Statistical Analysis, Measurement Techniques, Simulation

Estimates of the Sampling Distribution of Scalability Coefficient H

Peer reviewed

Direct link

Van Onna, Marieke J. H. – Applied Psychological Measurement, 2004

Coefficient "H" is used as an index of scalability in nonparametric item response theory (NIRT). It indicates the degree to which a set of items rank orders examinees. Theoretical sampling distributions, however, have only been derived asymptotically and only under restrictive conditions. Bootstrap methods offer an alternative possibility to…

Descriptors: Sampling, Item Response Theory, Scaling, Comparative Analysis

The Information in Multiple Ratings

Peer reviewed

Direct link

Bock, R. Darrell; Brennan, Robert L.; Muraki, Eiji – Applied Psychological Measurement, 2002

In assessment programs where scores are reported for individual examinees, it is desirable to have responses to performance exercises graded by more than one rater. If more than one item on each test form is so graded, it is also desirable that different raters grade the responses of any one examinee. This gives rise to sampling designs in which…

Descriptors: Generalizability Theory, Test Items, Item Response Theory, Error of Measurement

Measuring Human Performance on Clustering Problems: Some Potential Objective Criteria and Experimental Research Opportunities

Peer reviewed

Direct link

Brusco, Michael J. – Journal of Problem Solving, 2007

The study of human performance on discrete optimization problems has a considerable history that spans various disciplines. The two most widely studied problems are the Euclidean traveling salesperson problem and the quadratic assignment problem. The purpose of this paper is to outline a program of study for the measurement of human performance on…

Descriptors: Problem Solving, Performance, Measurement, Criticism

Item Selection Strategy for Reducing the Number of Items Rated in an Angoff Standard Setting Study

Peer reviewed

Direct link

Ferdous, Abdullah A.; Plake, Barbara S. – Educational and Psychological Measurement, 2007

In an Angoff standard setting procedure, judges estimate the probability that a hypothetical randomly selected minimally competent candidate will answer correctly each item in the test. In many cases, these item performance estimates are made twice, with information shared with the panelists between estimates. Especially for long tests, this…

Descriptors: Test Items, Probability, Item Analysis, Standard Setting (Scoring)

Vocabulary Development and Performance on Multiple-Choice Exams in Large Entry-level Courses

Peer reviewed
PDF on ERIC

Download full text

Direct link

Turner, Haley; Williams, Robert L. – Journal of College Reading and Learning, 2007

Scores on a vocabulary test given at the beginning of two semesters in a large entry-level course predicted performance on multiple-choice exams more strongly than pre-course knowledge and critical thinking. Words on the vocabulary instrument were derived from multiple-choice exam items in the course. Although commonly used in the course, these…

Descriptors: Vocabulary Development, Multiple Choice Tests, Scores, Introductory Courses

Using Qualitative Methods to Inform Scale Development

Peer reviewed
PDF on ERIC

Download full text

Rowan, Noell; Wulff, Dan – Qualitative Report, 2007

This article describes the process by which one study utilized qualitative methods to create items for a multi dimensional scale to measure twelve step program affiliation. The process included interviewing fourteen addicted persons while in twelve step focused treatment about specific pros (things they like or would miss out on by not being…

Descriptors: Qualitative Research, Measures (Individuals), Test Items, Test Construction

Deciding on the Number of Classes in Latent Class Analysis and Growth Mixture Modeling: A Monte Carlo Simulation Study

Peer reviewed

Direct link

Nylund, Karen L.; Asparouhov, Tihomir; Muthen, Bengt O. – Structural Equation Modeling: A Multidisciplinary Journal, 2007

Mixture modeling is a widely applied data analysis technique used to identify unobserved heterogeneity in a population. Despite mixture models' usefulness in practice, one unresolved issue in the application of mixture models is that there is not one commonly accepted statistical indicator for deciding on the number of classes in a study…

Descriptors: Test Items, Monte Carlo Methods, Program Effectiveness, Data Analysis

Gender Differences in Lunar-Related Scientific and Mathematical Understandings

Peer reviewed

Direct link

Wilhelm, Jennifer – International Journal of Science Education, 2009

This paper reports an examination on gender differences in lunar phases understanding of 123 students (70 females and 53 males). Middle-level students interacted with the Moon through observations, sketching, journalling, two-dimensional and three-dimensional modelling, and classroom discussions. These lunar lessons were adapted from the Realistic…

Descriptors: Test Results, Test Items, Females, Astronomy

Linking for the General Diagnostic Model. Research Report. ETS RR-08-08

Peer reviewed
PDF on ERIC

Download full text

Xu, Xueli; von Davier, Matthias – ETS Research Report Series, 2008

Three strategies for linking two consecutive assessments are investigated and compared by analyzing reading data for the National Assessment of Educational Progress (NAEP) using the general diagnostic model. These strategies are compared in terms of marginal and joint expectations of skills, joint probabilities of skill patterns, and item…

Descriptors: National Competency Tests, Probability, Reading Achievement, Test Items

« Previous Page | Next Page »

Pages: 1 | ... | 331 | 332 | 333 | 334 | 335 | 336 | 337 | 338 | 339 | ... | 637

Educational and Psychological…	416
Journal of Educational…	367
ProQuest LLC	246
Applied Psychological…	234
Applied Measurement in…	231
ETS Research Report Series	146
Educational Measurement:…	128
Journal of Educational and…	122
Online Submission	115
International Journal of…	105
Grantee Submission	98
Language Testing	93
Psychometrika	93
International Journal of…	80
Journal of Psychoeducational…	72
Educational Assessment	70
Practical Assessment,…	60
Measurement:…	57
Language Assessment Quarterly	55
Journal of Chemical Education	54
Behavioral Research and…	50
Journal of Experimental…	45
Physical Review Physics…	40
Journal of Experimental…	36
International Journal of…	35
More ▼

Journal Articles	5887
Reports - Research	5597
Reports - Evaluative	1556
Speeches/Meeting Papers	1168
Reports - Descriptive	796
Tests/Questionnaires	768
Guides - Classroom - Teacher	472
Guides - Non-Classroom	259
Dissertations/Theses -…	251
Numerical/Quantitative Data	185
Information Analyses	179
Opinion Papers	164
Guides - Classroom - Learner	162
Books	54
Collected Works - General	33
Multilingual/Bilingual…	32
Guides - General	31
Reports - General	21
Book/Product Reviews	20
ERIC Publications	20
Non-Print Media	16
ERIC Digests in Full Text	14
Collected Works - Proceedings	13
Reference Materials - General	13
Collected Works - Serials	12
More ▼

No Child Left Behind Act 2001	36
Individuals with Disabilities…	20
Every Student Succeeds Act…	5
Elementary and Secondary…	4
Race to the Top	4
Rehabilitation Act 1973…	4
Elementary and Secondary…	3
Head Start	3
Americans with Disabilities…	2
Comprehensive Education…	2
Higher Education Act…	2
Immigration Reform and…	2
Civil Rights Act 1964	1
Civil Rights Act 1964 Title…	1
Comprehensive Employment and…	1
Education Consolidation…	1
Education for All Handicapped…	1
Fair Labor Standards Act	1
Higher Education Act Title II	1
Higher Education Opportunity…	1
Improving Americas Schools…	1
Individuals with Disabilities…	1
Jeanne Clery Disclosure of…	1
Job Training Partnership Act…	1
Kentucky Education Reform Act…	1
More ▼

National Assessment of…	182
Program for International…	179
SAT (College Admission Test)	137
Trends in International…	114
Test of English as a Foreign…	85
Graduate Record Examinations	74
ACT Assessment	44
Advanced Placement…	34
Texas Educational Assessment…	32
Law School Admission Test	30
Wechsler Intelligence Scale…	26
Iowa Tests of Basic Skills	25
Progress in International…	25
Stanford Achievement Tests	24
Raven Progressive Matrices	22
Armed Services Vocational…	20
International English…	20
Peabody Picture Vocabulary…	20
California Achievement Tests	18
Comprehensive Tests of Basic…	18
Test of English for…	17
Metropolitan Achievement Tests	15
General Educational…	14
Graduate Management Admission…	14
Wechsler Adult Intelligence…	13
More ▼