Publication Date
| In 2026 | 0 |
| Since 2025 | 215 |
| Since 2022 (last 5 years) | 1084 |
| Since 2017 (last 10 years) | 2594 |
| Since 2007 (last 20 years) | 4955 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
| Practitioners | 653 |
| Teachers | 563 |
| Researchers | 250 |
| Students | 201 |
| Administrators | 81 |
| Policymakers | 22 |
| Parents | 17 |
| Counselors | 8 |
| Community | 7 |
| Support Staff | 3 |
| Media Staff | 1 |
| More ▼ | |
Location
| Turkey | 226 |
| Canada | 223 |
| Australia | 155 |
| Germany | 116 |
| United States | 99 |
| China | 90 |
| Florida | 86 |
| Indonesia | 82 |
| Taiwan | 78 |
| United Kingdom | 73 |
| California | 66 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 4 |
| Meets WWC Standards with or without Reservations | 4 |
| Does not meet standards | 1 |
Peer reviewedEggen, T. J. H. M. – Applied Psychological Measurement, 1999
Evaluates a method for item selection in adaptive testing that is based on Kullback-Leibler information (KLI) (T. Cover and J. Thomas, 1991). Simulation study results show that testing algorithms using KLI-based item selection perform better than or as well as those using Fisher information item selection. (SLD)
Descriptors: Adaptive Testing, Algorithms, Computer Assisted Testing, Selection
Peer reviewedAllen, Nancy L.; Donoghue, John R. – Journal of Educational Measurement, 1996
Examined the effect of complex sampling of items on the measurement of differential item functioning (DIF) using the Mantel-Haenszel procedure through a Monte Carlo study. Suggests the superiority of the pooled booklet method when items are selected for examinees according to a balanced incomplete block design. Discusses implications for other DIF…
Descriptors: Item Bias, Monte Carlo Methods, Research Design, Sampling
Peer reviewedDe Ayala, R. J.; Sava-Bolesta, Monica – Applied Psychological Measurement, 1999
Investigated the relationship between sample size, latent trait distribution, and item parameter estimation with the nominal response model through simulation. Results suggest guidelines for reasonable item parameter estimation. (SLD)
Descriptors: Estimation (Mathematics), Item Response Theory, Sample Size, Simulation
Peer reviewedGierl, Mark J.; Henderson, Diane; Jodoin, Michael; Klinger, Don – Journal of Experimental Education, 2001
Examined the influence of item parameter estimation errors across three item selection methods using the two- and three-parameter logistic item response theory (IRT) model. Tests created with the maximum no target and maximum target item selection procedures consistently overestimated the test information function. Tests created using the theta…
Descriptors: Estimation (Mathematics), Item Response Theory, Selection, Test Construction
Peer reviewedPenfield, Randall D. – Applied Measurement in Education, 2001
Compared the performance of three methods of assessing differential item functioning (DIF) across demographic groups, using: (1) the Mantel-Haenszel chi-square statistic with no adjustment to the alpha level; (2) the Mantel-Haenszel statistic with a Bonferroni adjusted alpha level; and (3) the generalized Mantel-Haenszel statistic. Simulation…
Descriptors: Chi Square, Demography, Item Bias, Power (Statistics)
Hayashi, Kentaro; Kamata, Akihito – Psychometrika, 2005
The asymptotic standard deviation (SD) of the alpha coefficient with standardized variables is derived under normality. The research shows that the SD of the standardized alpha coefficient becomes smaller as the number of examinees and/or items increase. Furthermore, this research shows that the degree of the dependence of the SD on the number of…
Descriptors: Correlation, Statistical Analysis, Measurement Techniques, Simulation
Van Onna, Marieke J. H. – Applied Psychological Measurement, 2004
Coefficient "H" is used as an index of scalability in nonparametric item response theory (NIRT). It indicates the degree to which a set of items rank orders examinees. Theoretical sampling distributions, however, have only been derived asymptotically and only under restrictive conditions. Bootstrap methods offer an alternative possibility to…
Descriptors: Sampling, Item Response Theory, Scaling, Comparative Analysis
Bock, R. Darrell; Brennan, Robert L.; Muraki, Eiji – Applied Psychological Measurement, 2002
In assessment programs where scores are reported for individual examinees, it is desirable to have responses to performance exercises graded by more than one rater. If more than one item on each test form is so graded, it is also desirable that different raters grade the responses of any one examinee. This gives rise to sampling designs in which…
Descriptors: Generalizability Theory, Test Items, Item Response Theory, Error of Measurement
Wilhelm, Jennifer – International Journal of Science Education, 2009
This paper reports an examination on gender differences in lunar phases understanding of 123 students (70 females and 53 males). Middle-level students interacted with the Moon through observations, sketching, journalling, two-dimensional and three-dimensional modelling, and classroom discussions. These lunar lessons were adapted from the Realistic…
Descriptors: Test Results, Test Items, Females, Astronomy
Xu, Xueli; von Davier, Matthias – ETS Research Report Series, 2008
Three strategies for linking two consecutive assessments are investigated and compared by analyzing reading data for the National Assessment of Educational Progress (NAEP) using the general diagnostic model. These strategies are compared in terms of marginal and joint expectations of skills, joint probabilities of skill patterns, and item…
Descriptors: National Competency Tests, Probability, Reading Achievement, Test Items
Abedi, Jamal; Leon, Seth; Kao, Jenny C. – National Center for Research on Evaluation, Standards, and Student Testing (CRESST), 2008
This study examines performance differences between students with disabilities and students without disabilities students using differential item functioning (DIF) analyses in a high-stakes reading assessment. Results indicated that for Grade 9, many items exhibited DIF. Items that exhibited DIF were more likely to be located in the second half…
Descriptors: Test Bias, Test Items, Student Evaluation, Disabilities
Ministerial Council for Education, Early Childhood Development and Youth Affairs (NJ1), 2008
The information and assessment materials in these resources have been designed to assist teachers to gauge their own students' proficiency in Information and Communication Technologies (ICT) literacy. By examining modules from the National Year 6 and Year 10 ICT Literacy Assessment teachers may be able to design similar tasks and to judge their…
Descriptors: Foreign Countries, National Programs, Testing Programs, National Competency Tests
Jung, Eunju; Liu, Kimy; Ketterlin-Geller, Leanne R.; Tindal, Gerald – Behavioral Research and Teaching, 2008
The purpose of this study was to develop general outcome measures (GOM) in mathematics so that teachers could focus their instruction on needed prerequisite skills. We describe in detail, the manner in which content-related evidence was established and then present a number of statistical analyses conducted to evaluate the technical adequacy of…
Descriptors: Item Analysis, Test Construction, Test Theory, Mathematics Tests
Liu, Kimy; Ketterlin-Geller, Leanne R.; Yovanoff, Paul; Tindal, Gerald – Behavioral Research and Teaching, 2008
BRT Math Screening Measures focus on students' mathematics performance in grade-level standards for students in grades 1-8. A total of 24 test forms are available with three test forms per grade corresponding to fall, winter, and spring testing periods. Each form contains computation problems and application problems. BRT Math Screening Measures…
Descriptors: Test Items, Test Format, Test Construction, Item Response Theory
Tan, Kim Chwee Daniel; Taber, Keith S.; Liu, Xiufeng; Coll, Richard K.; Lorenzo, Mercedes; Li, Jia; Goh, Ngoh Khang; Chia, Lian Sai – International Journal of Science Education, 2008
Previous studies have indicated that A-level students in the UK and Singapore have difficulty learning the topic of ionisation energy. A two-tier multiple-choice instrument developed in Singapore in an earlier study, the Ionisation Energy Diagnostic Instrument, was administered to A-level students in the UK, advanced placement high school students…
Descriptors: College Freshmen, Difficulty Level, Advanced Placement, Foreign Countries

Direct link
