Publication Date
| In 2026 | 0 |
| Since 2025 | 220 |
| Since 2022 (last 5 years) | 1089 |
| Since 2017 (last 10 years) | 2599 |
| Since 2007 (last 20 years) | 4960 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
| Practitioners | 653 |
| Teachers | 563 |
| Researchers | 250 |
| Students | 201 |
| Administrators | 81 |
| Policymakers | 22 |
| Parents | 17 |
| Counselors | 8 |
| Community | 7 |
| Support Staff | 3 |
| Media Staff | 1 |
| More ▼ | |
Location
| Turkey | 226 |
| Canada | 223 |
| Australia | 155 |
| Germany | 116 |
| United States | 99 |
| China | 90 |
| Florida | 86 |
| Indonesia | 82 |
| Taiwan | 78 |
| United Kingdom | 73 |
| California | 66 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 4 |
| Meets WWC Standards with or without Reservations | 4 |
| Does not meet standards | 1 |
Ariel, Adelaide; van der Linden, Wim J.; Veldkamp, Bernard P. – Journal of Educational Measurement, 2006
Item-pool management requires a balancing act between the input of new items into the pool and the output of tests assembled from it. A strategy for optimizing item-pool management is presented that is based on the idea of a periodic update of an optimal blueprint for the item pool to tune item production to test assembly. A simulation study with…
Descriptors: Item Banks, Simulation, Interaction, Test Construction
Jansen, M. G. H.; Glas, C. A. W. – Psychometrika, 2005
Two new tests for a model for the response times on pure speed tests by Rasch (1960) are proposed. The model is based on the assumption that the test response times are approximately gamma distributed, with known index parameters and unknown rate parameters. The rate parameters are decomposed in a subject ability parameter and a test difficulty…
Descriptors: Timed Tests, Reaction Time, Models, Difficulty Level
Karantonis, Ana; Sireci, Stephen G. – Educational Measurement: Issues and Practice, 2006
The Bookmark method for setting standards on educational tests is currently one of the most popular standard-setting methods. However, research to support the method is scarce. In this report, we review the published and unpublished literature on this method as well as some seminal work in the area of evaluating standard-setting studies. Our…
Descriptors: Academic Standards, Educational Testing, Literature Reviews, Validity
Wiberg, Marie – International Journal of Testing, 2006
A simulation study of a sequential computerized mastery test is carried out with items modeled with the 3 parameter logistic item response theory model. The examinees' responses are either identically distributed, not identically distributed, or not identically distributed together with estimation errors in the item characteristics. The…
Descriptors: Test Length, Computer Simulation, Mastery Tests, Item Response Theory
Kahana, Michael J.; Rizzuto, Daniel S.; Schneider, Abraham R. – Journal of Experimental Psychology: Learning, Memory, and Cognition, 2005
This article addresses the relation between item recognition and associative (cued) recall. Going beyond measures of performance on each task, the analysis focuses on the degree to which the contingency between successful recognition and successful recall of a studied item reflects the commonality of memory processes underlying the recognition and…
Descriptors: Correlation, Recognition (Psychology), Recall (Psychology), Models
Karabatsos, George; Sheu, Ching-Fan – Applied Psychological Measurement, 2004
This study introduces an order-constrained Bayes inference framework useful for analyzing data containing dichotomous scored item responses, under the assumptions of either the monotone homogeneity model or the double monotonicity model of nonparametric item response theory (NIRT). The framework involves the implementation of Gibbs sampling to…
Descriptors: Inferences, Nonparametric Statistics, Item Response Theory, Data Analysis
van der Linden, Wim J. – Journal of Educational and Behavioral Statistics, 2003
The Hetter and Sympson (1997; 1985) method is a method of probabilistic item-exposure control in computerized adaptive testing. Setting its control parameters to admissible values requires an iterative process of computer simulations that has been found to be time consuming, particularly if the parameters have to be set conditional on a realistic…
Descriptors: Law Schools, Adaptive Testing, Admission (School), Computer Assisted Testing
Lotz, Anja; Kinder, Annette – Journal of Experimental Psychology: Learning, Memory, and Cognition, 2006
In this article, the authors report 2 experiments that investigated the sources of information used in transfer and nontransfer tasks in artificial grammar learning. Multiple regression analyses indicated that 2 types of information about repeating elements were crucial for performance in both tasks: information about the repetition of adjacent…
Descriptors: Grammar, Multiple Regression Analysis, Test Items, Transfer of Training
Alonzo, Julie; Liu, Kimy; Tindal, Gerald – Behavioral Research and Teaching, 2008
This technical report describes the development of reading comprehension assessments designed for use as progress monitoring measures appropriate for 2nd Grade students. The creation, piloting, and technical adequacy of the measures are presented. The following are appended: (1) Item Specifications for MC [Multiple Choice] Comprehension - Passage…
Descriptors: Reading Comprehension, Reading Tests, Grade 2, Elementary School Students
Alonzo, Julie; Tindal, Gerald – Behavioral Research and Teaching, 2008
This technical report describes the development and piloting of reading comprehension measures developed for use by fifth-grade students as part of an online progress monitoring assessment system, http://easycbm.com. Each comprehension measure is comprised of an original work of narrative fiction approximately 1500 words in length followed by 20…
Descriptors: Reading Comprehension, Reading Tests, Grade 5, Multiple Choice Tests
Baldi, Stephane; Jin, Ying; Green, Patricia J.; Herget, Deborah – National Center for Education Statistics, 2007
The Program for International Student Assessment (PISA) is a system of international assessments administered by the Organization for Economic Cooperation and Development (OECD) that measures 15-year-olds' performance in reading literacy, mathematics literacy, and science literacy every 3 years. This report focuses on the performance of U.S.…
Descriptors: Student Evaluation, Comparative Analysis, International Education, Measures (Individuals)
Daro, Phil; Stancavage, Frances; Ortega, Moreica; DeStefano, Lizanne; Linn, Robert – American Institutes for Research, 2007
In Spring 2006,. the NAEP Validity Studies (NVS) Panel was asked by the National Center for Education Statistics (NCES) to undertake a validity study to examine the quality of the NAEP Mathematics Assessments at grades 4 and 8. Specifically, NCES asked the NVS Panel to address five questions: (1) Does the NAEP framework offer reasonable content…
Descriptors: National Competency Tests, Mathematics Achievement, Adaptive Testing, Quality Control
Mental Models of Elementary and Middle School Students in Analyzing Simple Battery and Bulb Circuits
Jabot, Michael; Henry, David – School Science and Mathematics, 2007
Written assessment items were developed to probe students' understanding of a variety of direct current (DC) resistive electric circuit concepts. The items were used to explore the mental models that grade 3-8 students use in explaining the direction of electric current and how electric current is affected by different configurations of simple…
Descriptors: Models, Elementary School Students, Middle School Students, Test Items
Afolabi, E. R. I. – Educational Research and Reviews, 2007
The study examined the effects of item format, self-concept and anxiety on response changing behaviour. Four hundred undergraduate students who offered a counseling psychology course in a Nigerian university participated in the study. Students' answers in multiple--choice and true--false formats of an achievement test were observed for response…
Descriptors: Undergraduate Students, Test Items, Self Concept, Multiple Choice Tests
DeMars, Christine E. – Educational Assessment, 2007
A series of 8 tests was administered to university students over 4 weeks for program assessment purposes. The stakes of these tests were low for students; they received course points based on test completion, not test performance. Tests were administered in a counterbalanced order across 2 administrations. Response time effort, a measure of the…
Descriptors: Reaction Time, Guessing (Tests), Testing Programs, College Students

Peer reviewed
Direct link
