College Board, 2010
This is the College Board's response to a research article by Drs. Maria Veronica Santelices and Mark Wilson in the Harvard Educational Review, entitled "Unfair Treatment? The Case of Freedle, the SAT, and the Standardization Approach to Differential Item Functioning" (see EJ930622).
Descriptors: Test Bias, College Entrance Examinations, Standardized Tests, Test Items
Koen, Joshua D.; Yonelinas, Andrew P. – Journal of Experimental Psychology: Learning, Memory, and Cognition, 2010
It is well established that the memory strength of studied items is more variable than the strength of new items on tests of recognition memory, but the reason why this occurs is poorly understood. One account for this "old item variance effect" is based on single-process theory, which proposes that this effect is due to variability in how well…
Descriptors: Test Items, Familiarity, Recognition (Psychology), Regression (Statistics)
Facon, Bruno; Nuchadee, Marie-Laure – Research in Developmental Disabilities: A Multidisciplinary Journal, 2010
Standardized tests are widely used in intellectual disability research, either as dependent or control variables. Yet, it is not certain that their items give rise to the same performance in various groups under study. In the present work, 48 participants with Down syndrome were matched on their raw score on Raven's Colored Progressive Matrices…
Descriptors: Test Items, Standardized Tests, Down Syndrome, Item Analysis
Meert, Gaelle; Gregoire, Jacques; Noel, Marie-Pascale – Journal of Experimental Child Psychology, 2010
This study tested whether 10- and 12-year-olds who can correctly compare the magnitudes of fractions with common components access the magnitudes of the whole fractions rather than only compare the magnitudes of their components. Time for comparing two fractions was predicted by the numerical distance between the whole fractions, suggesting an…
Descriptors: Numbers, Cognitive Processes, Test Items, Comparative Analysis
Panjaburee, Patcharin; Hwang, Gwo-Jen; Triampo, Wannapong; Shih, Bo-Ying – Computers & Education, 2010
With the popularization of computer and communication technologies, researchers have attempted to develop computer-assisted testing and diagnostic systems to help students improve their learning performance on the Internet. In developing a diagnostic system for detecting students' learning problems, it is difficult for individual teachers to…
Descriptors: Learning Problems, Test Items, Testing, Teaching Methods
Botzer, Assaf; Meyer, Joachim; Bak, Peter; Parmet, Yisrael – Journal of Experimental Psychology: Applied, 2010
The output of binary cuing systems, such as alerts or alarms, depends on the threshold setting--a parameter that is often user-adjustable. However, it is unknown if users are able to adequately adjust thresholds and what information may help them to do so. Two experiments tested threshold settings for a binary classification task based on binary…
Descriptors: Experimental Groups, Cues, Test Items, Probability
Davey, Tim; Lee, Yi-Hsuan – ETS Research Report Series, 2011
Both theoretical and practical considerations have led the revision of the Graduate Record Examinations® (GRE®) revised General Test, here called the rGRE, to adopt a multistage adaptive design that will be continuously or nearly continuously administered and that can provide immediate score reporting. These circumstances sharply constrain the…
Descriptors: Context Effect, Scoring, Equated Scores, College Entrance Examinations
Wickett, Maryann; Hendrix-Martin, Eunice – Stenhouse Publishers, 2011
Multiple-choice testing is an educational reality. Rather than complain about the negative impact these tests may have on teaching and learning, why not use them to better understand your students' true mathematical knowledge and comprehension? Maryann Wickett and Eunice Hendrix-Martin show teachers how to move beyond the student's answer--right…
Descriptors: Mathematics Education, Multiple Choice Tests, Grade 2, Grade 3
Tian, Feng – ProQuest LLC, 2011
There has been a steady increase in the use of mixed-format tests, that is, tests consisting of both multiple-choice items and constructed-response items in both classroom and large-scale assessments. This calls for appropriate equating methods for such tests. As Item Response Theory (IRT) has rapidly become mainstream as the theoretical basis for…
Descriptors: Item Response Theory, Comparative Analysis, Equated Scores, Statistical Analysis
Thompson, Nathan A. – Practical Assessment, Research & Evaluation, 2011
Computerized classification testing (CCT) is an approach to designing tests with intelligent algorithms, similar to adaptive testing, but specifically designed for the purpose of classifying examinees into categories such as "pass" and "fail." Like adaptive testing for point estimation of ability, the key component is the…
Descriptors: Adaptive Testing, Computer Assisted Testing, Classification, Probability
Day, James; Bonn, Doug – Physical Review Special Topics - Physics Education Research, 2011
The Concise Data Processing Assessment (CDPA) was developed to probe student abilities related to the nature of measurement and uncertainty and to handling data. The diagnostic is a ten-question, multiple-choice test that can be used as both a pre-test and post-test. A key component of the development process was interviews with students, which…
Descriptors: Multiple Choice Tests, Test Reliability, Physics, Item Analysis
Wang, Changjiang; Gierl, Mark J. – Journal of Educational Measurement, 2011
The purpose of this study is to apply the attribute hierarchy method (AHM) to a subset of SAT critical reading items and illustrate how the method can be used to promote cognitive diagnostic inferences. The AHM is a psychometric procedure for classifying examinees' test item responses into a set of attribute mastery patterns associated with…
Descriptors: Reading Comprehension, Test Items, Critical Reading, Protocol Analysis
Gao, Lingyun; Rogers, W. Todd – Language Testing, 2011
The purpose of this study was to explore whether the results of Tree Based Regression (TBR) analyses, informed by a validated cognitive model, would enhance the interpretation of item difficulties in terms of the cognitive processes involved in answering the reading items included in two forms of the Michigan English Language Assessment Battery…
Descriptors: Test Items, Reading Tests, Item Analysis, Reading Processes
Lowrie, Tom; Diezmann, Carmel M.; Kay, Russell – Evaluation & Research in Education, 2011
The graphics-decoding proficiency (G-DP) instrument was developed as a screening test for the purpose of measuring students' (aged 8-11 years) capacity to solve graphics-based mathematics tasks. These tasks include number lines, column graphs, maps and pie charts. The instrument was developed within a theoretical framework which highlights the…
Descriptors: Screening Tests, Mathematics Achievement, Mathematical Aptitude, Graphs
Wang, Wen-Chung; Huang, Sheng-Yun – Educational and Psychological Measurement, 2011
The one-parameter logistic model with ability-based guessing (1PL-AG) has been recently developed to account for the effect of ability on guessing behavior in multiple-choice items. In this study, the authors developed algorithms for computerized classification testing under the 1PL-AG and conducted a series of simulations to evaluate their…
Descriptors: Computer Assisted Testing, Classification, Item Analysis, Probability