Publication Date
| In 2026 | 0 |
| Since 2025 | 215 |
| Since 2022 (last 5 years) | 1084 |
| Since 2017 (last 10 years) | 2594 |
| Since 2007 (last 20 years) | 4955 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
| Practitioners | 653 |
| Teachers | 563 |
| Researchers | 250 |
| Students | 201 |
| Administrators | 81 |
| Policymakers | 22 |
| Parents | 17 |
| Counselors | 8 |
| Community | 7 |
| Support Staff | 3 |
| Media Staff | 1 |
| More ▼ | |
Location
| Turkey | 226 |
| Canada | 223 |
| Australia | 155 |
| Germany | 116 |
| United States | 99 |
| China | 90 |
| Florida | 86 |
| Indonesia | 82 |
| Taiwan | 78 |
| United Kingdom | 73 |
| California | 66 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 4 |
| Meets WWC Standards with or without Reservations | 4 |
| Does not meet standards | 1 |
Liu, Ou Lydia; Lee, Hee-Sun; Linn, Marcia C. – Educational Assessment, 2010
To improve student science achievement in the United States we need inquiry-based instruction that promotes coherent understanding and assessments that are aligned with the instruction. Instead, current textbooks often offer fragmented ideas and most assessments only tap recall of details. In this study we implemented 10 inquiry-based science…
Descriptors: Inquiry, Active Learning, Science Achievement, Science Instruction
Prowker, Adam; Camilli, Gregory – Journal of Educational Measurement, 2007
The central idea of differential item functioning (DIF) is to examine differences between two groups at the item level while controlling for overall proficiency. This approach is useful for examining hypotheses at a finer-grain level than are permitted by a total test score. The methodology proposed in this paper is also aimed at estimating…
Descriptors: Scores, Test Bias, Difficulty Level, Test Items
Rae, Gordon – Psychological Methods, 2007
The relationship between stratified alpha (alpha-sub(s)) and the reliability of a test composed of interrelated nonhomogeneous items is examined. It is mathematically demonstrated that when there is congeneric equivalence within the strata or subtests, the difference between the coefficients is a function of the variances of the loadings within…
Descriptors: Test Reliability, Test Items, Computation, Error of Measurement
Penfield, Randall D. – Journal of Educational Measurement, 2007
Many statistics used in the assessment of differential item functioning (DIF) in polytomous items yield a single item-level index of measurement invariance that collapses information across all response options of the polytomous item. Utilizing a single item-level index of DIF can, however, be misleading if the magnitude or direction of the DIF…
Descriptors: Simulation, Test Bias, Statistics, Test Items
Vincent, Juliet – ProQuest LLC, 2009
Student underachievement on standardized math achievement tests is a major concern in American public schools. One of the speculated reasons for student underachievement is the inability to solve math word problems. Word problems are the most challenging problems in math because word problem solving requires the use of skills in language,…
Descriptors: Test Items, Underachievement, Word Problems (Mathematics), Problem Solving
Lavy, Victor; Silva, Olmo; Weinhardt, Felix – National Bureau of Economic Research, 2009
We study the scale and nature of ability peer effects in secondary schools in England. In order to shed light on the nature of these effects, we investigate which segments of the peer ability distribution drive the impact of peer quality on students' achievements. Additionally, we study which quantiles of the pupil ability distribution are…
Descriptors: Test Items, Females, Measures (Individuals), Foreign Countries
Lowrie, Tom; Diezmann, Carmel M. – Australian Journal of Education, 2009
Mandatory numeracy tests have become commonplace in many countries, heralding a new era in school assessment. New forms of accountability and an increased emphasis on national and international standards (and benchmarks) have the potential to reshape mathematics curricula. It is noteworthy that the mathematics items used in these tests are rich in…
Descriptors: Testing Programs, Numeracy, Foreign Countries, Standardized Tests
Lee, Yong-Won; Sawaki, Yasuyo – Language Assessment Quarterly, 2009
The present study investigated the functioning of three psychometric models for cognitive diagnosis--the general diagnostic model, the fusion model, and latent class analysis--when applied to large-scale English as a second language listening and reading comprehension assessments. Data used in this study were scored item responses and incidence…
Descriptors: Reading Comprehension, Field Tests, Identification, Classification
Kettler, Ryan J.; Elliott, Stephen N.; Beddow, Peter A. – Peabody Journal of Education, 2009
Federal regulations allow up to 2% of the student population of a state to achieve proficiency for adequate yearly progress by taking an alternate assessment based on modified academic achievement standards (AA-MAS). Such tests are likely to be easier, but as long as a test is considered a valid measure of grade level content, it is allowable as…
Descriptors: Test Items, Alternative Assessment, Academic Achievement, Test Validity
Weigert, Susan – Peabody Journal of Education, 2009
In this commentary on the "Peabody Journal of Education" special edition, the author addresses implications of the contributing articles to three central domains of interest to states engaged in or considering the development of an alternate assessment on modified academic achievement standards: (a) identifying an eligible student population, (b)…
Descriptors: Test Items, Eligibility, Alternative Assessment, Academic Achievement
Calik, Muammer; Ayas, Alipasa; Coll, Richard K. – International Journal of Science and Mathematics Education, 2009
This paper reports on an investigation on the use of an analogy activity and seeks to provide evidence of whether the activity enables students to change alternative conceptions towards views more in accord with scientific views for aspects of solution chemistry. We were also interested in how robust any change was and whether these changes in…
Descriptors: Test Items, Chemistry, Long Term Memory, Foreign Countries
Jordan, Sally; Mitchell, Tom – British Journal of Educational Technology, 2009
A natural language based system has been used to author and mark short-answer free-text assessment tasks. Students attempt the questions online and are given tailored and relatively detailed feedback on incorrect and incomplete responses, and have the opportunity to repeat the task immediately so as to learn from the feedback provided. The answer…
Descriptors: Feedback (Response), Test Items, Natural Language Processing, Teaching Methods
Siddiek, Ahmed Gumaa – English Language Teaching, 2010
Examinations--among other things--are tools of quality control by which we can measure the attainment of the national educational goals. High-quality examinations are means of evaluation that can help teachers modify their teaching techniques, as well as helping learners adjust their learning strategies. Examinations are also benchmarks that can…
Descriptors: Foreign Countries, Student Certification, Questionnaires, Test Validity
Rakes, Christopher R. – ProQuest LLC, 2010
In this study, the author examined the relationship of probability misconceptions to algebra, geometry, and rational number misconceptions and investigated the potential of probability instruction as an intervention to address misconceptions in all 4 content areas. Through a review of literature, 5 fundamental concepts were identified that, if…
Descriptors: Control Groups, Fundamental Concepts, Intervention, Structural Equation Models
Sawchuk, Stephen – Education Digest: Essential Readings Condensed for Quick Review, 2010
Most experts in the testing community have presumed that the $350 million promised by the U.S. Department of Education to support common assessments would promote those that made greater use of open-ended items capable of measuring higher-order critical-thinking skills. But as measurement experts consider the multitude of possibilities for an…
Descriptors: Educational Quality, Test Items, Comparative Analysis, Multiple Choice Tests

Peer reviewed
Direct link
