Publication Date
| In 2026 | 0 |
| Since 2025 | 220 |
| Since 2022 (last 5 years) | 1089 |
| Since 2017 (last 10 years) | 2599 |
| Since 2007 (last 20 years) | 4960 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
| Practitioners | 653 |
| Teachers | 563 |
| Researchers | 250 |
| Students | 201 |
| Administrators | 81 |
| Policymakers | 22 |
| Parents | 17 |
| Counselors | 8 |
| Community | 7 |
| Support Staff | 3 |
| Media Staff | 1 |
| More ▼ | |
Location
| Turkey | 226 |
| Canada | 223 |
| Australia | 155 |
| Germany | 116 |
| United States | 99 |
| China | 90 |
| Florida | 86 |
| Indonesia | 82 |
| Taiwan | 78 |
| United Kingdom | 73 |
| California | 66 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 4 |
| Meets WWC Standards with or without Reservations | 4 |
| Does not meet standards | 1 |
Ceuppens, Stijn; Deprez, Johan; Dehaene, Wim; De Cock, Mieke – Physical Review Physics Education Research, 2018
This study reports on the development, validation, and administration of a 48-item multiple-choice test to assess students' representational fluency of linear functions in a physics context (1D kinematics) and a mathematics context. The test includes three external representations: graphs, tables, and formulas, which result in six possible…
Descriptors: Secondary School Students, Mathematics Tests, Test Construction, Foreign Countries
Bagriacik Yilmaz, Ayse; Karatas, Serçin – Interactive Learning Environments, 2018
The aim of this study was to develop a measurement instrument which is compatible with literature, of which validity and reliability are proved with the aim of determining interaction perceived by learners in online learning environments. Accordingly, literature review was made, and outline form of the scale was formed with item pool by taking 14…
Descriptors: Foreign Countries, College Students, Likert Scales, Computer Mediated Communication
Bramley, Tom – Cambridge Assessment, 2018
The aim of the research reported here was to get some idea of the accuracy of grade boundaries (cut-scores) obtained by applying the 'similar items method' described in Bramley & Wilson (2016). In this method experts identify items on the current version of a test that are sufficiently similar to items on previous versions for them to be…
Descriptors: Accuracy, Cutting Scores, Test Items, Item Analysis
Chen, Haiwen; Livingston, Samuel A. – ETS Research Report Series, 2013
This paper presents a new equating method for the nonequivalent groups with anchor test design: poststratification equating based on true anchor scores. The linear version of this method is shown to be equivalent, under certain conditions, to Levine observed score equating, in the same way that the linear version of poststratification equating is…
Descriptors: Equated Scores, Test Items, Methods
Lu, Ru; Haberman, Shelby; Guo, Hongwen; Liu, Jinghua – ETS Research Report Series, 2015
In this study, we apply jackknifing to anchor items to evaluate the impact of anchor selection on equating stability. In an ideal world, the choice of anchor items should have little impact on equating results. When this ideal does not correspond to reality, selection of anchor items can strongly influence equating results. This influence does not…
Descriptors: Test Construction, Equated Scores, Test Items, Sampling
Schmitz, Florian; Wilhelm, Oliver – Measurement: Interdisciplinary Research and Perspectives, 2015
The excellent paper by Goldhammer (this issue) deals with a most relevant and very pervasive problem of ability assessment: the evaluation of performance by considering speed and accuracy of performance. Goldhammer proposes item-level time limits as a possible remedy for individual differences in the speed-accuracy trade-off (SATO) to keep time…
Descriptors: Ability, Reaction Time, Accuracy, Performance
Andrich, David; Hagquist, Curt – Educational and Psychological Measurement, 2015
Differential item functioning (DIF) for an item between two groups is present if, for the same person location on a variable, persons from different groups have different expected values for their responses. Applying only to dichotomously scored items in the popular Mantel-Haenszel (MH) method for detecting DIF in which persons are classified by…
Descriptors: Test Bias, Test Items, Item Response Theory, Statistical Analysis
Tay-lim, Brenda Siok-Hoon; Zhang, Jinming – Applied Measurement in Education, 2015
To ensure the statistical result validity, model-data fit must be evaluated for each item. In practice, certain actions or treatments are needed for misfit items. If all misfit items are treated, much item information would be lost during calibration. On the other hand, if only severely misfit items are treated, the inclusion of misfit items may…
Descriptors: Test Items, Goodness of Fit, Classification, Item Response Theory
Crisp, Victoria – Educational Studies, 2015
This research investigated the difficulty of examination questions for students with weaker reading skills. Item level performance data were obtained for all candidates who took a maths examination (for 16 year olds). A sub-group of students who had access to a reader was identified (students with proven reading difficulties are permitted to have…
Descriptors: Test Items, Difficulty Level, Mathematics Tests, Reading Difficulties
Briggs, Derek C.; Dadey, Nathan – Educational Assessment, 2015
This study focuses on an instance in which the mean grade-to-grade scale scores on a vertical scale showed evidence of common test items that do not get easier from one grade to the next. The issue was examined as part of a 2-day workshop in which participants were asked to predict the growth on all linking items used in the construction of…
Descriptors: Test Items, Grading, Scores, Scaling
Klein, Ariel; Badia, Toni – Journal of Creative Behavior, 2015
In this study we show how complex creative relations can arise from fairly frequent semantic relations observed in everyday language. By doing this, we reflect on some key cognitive aspects of linguistic and general creativity. In our experimentation, we automated the process of solving a battery of Remote Associates Test tasks. By applying…
Descriptors: Language Usage, Semantics, Natural Language Processing, Test Items
Seipel, Ben; Biancarosa, Gina; Carlson, Sarah; Davison, Mark – Society for Research on Educational Effectiveness, 2015
Previous research has established two types of struggling readers: those who struggle with lower-level reading skills and those who struggle with higher-level reading skills (Cain & Oakhill, 2006; Perfetti, 2007). The latter group is commonly termed "poor comprehenders": readers who exhibit poor comprehension compared to peers with…
Descriptors: Reading Comprehension, Reading Tests, Cloze Procedure, Multiple Choice Tests
Scott, Terry F.; Schumayer, Dániel – Physical Review Physics Education Research, 2017
The Force Concept Inventory is one of the most popular and most analyzed multiple-choice concept tests used to investigate students' understanding of Newtonian mechanics. The correct answers poll a set of underlying Newtonian concepts and the coherence of these underlying concepts has been found in the data. However, this inventory was constructed…
Descriptors: World Views, Scientific Concepts, Scientific Principles, Multiple Choice Tests
Shangraw, Rebecca – Strategies: A Journal for Physical and Sport Educators, 2017
The Domain Five Observation Instrument (DFOI) is a competency-based observation instrument recommended for sport leaders or researchers who wish to evaluate coaches' instructional behaviors. The DFOI includes 10 behavior categories and four timed categories that encompass 34 observable instructional benchmarks outlined in domain five of the…
Descriptors: Competency Based Teacher Education, Coaching (Performance), Evaluation Methods, Teacher Behavior
Çakir, Sinan – Journal of Language and Linguistic Studies, 2017
The present study is a follow-up study of Çakir (2016b) which focused on the wh-adverbial & which NP constructions asymmetry within island structures in Turkish. The characteristics of wh-adverbial nasil "how" is compared with the which-NP constructions "hangisekilde" "in what way" and "hangihalde"…
Descriptors: Nouns, Phrase Structure, Grammar, Turkish

Peer reviewed
Direct link
