Publication Date
| In 2026 | 0 |
| Since 2025 | 215 |
| Since 2022 (last 5 years) | 1084 |
| Since 2017 (last 10 years) | 2594 |
| Since 2007 (last 20 years) | 4955 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
| Practitioners | 653 |
| Teachers | 563 |
| Researchers | 250 |
| Students | 201 |
| Administrators | 81 |
| Policymakers | 22 |
| Parents | 17 |
| Counselors | 8 |
| Community | 7 |
| Support Staff | 3 |
| Media Staff | 1 |
| More ▼ | |
Location
| Turkey | 226 |
| Canada | 223 |
| Australia | 155 |
| Germany | 116 |
| United States | 99 |
| China | 90 |
| Florida | 86 |
| Indonesia | 82 |
| Taiwan | 78 |
| United Kingdom | 73 |
| California | 66 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 4 |
| Meets WWC Standards with or without Reservations | 4 |
| Does not meet standards | 1 |
Shivabasappa, Prarthana; Peña, Elizabeth Z.; Bedore, Lisa M. – Journal of Speech, Language, and Hearing Research, 2017
Purpose: The study examines the typicality effect in Spanish-English bilingual children and adults in their 2 languages. Method: Two studies were conducted using a category-generation task to compare the typical items generated by children with those generated by adults. Children in the 1st study differed orthogonally with respect to age (older,…
Descriptors: Bilingualism, Spanish, English, Adults
Batsell, W. Robert, Jr.; Perry, Jennifer L.; Hanley, Elizabeth; Hostetter, Autumn B. – Teaching of Psychology, 2017
The testing effect is the enhanced retention of learned information by individuals who have studied and completed a test over the material relative to individuals who have only studied the material. Although numerous laboratory studies and simulated classroom studies have provided evidence of the testing effect, data from a natural class setting…
Descriptors: Tests, Psychology, Introductory Courses, Quasiexperimental Design
Kieftenbeld, Vincent; Boyer, Michelle – Applied Measurement in Education, 2017
Automated scoring systems are typically evaluated by comparing the performance of a single automated rater item-by-item to human raters. This presents a challenge when the performance of multiple raters needs to be compared across multiple items. Rankings could depend on specifics of the ranking procedure; observed differences could be due to…
Descriptors: Automation, Scoring, Comparative Analysis, Test Items
Liu, Ren; Huggins-Manley, Anne Corinne; Bradshaw, Laine – Educational and Psychological Measurement, 2017
There is an increasing demand for assessments that can provide more fine-grained information about examinees. In response to the demand, diagnostic measurement provides students with feedback on their strengths and weaknesses on specific skills by classifying them into mastery or nonmastery attribute categories. These attributes often form a…
Descriptors: Matrices, Classification, Accuracy, Diagnostic Tests
Yasar, Metin – European Journal of Educational Sciences, 2017
In this study, a multiple choice test which is composed of 19 articles which is prepared as per the scope of lesson of Measurement and Evaluation in Education, has been applied as interim exam to 207 teacher candidates who are getting education at the Faculty of Education. The difficulty levels of items which are in the test have been calculated…
Descriptors: Test Items, Difficulty Level, Preservice Teachers, Teacher Education
Li, Dan; Benton, Stephen L. – IDEA Center, Inc., 2017
In the study evaluated in this report, the authors asked what effect survey length has on student non-response rates to individual items on IDEA's "Diagnostic Feedback" (DF) and "Learning Essentials" (LE) forms. The approach was to analyze individual student ratings of classes contained in the 2015-2016 IDEA-CL database.…
Descriptors: Response Rates (Questionnaires), Student Surveys, Test Length, Test Items
Sarwanto; Fajari, Laksmi Evasufi Widi; Chumdari – International Journal of Instruction, 2021
Critical thinking skills are the 21st-century life skills that are needed by students. However, in elementary schools, there are no instruments that are truly effective and efficient to measure critical thinking skills. This research aims to develop an open-ended question assessment instrument to measure students' critical-thinking skills, to test…
Descriptors: Critical Thinking, Thinking Skills, Teaching Methods, Questioning Techniques
Gareis, Christopher R.; McMillan, James H.; Smucker, Amelie; Huang, Ke – Online Submission, 2021
The purpose of this study was to gauge the degree to which selected NWEA MAP Growth assessments are aligned to the Virginia Standards of Learning (SOL) and the extent to which MAP Growth reports can be used by school divisions to gauge student achievement relative to grade level and to identify learning gaps. The study was delimited to four MAP…
Descriptors: Achievement Tests, Academic Standards, State Standards, Alignment (Education)
Mark L. Davison; David J. Weiss; Ozge Ersan; Joseph N. DeWeese; Gina Biancarosa; Patrick C. Kennedy – Grantee Submission, 2021
MOCCA is an online assessment of inferential reading comprehension for students in 3rd through 6th grades. It can be used to identify good readers and, for struggling readers, identify those who overly rely on either a Paraphrasing process or an Elaborating process when their comprehension is incorrect. Here a propensity to over-rely on…
Descriptors: Reading Tests, Computer Assisted Testing, Reading Comprehension, Elementary School Students
Chandra Shekar Karnati – ProQuest LLC, 2021
The purpose of this study was to examine the presence of gender and ELL Differential Item Functioning (DIF) in a teacher-created mathematics benchmark test in one public charter school district in Northeast Georgia. DIF occurs when an item behaves differently in different subgroups, rather than measuring a test taker's true ability. The geometry…
Descriptors: Mathematics Tests, Delphi Technique, Test Items, Test Construction
Ralston, Nicole C.; Li, Min; Taylor, Catherine – Educational Assessment, 2018
Elementary school students often exhibit a variety of conceptions associated with algebraic thinking that their teachers fail to recognize or understand. It is crucial that elementary school teachers possess knowledge of the variety of student conceptions and also have abilities to address varying states of conceptions. Otherwise, students who are…
Descriptors: Elementary School Students, Student Evaluation, Mathematics Tests, Test Construction
Kaplan, David; Su, Dan – Large-scale Assessments in Education, 2018
Background: This paper extends a recent study by Kaplan and Su ("J Educ Behav Stat" 41: 51-80, 2016) examining the problem of matrix sampling of context questionnaire scales with respect to the generation of plausible values of cognitive outcomes in large-scale assessments. Methods: Following Weirich et al. ("Nested multiple…
Descriptors: Questionnaires, Measurement, Measurement Techniques, Evaluation Methods
Yalçin, Seher – International Journal of Assessment Tools in Education, 2018
The purpose of this study is to determine the best IRT model [Rasch, 2PL, 3PL, 4PL and mixed IRT (2 and 3PL)] for the science and technology subtest of the Transition from Basic Education to Secondary Education (TEOG) exam, which is carried out at national level, it is also aimed to predict the item parameters under the best model. This study is a…
Descriptors: Item Response Theory, Models, Goodness of Fit, Multiple Choice Tests
Dempster, Edith R.; Kirby, Nicki F. – South African Journal of Education, 2018
Public perception of "declining standards" in school-leaving examinations often accompanies increases in pass rates in schoolleaving examinations. "Declining standards" to the public means easier examination papers. The present study evaluates a South African attempt to estimate the level of difficulty, as distinct from…
Descriptors: Foreign Countries, Interrater Reliability, Difficulty Level, Science Tests
Sinharay, Sandip – Grantee Submission, 2018
Tatsuoka (1984) suggested several extended caution indices and their standardized versions that have been used as person-fit statistics by researchers such as Drasgow, Levine, and McLaughlin (1987), Glas and Meijer (2003), and Molenaar and Hoijtink (1990). However, these indices are only defined for tests with dichotomous items. This paper extends…
Descriptors: Test Format, Goodness of Fit, Item Response Theory, Error Patterns

Peer reviewed
Direct link
