Publication Date
| In 2026 | 0 |
| Since 2025 | 197 |
| Since 2022 (last 5 years) | 1067 |
| Since 2017 (last 10 years) | 2577 |
| Since 2007 (last 20 years) | 4938 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
| Practitioners | 653 |
| Teachers | 563 |
| Researchers | 250 |
| Students | 201 |
| Administrators | 81 |
| Policymakers | 22 |
| Parents | 17 |
| Counselors | 8 |
| Community | 7 |
| Support Staff | 3 |
| Media Staff | 1 |
| More ▼ | |
Location
| Turkey | 225 |
| Canada | 223 |
| Australia | 155 |
| Germany | 116 |
| United States | 99 |
| China | 90 |
| Florida | 86 |
| Indonesia | 82 |
| Taiwan | 78 |
| United Kingdom | 73 |
| California | 65 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 4 |
| Meets WWC Standards with or without Reservations | 4 |
| Does not meet standards | 1 |
Pournara, Craig; Sanders, Yvonne – Africa Education Review, 2020
The transition from arithmetic to algebra is a well-known difficulty in school mathematics. In order to succeed, learners require inter alia a better understanding of algebraic symbols, equality, equations and working with negatives/subtraction. This article reports on a response pattern analysis (RPA) of learners' responses to six test items…
Descriptors: Foreign Countries, Arithmetic, Algebra, Equations (Mathematics)
Turhan, Nihan Sölpük – International Journal of Progressive Education, 2020
Measurement tools that are used in education are important factors that affect course success and motivation of students. This study aims to determine the opinions of high school students on different question types. As the subgoals of the research, the study aims to determine the reasons for multiple choice test preference and its effect on…
Descriptors: Test Items, Preferences, High School Students, Learning Motivation
Tijmstra, Jesper; Bolsinova, Maria; Liaw, Yuan-Ling; Rutkowski, Leslie; Rutkowski, David – Journal of Educational Measurement, 2020
Although the root-mean squared deviation (RMSD) is a popular statistical measure for evaluating country-specific item-level misfit (i.e., differential item functioning [DIF]) in international large-scale assessment, this paper shows that its sensitivity to detect misfit may depend strongly on the proficiency distribution of the considered…
Descriptors: Test Items, Goodness of Fit, Probability, Accuracy
Ezechukwu, Roseline Ifeoma; Chinecherem, Basil; Oguguo, E.; Ene, Catherine U.; Ugorji, Clifford O. – World Journal of Education, 2020
This study determined the psychometric properties of the Economics Achievement Test (EAT) using Item Response Theory (IRT). Two popular IRT models namely, one-parameter logistics (1PL) and two-parameter logistics (2PL) models were utilized. The researcher adopted instrumentation research design. Four research questions and two hypotheses were…
Descriptors: Economics Education, Economics, Achievement Tests, Psychometrics
Gasteiger, Hedwig; Bruns, Julia; Benz, Christiane; Brunner, Esther; Sprenger, Priska – ZDM: The International Journal on Mathematics Education, 2020
Measurement instruments of early childhood teachers' mathematical pedagogical content knowledge (MPCK) have to consider the special characteristics of early childhood teaching. Early childhood teaching includes some planned activities but in contrast to learning in school, it is often motivated and generated by situations which unfold…
Descriptors: Mathematics Instruction, Pedagogical Content Knowledge, Multiple Choice Tests, Kindergarten
Suciati; Munadi, Sudji; Sugiman; Febriyanti, Wiwin Dwi Ratna – European Journal of Educational Research, 2020
This study aims to design mathematical literacy instruments that have evidence of content and construct validity and are reliable for use as an assessment for learning. The research involved eight experts as instrument validators and 273 eighth-grade students of junior high school in Yogyakarta Province. The results showed that the ten…
Descriptors: Numeracy, Mathematics Tests, Test Construction, Test Validity
Jean-Yves Bégin; Luc Touchette; Caroline Couture; Cassandre Blais – International Journal of Nurture in Education, 2020
The Boxall Profile provides a framework for the structured observation of children in nurture groups. It is a detailed and rigorously trialled normative diagnostic instrument developed for teachers and teaching assistants to measure children's levels of emotional and behavioural functioning. Moreover, it highlights specific targets for…
Descriptors: Psychometrics, French, Observation, Children
Davison, Mark L. – Measurement: Interdisciplinary Research and Perspectives, 2016
The answer to the question, "Ability, speed, or both?" may be "both at once" if speed is simply a manifestation of ability. If differences in speed are manifestations of differences in ability, then both speed and ability may reflect a single dimension best characterized by a single score. While measurement of speed has proven…
Descriptors: Measurement, Ability, Reaction Time, Timed Tests
Irvin, P. Shawn – Behavioral Research and Teaching, 2016
The Distributed Item Review (DIR) is a secure and flexible, web-based system designed to present test items to expert reviewers across a broad geographic area for evaluation of important dimensions of quality (e.g., alignment with standards, bias, sensitivity, and student accessibility). The DIR is comprised of essential features that allow system…
Descriptors: Test Items, Test Reviews, Test Validity, Guides
Shoufan, Abdulhadi – IEEE Transactions on Education, 2017
The concept of intrinsic complexity explains why different problems of the same type, tackled by the same problem solver, can require different times to solve and yield solutions of different quality. This paper proposes a general four-step approach that can be used to establish a model for the intrinsic complexity of a problem class in terms of…
Descriptors: Test Items, Difficulty Level, Problem Solving, Models
Borsboom, Denny; Wijsen, Lisa D. – Assessment in Education: Principles, Policy & Practice, 2017
The central role of educational testing practices in contemporary societies can hardly be overstated. It is furthermore evident that psychometric models regulate, justify, and legitimize the processes through which educational testing practices are used. In this commentary, the authors offer some observations that may be relevant for the analyses…
Descriptors: Educational Assessment, Learning, Psychometrics, Power Structure
Sellbjer, Stefan – Assessment & Evaluation in Higher Education, 2017
Effective feedback presupposes that students understand the task on which feedback is given. But what about the teachers formulating and assessing the task? Do they always understand it as intended? And if so, feedback on what? The purpose of this study is to examine how university teachers individually understand tasks distributed to students.…
Descriptors: College Faculty, Comprehension, Student Evaluation, Feedback (Response)
Drabinová, Adéla; Martinková, Patrícia – Journal of Educational Measurement, 2017
In this article we present a general approach not relying on item response theory models (non-IRT) to detect differential item functioning (DIF) in dichotomous items with presence of guessing. The proposed nonlinear regression (NLR) procedure for DIF detection is an extension of method based on logistic regression. As a non-IRT approach, NLR can…
Descriptors: Test Items, Regression (Statistics), Guessing (Tests), Identification
Bryant, William – Practical Assessment, Research & Evaluation, 2017
As large-scale standardized tests move from paper-based to computer-based delivery, opportunities arise for test developers to make use of items beyond traditional selected and constructed response types. Technology-enhanced items (TEIs) have the potential to provide advantages over conventional items, including broadening construct measurement,…
Descriptors: Standardized Tests, Test Items, Computer Assisted Testing, Test Format
Esen, Ayse – ProQuest LLC, 2017
Detecting Differential Item Functioning (DIF) is an early step and very critical to investigate any possible bias between groups (e.g., males vs. females). Many early DIF studies only focused on two-group comparison. However, there are many cases where more than two groups exist: Cross-cultural studies are administered in many countries and any…
Descriptors: Test Bias, Cross Cultural Studies, Ethnicity, Error Patterns

Peer reviewed
Direct link
