Publication Date
In 2025 | 100 |
Since 2024 | 349 |
Since 2021 (last 5 years) | 1302 |
Since 2016 (last 10 years) | 2767 |
Since 2006 (last 20 years) | 4995 |
Audience
Practitioners | 653 |
Teachers | 561 |
Researchers | 250 |
Students | 201 |
Administrators | 80 |
Policymakers | 22 |
Parents | 17 |
Counselors | 8 |
Community | 7 |
Support Staff | 3 |
Media Staff | 1 |
Location
Canada | 223 |
Turkey | 222 |
Australia | 155 |
Germany | 114 |
United States | 97 |
China | 86 |
Florida | 86 |
Taiwan | 75 |
Indonesia | 74 |
United Kingdom | 71 |
California | 65 |
What Works Clearinghouse Rating
Meets WWC Standards without Reservations | 4 |
Meets WWC Standards with or without Reservations | 4 |
Does not meet standards | 1 |
Mingjia Ma – ProQuest LLC, 2023
Response time is an important research topic in psychometrics. This dissertation explores response time properties across several item and examinee characteristics, as well as the interactions between response time and response outcomes, using data from a statewide mathematics assessment in two grades.…
Descriptors: Reaction Time, Mathematics Tests, Standardized Tests, State Standards
Ayfer Sayin; Mark J. Gierl – International Journal of Assessment Tools in Education, 2023
Developments in the field of education have significantly affected test development processes, and many institutions have begun using computer-based tests. In our country, research on applying measurement and evaluation tools in computer environments for distance education is gaining momentum. A large pool of…
Descriptors: Turkish, Literature, Test Items, Item Banks
Zhang, Susu; Li, Anqi; Wang, Shiyu – Educational Measurement: Issues and Practice, 2023
In computer-based tests allowing revision and reviews, examinees' sequence of visits and answer changes to questions can be recorded. The variable-length revision log data introduce new complexities to the collected data but, at the same time, provide additional information on examinees' test-taking behavior, which can inform test development and…
Descriptors: Computer Assisted Testing, Test Construction, Test Wiseness, Test Items
van der Linden, Wim J. – Journal of Educational and Behavioral Statistics, 2022
The current literature on test equating generally defines it as the process necessary to obtain score comparability between different test forms. This definition contrasts with Lord's foundational paper, which viewed equating as the process required to obtain comparability of measurement scale between forms. The distinction between the notions…
Descriptors: Equated Scores, Test Items, Scores, Probability
Nedungadi, Sachin; Rinco Michels, Olga; Kreke, Patricia J.; Raker, Jeffrey R.; Murphy, Kristen L. – Journal of Chemical Education, 2022
Practice examinations developed at the ACS Examinations Institute ask students to self-report mental effort when answering items. This self-reported mental effort together with performance can be represented in the form of a cognitive efficiency graph for each student giving information on the utilization of cognitive resources and content…
Descriptors: Cognitive Processes, Science Tests, Test Items, Difficulty Level
Michelle Cheong – Journal of Computer Assisted Learning, 2025
Background: Students are increasingly using ChatGPT to assist them in learning and even in completing their assessments, raising concerns about academic integrity and the loss of critical thinking skills. Many articles have suggested that educators redesign assessments to be more 'Generative-AI-resistant' and focus on assessing students on higher order…
Descriptors: Artificial Intelligence, Performance Based Assessment, Spreadsheets, Models
Paul Kim; Wilson Wang; Curtis J. Bonk – Journal of Educational Computing Research, 2025
Following the launch of the generative AI Web application, Ask.SMILE, designed to evaluate the cognitive levels of questions asked, 2,559 educators generated 25,973 question-feedback sets over a three-month period, with an average of over 10 questions per participant. Analyses revealed a significant improvement in question quality from initial…
Descriptors: Artificial Intelligence, Technology Uses in Education, Test Wiseness, Test Items
Neda Kianinezhad; Mohsen Kianinezhad – Language Education & Assessment, 2025
This study presents a comparative analysis of classical reliability measures, including Cronbach's alpha, test-retest, and parallel forms reliability, alongside modern psychometric methods such as the Rasch model and Mokken scaling, to evaluate the reliability of C-tests in language proficiency assessment. Utilizing data from 150 participants…
Descriptors: Psychometrics, Test Reliability, Language Proficiency, Language Tests
Jonathan Seiden – Annenberg Institute for School Reform at Brown University, 2025
Direct assessments of early childhood development (ECD) are a cornerstone of research in developmental psychology and are increasingly used to evaluate programs and policies in lower- and middle-income countries. Despite strong psychometric properties, these assessments are too expensive and time consuming for use in large-scale monitoring or…
Descriptors: Young Children, Child Development, Performance Based Assessment, Developmental Psychology
Lars Andersson Hult; Anders Persson – Journal of Social Science Education, 2025
Purpose: This article's purpose is to examine the manifestations of the evolving modern society and what we now identify as civics or other contemporary social issues in the final examination questions from 1914 to 1937 at four teacher education institutions in Uppsala, Falun, Lund, and Landskrona. Design/methodology/approach: The method can be…
Descriptors: Civics, Tests, Preservice Teacher Education, Test Items
Stephanie M. Bell; R. Philip Chalmers; David B. Flora – Educational and Psychological Measurement, 2024
Coefficient omega indices are model-based composite reliability estimates that have become increasingly popular. A coefficient omega index estimates how reliably an observed composite score measures a target construct as represented by a factor in a factor-analysis model; as such, the accuracy of omega estimates is likely to depend on correct…
Descriptors: Influences, Models, Measurement Techniques, Reliability
David G. Schreurs; Jaclyn M. Trate; Shalini Srinivasan; Melonie A. Teichert; Cynthia J. Luxford; Jamie L. Schneider; Kristen L. Murphy – Chemistry Education Research and Practice, 2024
Given how widespread multiple-choice assessments already are and the increasing popularity of answer-until-correct formats, it is important to have methods available for exploring the validity of these types of assessments as they are developed. This work analyzes a 20-question multiple-choice assessment covering introductory undergraduate chemistry…
Descriptors: Multiple Choice Tests, Test Validity, Introductory Courses, Science Tests
Hwanggyu Lim; Kyung T. Han – Educational Measurement: Issues and Practice, 2024
Computerized adaptive testing (CAT) has gained deserved popularity in the administration of educational and professional assessments, but continues to face test security challenges. To ensure sustained quality assurance and testing integrity, it is imperative to establish and maintain multiple stable item pools that are consistent in terms of…
Descriptors: Computer Assisted Testing, Adaptive Testing, Test Items, Item Banks
Zachary K. Collier; Minji Kong; Olushola Soyoye; Kamal Chawla; Ann M. Aviles; Yasser Payne – Journal of Educational and Behavioral Statistics, 2024
Asymmetric Likert-type items in research studies can present several challenges in data analysis, particularly concerning missing data. These items are often characterized by a skewed scaling, where either there is no neutral response option or an unequal number of possible positive and negative responses. The use of conventional techniques, such…
Descriptors: Likert Scales, Test Items, Item Analysis, Evaluation Methods
Youmi Suk; Kyung T. Han – Journal of Educational and Behavioral Statistics, 2024
As algorithmic decision making is increasingly deployed in every walk of life, many researchers have raised concerns about fairness-related bias from such algorithms. But there is little research on harnessing psychometric methods to uncover potential discriminatory bias inside decision-making algorithms. The main goal of this article is to…
Descriptors: Psychometrics, Ethics, Decision Making, Algorithms