Séverin Lions; María Paz Blanco; Pablo Dartnell; Carlos Monsalve; Gabriel Ortega; Julie Lemarié – Applied Measurement in Education, 2024
Multiple-choice items are universally used in formal education. Since they should assess learning, not test-wiseness or guesswork, they must be constructed following the highest possible standards. Hundreds of item-writing guides have provided guidelines to help test developers adopt appropriate strategies to define the distribution and sequence…
Descriptors: Test Construction, Multiple Choice Tests, Guidelines, Test Items
Ato Kwamina Arhin – Acta Educationis Generalis, 2024
Introduction: This article aimed at digging deep into distractors used for mathematics multiple-choice items. The quality of distractors may be more important than their number and the stem in a multiple-choice question. Little attention is given to this aspect of item writing, especially for mathematics multiple-choice questions. This article…
Descriptors: Testing, Multiple Choice Tests, Test Items, Mathematics Tests
Virginia A. Ressa; Sheryl S. Lazarus; Christopher M. Rogers; Kascinda Fleming; Mari Quanbeck – National Center on Educational Outcomes, 2024
Research on test accommodations provides valuable information that informs policy and practice. This report presents a synthesis of the research literature published in 2022 on testing accommodations for U.S. elementary and secondary students (K-12). The National Center on Educational Outcomes (NCEO) has reported on accommodations research since…
Descriptors: Elementary Secondary Education, Testing Accommodations, Academic Accommodations (Disabilities), Teacher Attitudes
Lions, Séverin; Monsalve, Carlos; Dartnell, Pablo; Blanco, María Paz; Ortega, Gabriel; Lemarié, Julie – Applied Measurement in Education, 2022
Multiple-choice tests are widely used in education, often for high-stakes assessment purposes. Consequently, these tests should be constructed following the highest standards. Many efforts have been undertaken to advance item-writing guidelines intended to improve tests. One important issue is the unwanted effects of the options' position on test…
Descriptors: Multiple Choice Tests, High Stakes Tests, Test Construction, Guidelines
Eray Selçuk; Ergül Demir – International Journal of Assessment Tools in Education, 2024
This research aims to compare the ability and item parameter estimates of Item Response Theory under maximum likelihood and Bayesian approaches in different Monte Carlo simulation conditions. For this purpose, depending on changes in the prior distribution type, sample size, test length, and logistic model, the ability and item…
Descriptors: Item Response Theory, Item Analysis, Test Items, Simulation
Christian X. Navarro-Cota; Ana I. Molina; Miguel A. Redondo; Carmen Lacave – IEEE Transactions on Education, 2024
Contribution: This article describes the process used to create a questionnaire to evaluate the usability of mobile learning applications (CECAM). The questionnaire includes specific questions to assess user interface usability and pedagogical usability. Background: Nowadays, mobile applications are expanding rapidly and are commonly used in…
Descriptors: Usability, Questionnaires, Electronic Learning, Computer Oriented Programs
Dongkwang Shin; Jang Ho Lee – ELT Journal, 2024
Although automated item generation has gained a considerable amount of attention in a variety of fields, it is still a relatively new technology in ELT contexts. Therefore, the present article aims to provide an accessible introduction to this powerful resource for language teachers based on a review of the available research. Particularly, it…
Descriptors: Language Tests, Artificial Intelligence, Test Items, Automation
Peter A. Edelsbrunner; Bianca A. Simonsmeier; Michael Schneider – Educational Psychology Review, 2025
Knowledge is an important predictor and outcome of learning and development. Its measurement is challenged by the fact that knowledge can be integrated and homogeneous, or fragmented and heterogeneous, which can change through learning. These characteristics of knowledge are at odds with current standards for test development, demanding a high…
Descriptors: Meta Analysis, Predictor Variables, Learning Processes, Knowledge Level
Daniel M. Settlage; Jim R. Wollscheid – Journal of the Scholarship of Teaching and Learning, 2024
The examination of the testing mode effect has received increased attention as higher education has shifted to remote testing during the COVID-19 pandemic. We believe the testing mode effect consists of four components: the ability to physically write on the test, the method of answer recording, the proctoring/testing environment, and the effect…
Descriptors: College Students, Macroeconomics, Tests, Answer Sheets
Sahin, Melek Gülsah; Yildirim, Yildiz; Boztunç Öztürk, Nagihan – Participatory Educational Research, 2023
Literature review shows that the development process of an achievement test is mainly investigated in dissertations. Moreover, preparing a form that will shed light on developing an achievement test is expected to guide those who will administer the test. In this line, the current study aims to create an "Achievement Test Development Process…
Descriptors: Achievement Tests, Test Construction, Records (Forms), Mathematics Achievement
Samah AlKhuzaey; Floriana Grasso; Terry R. Payne; Valentina Tamma – International Journal of Artificial Intelligence in Education, 2024
Designing and constructing pedagogical tests that contain items (i.e. questions) which measure various types of skills for different levels of students equitably is a challenging task. Teachers and item writers alike need to ensure that the quality of assessment materials is consistent, if student evaluations are to be objective and effective.…
Descriptors: Test Items, Test Construction, Difficulty Level, Prediction
Lei Jiang; Na Yu – Education and Information Technologies, 2024
This research aims to address the challenges of digital transformation in education by understanding the digital competence of teachers through a mixed-methods approach. The grounded theory is employed to develop the Teachers' Digital Competence Model (TDCM), which is structured around three facets: development, pedagogy, and ethics. Within these…
Descriptors: Educational Technology, Teacher Competencies, Technological Literacy, Ethics
Mattar, João; Ramos, Daniela Karine; Lucas, Margarida Rocha – Education and Information Technologies, 2022
The purpose of this article is to compare digital competence assessment instruments based on DigComp related frameworks. The study aims to answer four questions: (a) What types of instruments based on these frameworks are available? (b) How were these instruments created from these frameworks? (c) What procedures were used to guarantee the…
Descriptors: Evaluation Methods, Literature Reviews, Test Construction, Competence
Hongwen Guo; Matthew S. Johnson; Daniel F. McCaffrey; Lixong Gu – ETS Research Report Series, 2024
The multistage testing (MST) design has been gaining attention and popularity in educational assessments. For testing programs that have small test-taker samples, it is challenging to calibrate new items to replenish the item pool. In the current research, we used the item pools from an operational MST program to illustrate how research studies…
Descriptors: Test Items, Test Construction, Sample Size, Scaling
Tatiana Chaiban; Zeinab Nahle; Ghaith Assi; Michelle Cherfane – Discover Education, 2024
Background: Since it was first launched, ChatGPT, a Large Language Model (LLM), has been widely used across different disciplines, particularly the medical field. Objective: The main aim of this review is to thoroughly assess the performance of the distinct version of ChatGPT in subspecialty written medical proficiency exams and the factors that…
Descriptors: Medical Education, Accuracy, Artificial Intelligence, Computer Software