Publication Date
In 2025 | 0 |
Since 2024 | 1 |
Since 2021 (last 5 years) | 5 |
Since 2016 (last 10 years) | 7 |
Since 2006 (last 20 years) | 9 |
Descriptor
Computer Software | 19 |
Item Analysis | 19 |
Test Construction | 19 |
Test Items | 12 |
Computer Assisted Testing | 7 |
Difficulty Level | 6 |
Evaluation Methods | 5 |
Foreign Countries | 5 |
Language Tests | 4 |
Multiple Choice Tests | 4 |
Scoring | 4 |
More ▼ |
Source
Author
Publication Type
Journal Articles | 14 |
Reports - Research | 8 |
Reports - Descriptive | 3 |
Reports - Evaluative | 3 |
Speeches/Meeting Papers | 3 |
Information Analyses | 2 |
Computer Programs | 1 |
Guides - Non-Classroom | 1 |
Opinion Papers | 1 |
Education Level
Secondary Education | 2 |
Higher Education | 1 |
Audience
Researchers | 2 |
Laws, Policies, & Programs
Assessments and Surveys
International English… | 2 |
Test of English as a Foreign… | 1 |
What Works Clearinghouse Rating
Valentina Albano; Donatella Firmani; Luigi Laura; Jerin George Mathew; Anna Lucia Paoletti; Irene Torrente – Journal of Learning Analytics, 2023
Multiple-choice questions (MCQs) are widely used in educational assessments and professional certification exams. Managing large repositories of MCQs, however, poses several challenges due to the high volume of questions and the need to maintain their quality and relevance over time. One of these challenges is the presence of questions that…
Descriptors: Natural Language Processing, Multiple Choice Tests, Test Items, Item Analysis
Kyeng Gea Lee; Mark J. Lee; Soo Jung Lee – International Journal of Technology in Education and Science, 2024
Online assessment is an essential part of online education, and if conducted properly, has been found to effectively gauge student learning. Generally, textbased questions have been the cornerstone of online assessment. Recently, however, the emergence of generative artificial intelligence has added a significant challenge to the integrity of…
Descriptors: Artificial Intelligence, Computer Software, Biology, Science Instruction
Zhang, Lishan; VanLehn, Kurt – Interactive Learning Environments, 2021
Despite their drawback, multiple-choice questions are an enduring feature in instruction because they can be answered more rapidly than open response questions and they are easily scored. However, it can be difficult to generate good incorrect choices (called "distractors"). We designed an algorithm to generate distractors from a…
Descriptors: Semantics, Networks, Multiple Choice Tests, Teaching Methods
Mohammed, Aisha; Dawood, Abdul Kareem Shareef; Alghazali, Tawfeeq; Kadhim, Qasim Khlaif; Sabti, Ahmed Abdulateef; Sabit, Shaker Holh – International Journal of Language Testing, 2023
Cognitive diagnostic models (CDMs) have received much interest within the field of language testing over the last decade due to their great potential to provide diagnostic feedback to all stakeholders and ultimately improve language teaching and learning. A large number of studies have demonstrated the application of CDMs on advanced large-scale…
Descriptors: Reading Comprehension, Reading Tests, Language Tests, English (Second Language)
Alammary, Ali – IEEE Transactions on Learning Technologies, 2021
Developing effective assessments is a critical component of quality instruction. Assessments are effective when they are well-aligned with the learning outcomes, can confirm that all intended learning outcomes are attained, and their obtained grades are accurately reflecting the level of student achievement. Developing effective assessments is not…
Descriptors: Outcomes of Education, Alignment (Education), Student Evaluation, Data Analysis
Liao, Linyu – English Language Teaching, 2020
As a high-stakes standardized test, IELTS is expected to have comparable forms of test papers so that test takers from different test administration on different dates receive comparable test scores. Therefore, this study examined the text difficulty and task characteristics of four parallel academic IELTS reading tests to reveal to what extent…
Descriptors: Second Language Learning, English (Second Language), Language Tests, High Stakes Tests
Beauchamp, David; Constantinou, Filio – Research Matters, 2020
Assessment is a useful process as it provides various stakeholders (e.g., teachers, parents, government, employers) with information about students' competence in a particular subject area. However, for the information generated by assessment to be useful, it needs to support valid inferences. One factor that can undermine the validity of…
Descriptors: Computational Linguistics, Inferences, Validity, Language Usage
Adeleke, A. A.; Joshua, E. O. – Journal of Education and Practice, 2015
Physics literacy plays a crucial part in global technological development as several aspects of science and technology apply concepts and principles of physics in their operations. However, the acquisition of scientific literacy in physics in our society today is not encouraging enough to the desirable standard. Therefore, this study focuses on…
Descriptors: Physics, Secondary School Students, Scientific Literacy, Foreign Countries
Shuqun, Yang; Shuliang, Ding; Zhiqiang, Yao – International Journal of Distance Education Technologies, 2009
Cognitive diagnosis (CD) plays an important role in intelligent tutoring system. Computerized adaptive testing (CAT) is adaptive, fair, and efficient, which is suitable to large-scale examination. Traditional cognitive diagnostic test needs quite large number of items, the efficient and tailored CAT could be a remedy for it, so the CAT with…
Descriptors: Monte Carlo Methods, Distance Education, Adaptive Testing, Intelligent Tutoring Systems
Ji, Mindy F. – 1999
Item and test analyses can be used to revise and improve both test items and the test as a whole. Recommendations for item and test analysis practices as they are reported in commonly used measurement textbooks are summarized. A heuristic data set is used to illustrate test and item analysis practices. Techniques developed in this paper are…
Descriptors: Computation, Computer Software, Item Analysis, Test Construction

Hambleton, Ronald K. – Educational Measurement: Issues and Practice, 1984
The purpose of this paper is to describe some of the current changes in test development that are taking place because of the availability and capabilities of computers, especially microcomputers. Item banking and test assembly are discussed, and a comprehensive testing system is described. (BW)
Descriptors: Computer Assisted Testing, Computer Software, Educational Testing, Elementary Secondary Education
Applegate, Brooks – 1987
Computer programs are presented to plot item and test characteristic curves and item information functions for parameter estimates produced by the LOGIST and BICAL computer programs. These programs provide data in tabular format, but their usefulness in test development and measurement courses can be greatly enhanced by graphic plots of the item…
Descriptors: Computer Assisted Instruction, Computer Graphics, Computer Oriented Programs, Computer Software
Skaggs, Gary; Stevenson, Jose – 1986
This study assesses the accuracy of ASCAL, a microcomputer-based program for estimating item parameters for the three-parameter logistic model in item response theory. Item responses are generated from a three-parameter model, and item parameter estimates from ASCAL are compared to the generating item parameters and to estimates produced by…
Descriptors: Algorithms, Comparative Analysis, Computer Software, Estimation (Mathematics)
Dirkzwager, Arie – International Journal of Testing, 2003
The crux in psychometrics is how to estimate the probability that a respondent answers an item correctly on one occasion out of many. Under the current testing paradigm this probability is estimated using all kinds of statistical techniques and mathematical modeling. Multiple evaluation is a new testing paradigm using the person's own personal…
Descriptors: Psychometrics, Probability, Models, Measurement
Madsen, Harold S. – 1986
The most appropriate statistical model for the small-scale (n<100) studies common in language testing research is the Rasch one-parameter logistic model. The Rasch model provides a wide range of options for conducting research, refining existing examinations, and developing tailored (computerized adaptive) language tests. Three investigations…
Descriptors: Computer Assisted Testing, Computer Software, English (Second Language), Item Analysis
Previous Page | Next Page ยป
Pages: 1 | 2