Publication Date
In 2025 | 2 |
Since 2024 | 2 |
Since 2021 (last 5 years) | 4 |
Since 2016 (last 10 years) | 7 |
Since 2006 (last 20 years) | 30 |
Descriptor
Computer Software | 42 |
Evaluation Methods | 42 |
Test Construction | 42 |
Computer Assisted Testing | 22 |
Foreign Countries | 18 |
Educational Technology | 14 |
Student Evaluation | 12 |
Test Items | 12 |
Internet | 8 |
Teaching Methods | 7 |
Comparative Analysis | 6 |
More ▼ |
Source
Author
Baker, Eva L. | 2 |
Kleinhans, Janne | 2 |
Mott, Michael S. | 2 |
Schumann, Matthias | 2 |
Abdullah Al Fraidan | 1 |
Adler-Nevo, Gili | 1 |
Alghazali, Tawfeeq | 1 |
Altas, Irfan | 1 |
Aydin, Fatih | 1 |
Bahar, Mehmet | 1 |
Baker, Jason, Ed. | 1 |
More ▼ |
Publication Type
Education Level
Higher Education | 8 |
Postsecondary Education | 7 |
Elementary Secondary Education | 4 |
Secondary Education | 3 |
Elementary Education | 2 |
Adult Education | 1 |
Grade 6 | 1 |
High Schools | 1 |
Intermediate Grades | 1 |
Middle Schools | 1 |
Audience
Practitioners | 1 |
Teachers | 1 |
Location
Turkey | 3 |
Australia | 2 |
Germany | 2 |
Italy | 2 |
Canada | 1 |
China | 1 |
Cyprus | 1 |
Hong Kong | 1 |
Ireland | 1 |
Israel | 1 |
Saudi Arabia | 1 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
Advanced Placement… | 1 |
International English… | 1 |
Preliminary Scholastic… | 1 |
SAT (College Admission Test) | 1 |
State Trait Anxiety Inventory | 1 |
Test of English as a Foreign… | 1 |
What Works Clearinghouse Rating
Bryan R. Drost; Char Shryock – Phi Delta Kappan, 2025
Creating assessment questions aligned to standards is a time-consuming task for teachers, but large language models such as ChatGPT can help. Bryan Drost & Char Shryock describe a three-step process for using ChatGPT to create assessments: 1) Ask ChatGPT to break standards into measurable targets. 2) Determine how much time to spend on each…
Descriptors: Artificial Intelligence, Computer Software, Technology Integration, Teaching Methods
Abdullah Al Fraidan – International Journal of Distance Education Technologies, 2025
This study explores vocabulary assessment practices in Saudi Arabia's hybrid EFL ecosystem, leveraging platforms like Blackboard and Google Forms. The focus is on identifying prevalent test formats and evaluating their alignment with modern pedagogical goals. To classify vocabulary assessment formats in hybridized EFL contexts and recommend the…
Descriptors: Vocabulary Development, English (Second Language), Second Language Learning, Second Language Instruction
The AI Teacher Test: Measuring the Pedagogical Ability of Blender and GPT-3 in Educational Dialogues
Tack, Anaïs; Piech, Chris – International Educational Data Mining Society, 2022
How can we test whether state-of-the-art generative models, such as Blender and GPT-3, are good AI teachers, capable of replying to a student in an educational dialogue? Designing an AI teacher test is challenging: although evaluation methods are much-needed, there is no off-the-shelf solution to measuring pedagogical ability. This paper reports…
Descriptors: Artificial Intelligence, Dialogs (Language), Bayesian Statistics, Decision Making
Mohammed, Aisha; Dawood, Abdul Kareem Shareef; Alghazali, Tawfeeq; Kadhim, Qasim Khlaif; Sabti, Ahmed Abdulateef; Sabit, Shaker Holh – International Journal of Language Testing, 2023
Cognitive diagnostic models (CDMs) have received much interest within the field of language testing over the last decade due to their great potential to provide diagnostic feedback to all stakeholders and ultimately improve language teaching and learning. A large number of studies have demonstrated the application of CDMs on advanced large-scale…
Descriptors: Reading Comprehension, Reading Tests, Language Tests, English (Second Language)
Wang, Jue; Engelhard, George, Jr. – Educational Measurement: Issues and Practice, 2019
In this digital ITEMS module, Dr. Jue Wang and Dr. George Engelhard Jr. describe the Rasch measurement framework for the construction and evaluation of new measures and scales. From a theoretical perspective, they discuss the historical and philosophical perspectives on measurement with a focus on Rasch's concept of specific objectivity and…
Descriptors: Item Response Theory, Evaluation Methods, Measurement, Goodness of Fit
Beauchamp, David; Constantinou, Filio – Research Matters, 2020
Assessment is a useful process as it provides various stakeholders (e.g., teachers, parents, government, employers) with information about students' competence in a particular subject area. However, for the information generated by assessment to be useful, it needs to support valid inferences. One factor that can undermine the validity of…
Descriptors: Computational Linguistics, Inferences, Validity, Language Usage
Kalkan, Ömür Kaya; Kelecioglu, Hülya – Educational Sciences: Theory and Practice, 2016
Linear factor analysis models used to examine constructs underlying the responses are not very suitable for dichotomous or polytomous response formats. The associated problems cannot be eliminated by polychoric or tetrachoric correlations in place of the Pearson correlation. Therefore, we considered parameters obtained from the NOHARM and FACTOR…
Descriptors: Sample Size, Nonparametric Statistics, Factor Analysis, Correlation
Kleinhans, Janne; Schumann, Matthias – Interactive Technology and Smart Education, 2015
Purpose: This paper investigates the potential of computerized adaptive testing for CMs to reduce test time. In the context of education and training, competency measurement (CM) is a central challenge in competency management. For complex CMs, a compromise must be addressed between the time available and the quality of the measurements.…
Descriptors: Computer Assisted Testing, Educational Technology, Time, Measurement
Greiff, Samuel; Wustenberg, Sascha; Holt, Daniel V.; Goldhammer, Frank; Funke, Joachim – Educational Technology Research and Development, 2013
Complex Problem Solving (CPS) skills are essential to successfully deal with environments that change dynamically and involve a large number of interconnected and partially unknown causal influences. The increasing importance of such skills in the 21st century requires appropriate assessment and intervention methods, which in turn rely on adequate…
Descriptors: Evaluation Methods, Computer Software, Computer Assisted Testing, Intervention
Kleinhans, Janne; Schumann, Matthias – International Association for Development of the Information Society, 2015
In the context of education and training, competency measurement (CM) is a central challenge in competency management. For complex CMs, a compromise must be addressed between the time available and the number of dimensions to be measured or the quality of the measurements. Increasing the efficiency of existing tests for CMs therefore poses a key…
Descriptors: Foreign Countries, Competence, Allied Health Personnel, Computer Assisted Testing
Davey, Tim – Council of Chief State School Officers, 2011
Some brand names are used generically to describe an entire class of products that perform the same function. "Kleenex," "Xerox," "Thermos," and "Band-Aid" are good examples. The term "computerized adaptive testing" (CAT) is similar in that it is often applied uniformly across a diverse family of testing methods. Although the various members of…
Descriptors: Adaptive Testing, Computer Assisted Testing, Delivery Systems, Evaluation Methods
Luecht, Richard M.; Sireci, Stephen G. – College Board, 2011
Over the past four decades, there has been incremental growth in computer-based testing (CBT) as a viable alternative to paper-and-pencil testing. However, the transition to CBT is neither easy nor inexpensive. As Drasgow, Luecht, and Bennett (2006) noted, many design engineering, test development, operations/logistics, and psychometric changes…
Descriptors: College Entrance Examinations, Computer Assisted Testing, Educational Technology, Evaluation Methods
Lee, Chung-Ping; Lou, Shi-Jer; Shih, Ru-Chu; Tseng, Kuo-Hung – Turkish Online Journal of Educational Technology - TOJET, 2011
This study uses the analytical hierarchy process (AHP) to quantify important knowledge management behaviors and to analyze the weight scores of elementary school students' behaviors in knowledge transfer, sharing, and creation. Based on the analysis of Expert Choice and tests for validity and reliability, this study identified the weight scores of…
Descriptors: Knowledge Management, Elementary School Students, Student Behavior, Evaluation Methods
Gomiero, Tiziano; Croce, Luigi; Grossi, Enzo; Luc, De Vreese; Buscema, Massimo; Mantesso, Ulrico; De Bastiani, Elisa – Online Submission, 2011
The aim of this paper is to present a shortened version of the SIS (support intensity scale) obtained by the application of mathematical models and instruments, adopting special algorithms based on the most recent developments in artificial adaptive systems. All the variables of SIS applied to 1,052 subjects with ID (intellectual disabilities)…
Descriptors: Foreign Countries, Mathematical Models, Mental Retardation, Measures (Individuals)
Lee, Sang-Heui – ProQuest LLC, 2010
Although there have been a number of studies on large scale implementation of proprietary enterprise information systems (EIS), open-source software (OSS) for EIS has received limited attention in spite of its potential as a disruptive innovation. Cost saving is the main driver for adopting OSS among the other possible benefits including security…
Descriptors: Information Systems, Open Source Technology, Computer Software, Computer Software Evaluation