ERIC - Search Results

Showing all 14 results

The Feasibility of Computerized Adaptive Testing of the National Benchmark Test: A Simulation Study

Peer reviewed

Musa Adekunle Ayanwale; Mdutshekelwa Ndlovu – Journal of Pedagogical Research, 2024

The COVID-19 pandemic has had a significant impact on high-stakes testing, including the national benchmark tests in South Africa. Current linear testing formats have been criticized for their limitations, leading to a shift towards Computerized Adaptive Testing [CAT]. Assessments with CAT are more precise and take less time. Evaluation of CAT…

Descriptors: Adaptive Testing, Benchmarking, National Competency Tests, Computer Assisted Testing

Automatic Multiple Choice Question Generation From Text: A Survey

Peer reviewed

Rao, Dhawaleswar; Saha, Sujan Kumar – IEEE Transactions on Learning Technologies, 2020

Automatic multiple choice question (MCQ) generation from a text is a popular research area. MCQs are widely accepted for large-scale assessment in various domains and applications. However, manual generation of MCQs is expensive and time-consuming. Therefore, researchers have been attracted toward automatic MCQ generation since the late 90's.…

Descriptors: Multiple Choice Tests, Test Construction, Automation, Computer Software

Measuring Original Thinking in Elementary School: Development and Validation of a Computational Psychometric Approach

Peer reviewed

Selcuk Acar; Denis Dumas; Peter Organisciak; Kelly Berthiaume – Grantee Submission, 2024

Creativity is highly valued in both education and the workforce, but assessing and developing creativity can be difficult without psychometrically robust and affordable tools. The open-ended nature of creativity assessments has made them difficult to score, expensive, often imprecise, and therefore impractical for school- or district-wide use. To…

Descriptors: Thinking Skills, Elementary School Students, Artificial Intelligence, Measurement Techniques

Benthik Android Physics Comic Effectiveness for Vector Representation and Critical Thinking Students' Improvement

Peer reviewed

Maghfiroh, Anissa; Kuswanto, Heru – International Journal of Instruction, 2022

This research aims to reveal the effectiveness of the use of Kofie GeBoL media in improving (1) vector representation ability and (2) critical thinking ability in physics instruction. It is a descriptive quantitative study with the quasi-experiment design. It was conducted in two stages: empirical try out and implementation of Kofie GeboL to see…

Descriptors: Physics, Instructional Effectiveness, Critical Thinking, Thinking Skills

A Cognitive Diagnostic Assessment Study of the Reading Comprehension Section of the Preliminary English Test (PET)

Peer reviewed

Mohammed, Aisha; Dawood, Abdul Kareem Shareef; Alghazali, Tawfeeq; Kadhim, Qasim Khlaif; Sabti, Ahmed Abdulateef; Sabit, Shaker Holh – International Journal of Language Testing, 2023

Cognitive diagnostic models (CDMs) have received much interest within the field of language testing over the last decade due to their great potential to provide diagnostic feedback to all stakeholders and ultimately improve language teaching and learning. A large number of studies have demonstrated the application of CDMs on advanced large-scale…

Descriptors: Reading Comprehension, Reading Tests, Language Tests, English (Second Language)

Scoring Graphical Responses in TIMSS 2019 Using Artificial Neural Networks

Peer reviewed

von Davier, Matthias; Tyack, Lillian; Khorramdel, Lale – Educational and Psychological Measurement, 2023

Automated scoring of free drawings or images as responses has yet to be used in large-scale assessments of student achievement. In this study, we propose artificial neural networks to classify these types of graphical responses from a TIMSS 2019 item. We are comparing classification accuracy of convolutional and feed-forward approaches. Our…

Descriptors: Scoring, Networks, Artificial Intelligence, Elementary Secondary Education

Making Better Tests with the Rasch Measurement Model

Peer reviewed

Karlin, Omar; Karlin, Sayaka – InSight: A Journal of Scholarly Teaching, 2018

This study had two aims. The first was to explain the process of using the Rasch measurement model to validate tests in an easy-to-understand way for those unfamiliar with the Rasch measurement model. The second was to validate two final exams with several shared items. The exams were given to two groups of students with slightly differing English…

Descriptors: Item Response Theory, Test Validity, Test Items, Accuracy

Development and Validation of Scientific Literacy Achievement Test to Assess Senior Secondary School Students' Literacy Acquisition in Physics

Peer reviewed

Adeleke, A. A.; Joshua, E. O. – Journal of Education and Practice, 2015

Physics literacy plays a crucial part in global technological development as several aspects of science and technology apply concepts and principles of physics in their operations. However, the acquisition of scientific literacy in physics in our society today is not encouraging enough to the desirable standard. Therefore, this study focuses on…

Descriptors: Physics, Secondary School Students, Scientific Literacy, Foreign Countries

iBank

Bermundo, Cesar B.; Bermundo, Alex B.; Ballester, Rex C. – Australian Association for Research in Education (NJ1), 2012

iBank is a project that uses software to create an item bank that stores quality questions, generates tests, and prints exams. The items come from analyzed teacher-constructed test questions, which provide the basis for discussing test results, by determining why a test item is or is not discriminating between the better and poorer students, and by…

Descriptors: Test Items, Computer Software, Test Results, Test Construction

Construct Validity and Measurement Invariance of the Peabody Picture Vocabulary Test-III Form A

Peer reviewed

Pae, Hye K.; Greenberg, Daphne; Morris, Robin D. – Language Assessment Quarterly, 2012

The aim of this study was to apply the Rasch model to an analysis of the psychometric properties of the Peabody Picture Vocabulary Test-III Form A (PPVT-IIIA) items with struggling adult readers. The PPVT-IIIA was administered to 229 African American adults whose isolated word reading skills were between third and fifth grades. Conformity of…

Descriptors: African Americans, Test Items, Construct Validity, Test Validity

A Short Version of SIS (Support Intensity Scale): The Utility of the Application of Artificial Adaptive Systems

Gomiero, Tiziano; Croce, Luigi; Grossi, Enzo; Luc, De Vreese; Buscema, Massimo; Mantesso, Ulrico; De Bastiani, Elisa – Online Submission, 2011

The aim of this paper is to present a shortened version of the SIS (support intensity scale) obtained by the application of mathematical models and instruments, adopting special algorithms based on the most recent developments in artificial adaptive systems. All the variables of SIS applied to 1,052 subjects with ID (intellectual disabilities)…

Descriptors: Foreign Countries, Mathematical Models, Mental Retardation, Measures (Individuals)

An Evaluation of "Polyweighting" in Domain-Referenced Testing.

Sympson, J. Bradford; Haladyna, Thomas M. – 1988

A new approach to polychotomous scoring of test items, similar to "max-alpha" scaling (MAS) and known as polyweighting, has been developed. Unlike MAS, this new method of polychotomous scoring provides scoring weights for a given item that are independent of the difficulty of other items in the analysis. Moreover, the scoring weights are…

Descriptors: Computer Software, Difficulty Level, Item Analysis, Latent Trait Theory

An Analysis of Item Exposure and Item Parameter Drift on a Take-Home Recertification Exam

Giordano, Carolyn; Subhiyah, Raja; Hess, Brian – Online Submission, 2005

There are few certifying or recertifying examinations in the medical field that are given in a take-home format. This stems from a concern that examinees may discuss items with peers, or save copies of items on the exam and then pass them on to others. This study examined if item exposure on take-home examinations influences the difficulty of the…

Descriptors: Computer Software, Test Items, Certification, Licensing Examinations (Professions)

Rasch Model Applications To Determine the Equivalence of a Readiness Test in Two Languages.

Lang, William Steve; And Others – 1993

The Lollipop Test (La Prueba Lollipop) is a bilingual preschool readiness test (in both English and Spanish) that has been the subject of a number of studies to assess validity and detect cultural bias. Such studies have not dealt with item analysis as a way to measure cultural fairness. The Rasch model was used in a study of the Lollipop test…

Descriptors: Bias, Blacks, Computer Software, Cultural Awareness
