ERIC - Search Results

Publication Date

In 2025	3
Since 2024	10
Since 2021 (last 5 years)	27
Since 2016 (last 10 years)	50
Since 2006 (last 20 years)	71

Descriptor

Computer Assisted Testing	98
Test Items	98
Test Validity	98
Test Construction	47
Test Reliability	34
Foreign Countries	31
Adaptive Testing	29
Difficulty Level	20
Test Format	20
Item Response Theory	18
Scores	16
Scoring	15
English (Second Language)	14
Item Analysis	13
Language Tests	13
Psychometrics	13
Second Language Learning	13
Mathematics Tests	12
Achievement Tests	11
Elementary School Students	10
Item Banks	10
Models	9
Reading Tests	9
Comparative Analysis	8
Evaluation Methods	8
More ▼

Publication Type

Reports - Research	63
Journal Articles	61
Reports - Evaluative	18
Speeches/Meeting Papers	11
Reports - Descriptive	8
Tests/Questionnaires	4
Dissertations/Theses -…	3
Numerical/Quantitative Data	3
Guides - Non-Classroom	2
Information Analyses	2
Opinion Papers	2
Books	1
Collected Works - General	1
ERIC Digests in Full Text	1
ERIC Publications	1
Guides - General	1
More ▼

Education Level

Higher Education	17
Postsecondary Education	16
Elementary Education	15
Secondary Education	15
Middle Schools	7
Elementary Secondary Education	6
Grade 8	6
High Schools	6
Early Childhood Education	5
Junior High Schools	5
Primary Education	5
Grade 7	4
Grade 4	3
Grade 5	3
Intermediate Grades	3
Grade 1	2
Grade 2	2
Grade 6	2
Kindergarten	2
Adult Education	1
Grade 10	1
Grade 12	1
Grade 9	1
More ▼

Audience

Researchers	3
Practitioners	2
Teachers	2
Administrators	1

Location

California	4
China	4
Florida	3
Turkey	3
Germany	2
Indonesia	2
Nebraska	2
North Carolina	2
Arkansas	1
Australia	1
Canada	1
Connecticut	1
Delaware	1
European Union	1
Georgia	1
Idaho	1
Illinois	1
Indiana	1
Iowa	1
Iran	1
Ireland	1
Israel	1
Italy	1
Japan	1
Latin America	1
More ▼

Laws, Policies, & Programs

What Works Clearinghouse Rating

Showing 1 to 15 of 98 results Save | Export

Utilizing Real-Time Test Data to Solve Attenuation Paradox in Computerized Adaptive Testing to Enhance Optimal Design

Peer reviewed

Direct link

Jyun-Hong Chen; Hsiu-Yi Chao – Journal of Educational and Behavioral Statistics, 2024

To solve the attenuation paradox in computerized adaptive testing (CAT), this study proposes an item selection method, the integer programming approach based on real-time test data (IPRD), to improve test efficiency. The IPRD method turns information regarding the ability distribution of the population from real-time test data into feasible test…

Descriptors: Data Use, Computer Assisted Testing, Adaptive Testing, Design

Improvised Progressive Model Based on Automatic Calibration of Difficulty Level: A Practical Solution of Competitive-Based Examination

Peer reviewed

Direct link

Aditya Shah; Ajay Devmane; Mehul Ranka; Prathamesh Churi – Education and Information Technologies, 2024

Online learning has grown due to the advancement of technology and flexibility. Online examinations measure students' knowledge and skills. Traditional question papers include inconsistent difficulty levels, arbitrary question allocations, and poor grading. The suggested model calibrates question paper difficulty based on student performance to…

Descriptors: Computer Assisted Testing, Difficulty Level, Grading, Test Construction

A Suggestive Approach for Assessing Item Quality, Usability and Validity of Automatic Item Generation

Peer reviewed

Direct link

Falcão, Filipe; Pereira, Daniela Marques; Gonçalves, Nuno; De Champlain, Andre; Costa, Patrício; Pêgo, José Miguel – Advances in Health Sciences Education, 2023

Automatic Item Generation (AIG) refers to the process of using cognitive models to generate test items using computer modules. It is a new but rapidly evolving research area where cognitive and psychometric theory are combined into digital framework. However, assessment of the item quality, usability and validity of AIG relative to traditional…

Descriptors: Computer Assisted Testing, Test Construction, Test Items, Automation

Instruction-Tuned Large-Language Models for Quality Control in Automatic Item Generation: A Feasibility Study

Peer reviewed

Direct link

Guher Gorgun; Okan Bulut – Educational Measurement: Issues and Practice, 2025

Automatic item generation may supply many items instantly and efficiently to assessment and learning environments. Yet, the evaluation of item quality persists to be a bottleneck for deploying generated items in learning and assessment settings. In this study, we investigated the utility of using large-language models, specifically Llama 3-8B, for…

Descriptors: Artificial Intelligence, Quality Control, Technology Uses in Education, Automation

The Feasibility of Computerized Adaptive Testing of the National Benchmark Test: A Simulation Study

Peer reviewed
PDF on ERIC

Download full text

Musa Adekunle Ayanwale; Mdutshekelwa Ndlovu – Journal of Pedagogical Research, 2024

The COVID-19 pandemic has had a significant impact on high-stakes testing, including the national benchmark tests in South Africa. Current linear testing formats have been criticized for their limitations, leading to a shift towards Computerized Adaptive Testing [CAT]. Assessments with CAT are more precise and take less time. Evaluation of CAT…

Descriptors: Adaptive Testing, Benchmarking, National Competency Tests, Computer Assisted Testing

Application of the Professional Maturity Scale as a Computerized Adaptive Testing

Peer reviewed
PDF on ERIC

Download full text

Süleyman Demir; Derya Çobanoglu Aktan; Nese Güler – International Journal of Assessment Tools in Education, 2023

This study has two main purposes. Firstly, to compare the different item selection methods and stopping rules used in Computerized Adaptive Testing (CAT) applications with simulative data generated based on the item parameters of the Vocational Maturity Scale. Secondly, to test the validity of CAT application scores. For the first purpose,…

Descriptors: Computer Assisted Testing, Adaptive Testing, Vocational Maturity, Measures (Individuals)

Applying Alternative Method to Evaluate Online Problem-Solving Skill Inventory (OPSI) Using Rasch Model Analysis

Peer reviewed

Direct link

Che Lah, Noor Hidayah; Tasir, Zaidatun; Jumaat, Nurul Farhana – Educational Studies, 2023

The aim of the study was to evaluate the extended version of the Problem-Solving Inventory (PSI) via an online learning setting known as the Online Problem-Solving Inventory (OPSI) through the lens of Rasch Model analysis. To date, there is no extended version of the PSI for online settings even though many researchers have used it; thus, this…

Descriptors: Problem Solving, Measures (Individuals), Electronic Learning, Item Response Theory

Developing Internet-Based "Tests of Aptitude for Language Learning (TALL)": An Open Research Endeavour

Peer reviewed

Direct link

Junlan Pan; Emma Marsden – Language Testing, 2024

"Tests of Aptitude for Language Learning" (TALL) is an openly accessible internet-based battery to measure the multifaceted construct of foreign language aptitude, using language domain-specific instruments and L1-sensitive instructions and stimuli. This brief report introduces the components of this theory-informed battery and…

Descriptors: Language Tests, Aptitude Tests, Second Language Learning, Test Construction

Item Equivalence Verification According to Test Information Media of the Optician National Licensing Examination: Focused on the Smart Device Based and Paper Based Tests Including Multimedia Items

Peer reviewed
PDF on ERIC

Download full text

Jang, Jung Un; Kim, Eun Joo – Journal of Curriculum and Teaching, 2022

This study conducts the validity of the pen-and-paper and smart-device-based tests on optician's examination. The developed questions for each media were based on the national optician's simulation test. The subjects of this study were 60 students enrolled in E University. The data analysis was performed to verify the equivalence of the two…

Descriptors: Optometry, Licensing Examinations (Professions), Test Format, Test Validity

The Social Shapes Test as a Self-Administered, Online Measure of Social Intelligence: Two Studies with Typically Developing Adults and Adults with Autism Spectrum Disorder

Peer reviewed

Direct link

Matt I. Brown; Patrick R. Heck; Christopher F. Chabris – Journal of Autism and Developmental Disorders, 2024

The Social Shapes Test (SST) is a measure of social intelligence which does not use human faces or rely on extensive verbal ability. The SST has shown promising validity among adults without autism spectrum disorder (ASD), but it is uncertain whether it is suitable for adults with ASD. We find measurement invariance between adults with (n = 229)…

Descriptors: Interpersonal Competence, Autism Spectrum Disorders, Emotional Intelligence, Verbal Ability

Developing and Piloting a Computerized Adaptive Test for a Culturally Appropriate Measure of Adaptive Behavior

Peer reviewed

Direct link

Chen, Mo; Nah, Yong-Hwee; Waschl, Nicolette; Poon, Kenneth; Chen, Ping – Journal of Psychoeducational Assessment, 2022

Culturally bounded in nature, adaptive behavior is the degree to which a person meets the requirements of personal independence and social responsibilities. This study aimed to develop a computerized adaptive test (CAT) of a culturally appropriate adaptive behavior measure (i.e., the Activities and Participation Rating Scale [APRS]) in the…

Descriptors: Computer Assisted Testing, Cultural Relevance, Test Construction, Test Items

Validity of Multiple-Choice Digital Formative Assessment for Assessing Students' (Mis)Conceptions: Evidence from a Mixed-Methods Study in Algebra

Peer reviewed

Direct link

Katrin Klingbeil; Fabian Rösken; Bärbel Barzel; Florian Schacht; Kaye Stacey; Vicki Steinle; Daniel Thurm – ZDM: Mathematics Education, 2024

Assessing students' (mis)conceptions is a challenging task for teachers as well as for researchers. While individual assessment, for example through interviews, can provide deep insights into students' thinking, this is very time-consuming and therefore not feasible for whole classes or even larger settings. For those settings, automatically…

Descriptors: Multiple Choice Tests, Formative Evaluation, Mathematics Tests, Misconceptions

Assessment of Basic Competencies in Adults: Item Pool Validity and Reliability Study

Peer reviewed
PDF on ERIC

Download full text

Toker, Turker – International Journal of Curriculum and Instruction, 2023

Achievement tests are among the most widely used data collection tools to measure the knowledge and skill levels of individuals. For this reason, the existence of valid and reliable achievement tests that can perfectly reveal the competencies that a person should have in any discipline is of great importance. The purpose of this research is to…

Descriptors: Basic Skills, Evaluation Methods, Test Items, Test Validity

Proving Content Validity of Android-Based Higher Order Thinking Skill Assessment for Science and Mathematics Preservice Teacher

Peer reviewed
PDF on ERIC

Download full text

Endang Susantini; Yurizka Melia Sari; Prima Vidya Asteria; Muhammad Ilyas Marzuqi – Journal of Education and Learning (EduLearn), 2025

Assessing preservice' higher order thinking skills (HOTS) in science and mathematics is essential. Teachers' HOTS ability is closely related to their ability to create HOTS-type science and mathematics problems. Among various types of HOTS, one is Bloomian HOTS. To facilitate the preservice teacher to create problems in those subjects, an Android…

Descriptors: Content Validity, Mathematics Instruction, Decision Making, Thinking Skills

Investigating the Role of Response Format in Computer-Based Lecture Comprehension Tasks

Peer reviewed

Direct link

Stefan O'Grady – International Journal of Listening, 2025

Language assessment is increasingly computermediated. This development presents opportunities with new task formats and equally a need for renewed scrutiny of established conventions. Recent recommendations to increase integrated skills assessment in lecture comprehension tests is premised on empirical research that demonstrates enhanced construct…

Descriptors: Language Tests, Lecture Method, Listening Comprehension Tests, Multiple Choice Tests

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7

Educational Measurement:…	4
Journal of Educational…	3
Language Assessment Quarterly	3
ETS Research Report Series	2
Education and Information…	2
Grantee Submission	2
Journal of Psychoeducational…	2
Nebraska Department of…	2
Practical Assessment,…	2
ProQuest LLC	2
AERA Online Paper Repository	1
Advances in Health Sciences…	1
American Institutes for…	1
Applied Measurement in…	1
College Board	1
Communique	1
Education	1
Educational Assessment	1
Educational Research and…	1
Educational Sciences: Theory…	1
Educational Studies	1
Educational Studies in…	1
Educational Testing Service	1
Educational and Psychological…	1
English Language Teaching	1
More ▼

Wainer, Howard	4
Bennett, Randy Elliot	2
Bulut, Okan	2
Petscher, Yaacov	2
Rock, Donald A.	2
Abedi, Jamal	1
Ackerman, Debra J.	1
Aditya Shah	1
Ajay Devmane	1
Albanese, Mark A.	1
Alderton, David L.	1
Anthony, Jason L.	1
Arce-Ferrer, Alvaro J.	1
Arslan, Burcu	1
Aviad-Levitzky, Tami	1
Barron, Ann E.	1
Bastianello, Tamara	1
Becker, Kirk A.	1
Bejar, Issac I.	1
Ben-Porath, Yossef S.	1
Berberoglu, Giray	1
Bergstrom, Betty	1
Bergstrom, Betty A.	1
Biancarosa, Gina	1
More ▼

Test of English as a Foreign…	5
Peabody Picture Vocabulary…	3
Program for International…	2
Stanford Achievement Tests	2
Armed Forces Qualification…	1
Armed Services Vocational…	1
Computer Attitude Scale	1
Defining Issues Test	1
Dynamic Indicators of Basic…	1
Force Concept Inventory	1
Graduate Record Examinations	1
Hidden Figures Test	1
International Association for…	1
International English…	1
Iowa Tests of Basic Skills	1
Minnesota Multiphasic…	1
National Assessment of…	1
Progress in International…	1
SAT (College Admission Test)	1
Stanford Binet Intelligence…	1
Torrance Tests of Creative…	1
Trends in International…	1
Woodcock Johnson Tests of…	1
More ▼