ERIC - Search Results

Publication Date

In 2025	2
Since 2024	5
Since 2021 (last 5 years)	7
Since 2016 (last 10 years)	9
Since 2006 (last 20 years)	23

Descriptor

Computer Assisted Testing	30
Evaluation Methods	30
Evaluation Criteria	26
Foreign Countries	10
Student Evaluation	9
Educational Assessment	7
Educational Technology	7
College Students	6
Computer Software	6
Test Construction	6
Internet	5
Adaptive Testing	4
Case Studies	4
Comparative Analysis	4
Correlation	4
Feedback (Response)	4
Higher Education	4
Item Analysis	4
Models	4
Performance Based Assessment	4
Technology Integration	4
Accuracy	3
Adolescents	3
Artificial Intelligence	3
Classification	3
More ▼

Publication Type

Journal Articles	23
Reports - Research	12
Reports - Evaluative	9
Reports - Descriptive	6
Speeches/Meeting Papers	3
Information Analyses	2
Collected Works - Proceedings	1
Dissertations/Theses -…	1

Education Level

Higher Education	10
Postsecondary Education	9
Elementary Secondary Education	3
Secondary Education	2
High Schools	1

Audience

Location

Australia	2
Denmark	2
Germany	2
Indonesia	2
Japan	2
Turkey	2
Asia	1
Brazil	1
Connecticut	1
Egypt	1
Estonia	1
Florida	1
Greece	1
Hawaii	1
Ireland	1
Israel	1
Italy	1
Kazakhstan	1
Netherlands	1
New Zealand	1
Norway	1
Ohio	1
Pakistan	1
Pennsylvania	1
Philippines	1
More ▼

Laws, Policies, & Programs

Assessments and Surveys

Test of English as a Foreign…	2
ACT Assessment	1
Advanced Placement…	1
College Level Examination…	1
National Assessment of Adult…	1
Nelson Denny Reading Tests	1
Program for International…	1
SAT (College Admission Test)	1
Test of English for…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 30 results Save | Export

Detecting Compromised Items with Response Times Using a Bayesian Change-Point Approach

Peer reviewed

Direct link

Yang Du; Susu Zhang – Journal of Educational and Behavioral Statistics, 2025

Item compromise has long posed challenges in educational measurement, jeopardizing both test validity and test security of continuous tests. Detecting compromised items is therefore crucial to address this concern. The present literature on compromised item detection reveals two notable gaps: First, the majority of existing methods are based upon…

Descriptors: Item Response Theory, Item Analysis, Bayesian Statistics, Educational Assessment

Development and Calibration of an Instrument Measuring Attitudes toward Statistics Using Classical and Modern Test Theory

Peer reviewed
PDF on ERIC

Download full text

Ezi Apino; Edi Istiyono; Heri Retnawati; Widihastuti Widihastuti; Kana Hidayati – Journal of Pedagogical Research, 2024

Assessment of attitudes towards statistics [ATS] is needed to support the success of statistics education in tertiary institutions, so measuring instruments with high accuracy is required. However, existing instruments to measure ATS have not considered the use of technology as an essential variable affecting success in statistics education. The…

Descriptors: Foreign Countries, College Students, College Faculty, Statistics Education

Student Self-Reflection as a Tool for Managing GenAI Use in Large Class Assessment

Peer reviewed

Direct link

Celeste Combrinck; Nelé Loubser – Discover Education, 2025

Written assignments for large classes pose a far more significant challenge in the age of the GenAI revolution. Suggestions such as oral exams and formative assessments are not always feasible with many students in a class. Therefore, we conducted a study in South Africa and involved 280 Honors students to explore the usefulness of Turnitin's AI…

Descriptors: Foreign Countries, Artificial Intelligence, Large Group Instruction, Alternative Assessment

A Computer Vision System for an Automated Scoring of a Hand-Drawn Geometric Figure

Peer reviewed

Direct link

Shinta Estri Wahyuningrum; Gilles van Luijtelaar; Augustina Sulastri; Marc P. H. Hendriks; Ridwan Sanjaya; Tom Heskes – SAGE Open, 2024

Visual Reproduction is a condition to measure Visual Spatial Memory as one of the cognitive domains commonly used to measure visuo-spatial memory. Geometric figures serve as stimulus material, and probands have to reproduce the figures from memory through a hand drawing. The scoring of the drawing has subjective elements. This study aims to…

Descriptors: Automation, Scores, Geometry, Visual Aids

Examining Assessment Principles and Practices within the Online Components of an Initial Teacher Education Programme in Aotearoa New Zealand

Peer reviewed

Direct link

Penny Smith; Tracey Carlyon – Assessment Matters, 2023

Learning and assessment that drives learner success should be a key tenet of all initial teacher education programmes. Initial teacher education providers in Aotearoa New Zealand must use an assessment framework to ensure that graduating teachers meet the Teaching Council standards. As a part of a review of their assessment practices, academic…

Descriptors: Foreign Countries, Beginning Teachers, Beginning Teacher Induction, Teacher Education

Charting the Future of Assessments. Full Report

Download full text

Patrick C. Kyllonen; Amit Sevak; Teresa Ober; Ikkyu Choi; Jesse Sparks; Daniel Fishtein – ETS Research Institute, 2024

Assessment refers to a broad array of approaches for measuring or evaluating a person's (or group of persons') skills, behaviors, dispositions, or other attributes. Assessments range from standardized tests used in admissions, employee selection, licensure examinations, and domestic and international largescale assessments of cognitive and…

Descriptors: Performance Based Assessment, Evaluation Criteria, Evaluation Methods, Test Bias

A Sequential Bayesian Changepoint Detection Procedure for Aberrant Behaviors in Computerized Testing

Peer reviewed
PDF on ERIC

Download full text

Direct link

Jing Lu; Chun Wang; Jiwei Zhang; Xue Wang – Grantee Submission, 2023

Changepoints are abrupt variations in a sequence of data in statistical inference. In educational and psychological assessments, it is pivotal to properly differentiate examinees' aberrant behaviors from solution behavior to ensure test reliability and validity. In this paper, we propose a sequential Bayesian changepoint detection algorithm to…

Descriptors: Bayesian Statistics, Behavior Patterns, Computer Assisted Testing, Accuracy

A Modified "a"-Stratified Method for Computerized Adaptive Testing. Research Report. ETS RR-19-10

Peer reviewed
PDF on ERIC

Download full text

Gu, Lixiong; Ling, Guangming; Qu, Yanxuan – ETS Research Report Series, 2019

Research has found that the "a"-stratified item selection strategy (STR) for computerized adaptive tests (CATs) may lead to insufficient use of high a items at later stages of the tests and thus to reduced measurement precision. A refined approach, unequal item selection across strata (USTR), effectively improves test precision over the…

Descriptors: Computer Assisted Testing, Adaptive Testing, Test Use, Test Items

An Evaluation Framework and Instrument for Evaluating e-Assessment Tools

Peer reviewed
PDF on ERIC

Download full text

Singh, Upasana Gitanjali; de Villiers, Mary Ruth – International Review of Research in Open and Distributed Learning, 2017

e-Assessment, in the form of tools and systems that deliver and administer multiple choice questions (MCQs), is used increasingly, raising the need for evaluation and validation of such systems. This research uses literature and a series of six empirical action research studies to develop an evaluation framework of categories and criteria called…

Descriptors: Computer Assisted Testing, Multiple Choice Tests, Test Selection, Action Research

Use of Online Assessment Tools by Instructional Designers-by-Assignment: Necessary Features and Functionalities

Direct link

Halloran, Jo-Ann – ProQuest LLC, 2013

Government entities set criteria for institutions that have teacher educator programs to use online assessment tools to show continuous ongoing evaluation, and use data from the tools to guide the improvement of courses. The purpose of this qualitative, multi-case study was to discover how Instructional Designers-by-Assignment (IDBA) are using…

Descriptors: Instructional Design, Student Evaluation, Computer Assisted Testing, Evaluation Methods

Automated Essay Scoring: Psychometric Guidelines and Practices

Peer reviewed

Direct link

Ramineni, Chaitanya; Williamson, David M. – Assessing Writing, 2013

In this paper, we provide an overview of psychometric procedures and guidelines Educational Testing Service (ETS) uses to evaluate automated essay scoring for operational use. We briefly describe the e-rater system, the procedures and criteria used to evaluate e-rater, implications for a range of potential uses of e-rater, and directions for…

Descriptors: Educational Testing, Guidelines, Scoring, Psychometrics

The Applicability of Multidimensional Computerized Adaptive Testing for Cognitive Ability Measurement in Organizational Assessment

Peer reviewed

Direct link

Makransky, Guido; Glas, Cees A. W. – International Journal of Testing, 2013

Cognitive ability tests are widely used in organizations around the world because they have high predictive validity in selection contexts. Although these tests typically measure several subdomains, testing is usually carried out for a single subdomain at a time. This can be ineffective when the subdomains assessed are highly correlated. This…

Descriptors: Foreign Countries, Cognitive Ability, Adaptive Testing, Feedback (Response)

Evaluation of the "e-rater"® Scoring Engine for the "TOEFL"® Independent and Integrated Prompts. Research Report. ETS RR-12-06

Peer reviewed
PDF on ERIC

Download full text

Ramineni, Chaitanya; Trapani, Catherine S.; Williamson, David M.; Davey, Tim; Bridgeman, Brent – ETS Research Report Series, 2012

Scoring models for the "e-rater"® system were built and evaluated for the "TOEFL"® exam's independent and integrated writing prompts. Prompt-specific and generic scoring models were built, and evaluation statistics, such as weighted kappas, Pearson correlations, standardized differences in mean scores, and correlations with…

Descriptors: Scoring, Prompting, Evaluators, Computer Software

The D-Optimality Item Selection Criterion in the Early Stage of CAT: A Study with the Graded Response Model

Peer reviewed

Direct link

Passos, Valeria Lima; Berger, Martijn P. F.; Tan, Frans E. S. – Journal of Educational and Behavioral Statistics, 2008

During the early stage of computerized adaptive testing (CAT), item selection criteria based on Fisher"s information often produce less stable latent trait estimates than the Kullback-Leibler global information criterion. Robustness against early stage instability has been reported for the D-optimality criterion in a polytomous CAT with the…

Descriptors: Computer Assisted Testing, Adaptive Testing, Evaluation Criteria, Item Analysis

Basic Skills Assessment

Peer reviewed

Direct link

Yin, Alexander C.; Volkwein, J. Fredericks – New Directions for Institutional Research, 2010

After surveying 1,827 students in their final year at eighty randomly selected two-year and four-year public and private institutions, American Institutes for Research (2006) reported that approximately 30 percent of students in two-year institutions and nearly 20 percent of students in four-year institutions have only basic quantitative…

Descriptors: Standardized Tests, Basic Skills, College Admission, Educational Testing

Previous Page | Next Page »

Pages: 1 | 2

ETS Research Report Series	2
Journal of Educational and…	2
Assessing Writing	1
Assessment Matters	1
Campus-Wide Information…	1
Computers & Education	1
Discover Education	1
EDUCAUSE Quarterly	1
ETS Research Institute	1
Evaluation and Program…	1
Grantee Submission	1
International Association for…	1
International Journal of…	1
International Journal on…	1
International Review of…	1
Journal of Pedagogical…	1
New Directions for…	1
Online Submission	1
Personnel Psychology	1
ProQuest LLC	1
Proceedings of the ASIS…	1
Research Quarterly for…	1
SAGE Open	1
System	1
Turkish Online Journal of…	1
More ▼

Ramineni, Chaitanya	2
Williamson, David M.	2
Amit Sevak	1
Augustina Sulastri	1
Baba, A. Fevzi	1
Barron, Colin	1
Berger, Martijn P. F.	1
Bridgeman, Brent	1
Brusilovsky, Peter	1
Celeste Combrinck	1
Chu, Hui-Chun	1
Chun Wang	1
Daniel Fishtein	1
Davey, Tim	1
Drabenstott, Karen M.	1
Edi Istiyono	1
Ezi Apino	1
Fleishman, Edwin A.	1
Franks, B. Don	1
Gathercoal, Paul	1
Gilles van Luijtelaar	1
Glas, Cees A. W.	1
Gu, Lixiong	1
Halloran, Jo-Ann	1
Han, Kerem	1
More ▼