ERIC - Search Results

Publication Date

In 2026	1
Since 2025	29
Since 2022 (last 5 years)	133
Since 2017 (last 10 years)	273
Since 2007 (last 20 years)	413

Descriptor

Computer Assisted Testing	618
Test Validity	618
Test Reliability	264
Test Construction	196
Foreign Countries	186
Language Tests	111
Test Items	102
Scores	96
English (Second Language)	93
Evaluation Methods	88
Second Language Learning	83
Adaptive Testing	77
Test Format	73
Higher Education	69
Student Evaluation	68
Correlation	67
Scoring	67
Comparative Analysis	65
Elementary School Students	64
Psychometrics	64
Student Attitudes	56
College Students	54
Language Proficiency	53
Testing	52
Item Response Theory	47
More ▼

Education Level

Higher Education	137
Postsecondary Education	112
Elementary Education	85
Secondary Education	73
Middle Schools	37
Early Childhood Education	30
Elementary Secondary Education	28
High Schools	27
Junior High Schools	27
Primary Education	25
Grade 5	22
Intermediate Grades	22
Grade 4	21
Grade 3	17
Grade 8	15
Grade 6	12
Grade 7	12
Grade 2	11
Kindergarten	9
Adult Education	8
Grade 1	6
Grade 9	6
Preschool Education	6
Grade 10	4
Grade 11	2
More ▼

Audience

Researchers	13
Practitioners	12
Administrators	9
Teachers	3
Policymakers	2
Counselors	1

Location

China	17
Canada	14
Indonesia	13
Australia	12
Germany	11
Turkey	11
California	10
New York	8
United Kingdom	7
United Kingdom (England)	7
Taiwan	6
United States	6
Florida	5
Iran	5
Japan	5
France	4
Israel	4
Malaysia	4
North Carolina	4
Singapore	4
United Arab Emirates	4
Greece	3
Hong Kong	3
Hungary	3
Illinois	3
More ▼

Laws, Policies, & Programs

Individuals with Disabilities…	2
Family Educational Rights and…	1
Health Insurance Portability…	1
No Child Left Behind Act 2001	1
Pell Grant Program	1
Race to the Top	1

What Works Clearinghouse Rating

Test Validity X

Showing 196 to 210 of 618 results Save | Export

Validating Human and Automated Scoring of Essays against "True" Scores

Peer reviewed

Direct link

Cohen, Yoav; Levi, Effi; Ben-Simon, Anat – Applied Measurement in Education, 2018

In the current study, two pools of 250 essays, all written as a response to the same prompt, were rated by two groups of raters (14 or 15 raters per group), thereby providing an approximation to the essay's true score. An automated essay scoring (AES) system was trained on the datasets and then scored the essays using a cross-validation scheme. By…

Descriptors: Test Validity, Automation, Scoring, Computer Assisted Testing

An Adaptive Test Analysis Based on Students' Motivation

Peer reviewed
PDF on ERIC

Download full text

Yoshioka, Sérgio R. I.; Ishitani, Lucila – Informatics in Education, 2018

Computerized Adaptive Testing (CAT) is now widely used. However, inserting new items into the question bank of a CAT requires a great effort that makes impractical the wide application of CAT in classroom teaching. One solution would be to use the tacit knowledge of the teachers or experts for a pre-classification and calibrate during the…

Descriptors: Student Motivation, Adaptive Testing, Computer Assisted Testing, Item Response Theory

A Quantitative Analysis of TOEFL iBT Using an Interpretive Model of Test Validity

Peer reviewed

Direct link

Esfandiari, Mohammad Reza; Riasati, Mohammad Javad; Vaezian, Helia; Rahimi, Forough – Language Testing in Asia, 2018

Background: Validity is a notable concept in language testing which has concerned many researchers and scholars in the field of language testing due to its importance in decision making process. Tests' results always introduce consequences to test takers' lives which emphasizes the need to ensure their validity. Detecting and delineating the…

Descriptors: Computer Assisted Testing, Test Validity, Language Tests, English (Second Language)

Development of the English Listening and Reading Computerized Revised Token Test into Cantonese: Validity, Reliability, and Sensitivity/Specificity in People with Aphasia and Healthy Controls

Peer reviewed

Direct link

Bakhtiar, Mehdi; Wong, Min Ney; Tsui, Emily Ka Yin; McNeil, Malcolm R. – Journal of Speech, Language, and Hearing Research, 2020

Purpose: This study reports the psychometric development of the Cantonese versions of the English Computerized Revised Token Test (CRTT) for persons with aphasia (PWAs) and healthy controls (HCs). Method: The English CRTT was translated into standard Chinese for the Reading--Word Fade version (CRTT-R-[subscript WF]-Cantonese) and into formal…

Descriptors: Psychometrics, Sino Tibetan Languages, Computer Assisted Testing, Aphasia

Automated Assessment of Complex Programming Tasks Using SIETTE

Peer reviewed

Direct link

Conejo, Ricardo; Barros, Beatriz; Bertoa, Manuel F. – IEEE Transactions on Learning Technologies, 2019

This paper presents an innovative method to tackle the automatic evaluation of programming assignments with an approach based on well-founded assessment theories (Classical Test Theory (CTT) and Item Response Theory (IRT)) instead of heuristic assessment as in other systems. CTT and/or IRT are used to grade the results of different items of…

Descriptors: Computer Assisted Testing, Grading, Programming, Item Response Theory

The Development of a Web-Based Assessment System to Identify Students' Misconception Automatically on Linear Kinematics with a Four-Tier Instrument Test

Peer reviewed

Direct link

Pujayanto, Pujayanto; Budiharti, Rini; Adhitama, Egy; Nuraini, Niken Rizky Amalia; Putri, Hanung Vernanda – Physics Education, 2018

This research proposes the development of a web-based assessment system to identify students' misconception. The system, named WAS (web-based assessment system), can identify students' misconception profile on linear kinematics automatically after the student has finished the test. The test instrument was developed and validated. Items were…

Descriptors: Misconceptions, Physics, Science Instruction, Databases

Maintaining the Validity of the NAEP Frameworks and Assessments in Civics and U.S. History

Download full text

O'Malley, Fran; Norton, Scott – American Institutes for Research, 2022

This paper provides the National Center for Education Statistics (NCES), National Assessment Governing Board (NAGB), and the National Assessment of Educational Progress (NAEP) community with information that may help maintain the validity and utility of the NAEP assessments for civics and U.S. history as revisions are planned to the NAEP…

Descriptors: National Competency Tests, United States History, Test Validity, Governing Boards

Topic Familiarity Matters: A Critical Analysis of TOEFL iBT Reading Section

Peer reviewed
PDF on ERIC

Download full text

Toker, Deniz – TESL-EJ, 2019

The central purpose of this paper is to examine validity problems arising from the multiple-choice items and technical passages in the Test of English as a Foreign Language Internet-based Test (TOEFL iBT) reading section, primarily concentrating on construct-irrelevant variance (Messick, 1989). My personal TOEFL iBT experience, along with my…

Descriptors: English (Second Language), Language Tests, Second Language Learning, Computer Assisted Testing

Rapid-Guessing Behavior: Its Identification, Interpretation, and Implications

Peer reviewed

Direct link

Wise, Steven L. – Educational Measurement: Issues and Practice, 2017

The rise of computer-based testing has brought with it the capability to measure more aspects of a test event than simply the answers selected or constructed by the test taker. One behavior that has drawn much research interest is the time test takers spend responding to individual multiple-choice items. In particular, very short response…

Descriptors: Guessing (Tests), Multiple Choice Tests, Test Items, Reaction Time

Computer-Automated Approach for Scoring Short Essays in an Introductory Statistics Course

Peer reviewed

Direct link

Zimmerman, Whitney Alicia; Kang, Hyun Bin; Kim, Kyung; Gao, Mengzhao; Johnson, Glenn; Clariana, Roy; Zhang, Fan – Journal of Statistics Education, 2018

Over two semesters short essay prompts were developed for use with the Graphical Interface for Knowledge Structure (GIKS), an automated essay scoring system. Participants were students in an undergraduate-level online introductory statistics course. The GIKS compares students' writing samples with an expert's to produce keyword occurrence and…

Descriptors: Undergraduate Students, Introductory Courses, Statistics, Computer Assisted Testing

Examining Effectiveness and Validity of Accommodations for English Language Learners in Mathematics: An Evidence-Based Computer Accommodation Decision System

Peer reviewed

Direct link

Abedi, Jamal; Zhang, Yu; Rowe, Susan E.; Lee, Hansol – Educational Measurement: Issues and Practice, 2020

Research indicates that the performance-gap between English Language Learners (ELLs) and their non-ELL peers is partly due to ELLs' difficulty in understanding assessment language. Accommodations have been shown to narrow this performance-gap, but many accommodations studies have not used a randomized design and are based on relatively small…

Descriptors: English Language Learners, Achievement Gap, Mathematics Tests, Standards

A Computer Adaptive Measure of Reading Motivation

Peer reviewed
PDF on ERIC

Download full text

Direct link

Davis, Marcia H.; Wang, Wenhao; Kingston, Neal M.; Hock, Michael; Tonks, Stephen M.; Tiemann, Gail – Grantee Submission, 2020

Background: The importance of reading motivation has led to the development of a large number of self-report reading motivation measures; however, there is still a need for a usable measure of adolescent reading motivation that captures a large number of theoretically and empirically distinct constructs. Methods: The current paper details the…

Descriptors: Reading Motivation, Computer Assisted Testing, Adaptive Testing, Measures (Individuals)

A Comparison of Spoken and Written Language Use in Traditional and Technology-Mediated Learning Environments. TOEFL® Research Report. RR-94. ETS RR-21-16

Peer reviewed
PDF on ERIC

Download full text

Kyle, Kristopher; Choe, Ann Tai; Eguchi, Masaki; LaFlair, Geoff; Ziegler, Nicole – ETS Research Report Series, 2021

A key piece of a validity argument for a language assessment tool is clear overlap between assessment tasks and the target language use (TLU) domain (i.e., the domain description inference). The TOEFL 2000 Spoken and Written Academic Language (T2K-SWAL) corpus, which represents a variety of academic registers and disciplines in traditional…

Descriptors: Comparative Analysis, Second Language Learning, English (Second Language), Language Tests

Developing IRT-Based Physics Critical Thinking Skill Test: A CAT to Answer 21st Century Challenge

Peer reviewed
PDF on ERIC

Download full text

Istiyono, Edi; Dwandaru, Wipsar Sunu Brams; Lede, Yulita Adelfin; Rahayu, Farida; Nadapdap, Amipa – International Journal of Instruction, 2019

The objective of this study was to develop Physics critical thinking skill test using computerized adaptive test (CAT) based on item response theory (IRT). This research was a development research using 4-D (define, design, develop, and disseminate). The content validity of the items was proven using Aiken's V. The test trial involved 252 students…

Descriptors: Critical Thinking, Thinking Skills, Cognitive Tests, Physics

Corpus Linguistics and Language Testing: Navigating Uncharted Waters

Peer reviewed

Direct link

Egbert, Jesse – Language Testing, 2017

The use of corpora and corpus linguistic methods in language testing research is increasing at an accelerated pace. The growing body of language testing research that uses corpus linguistic data is a testament to their utility in test development and validation. Although there are many reasons to be optimistic about the future of using corpus data…

Descriptors: Language Tests, Second Language Learning, Computational Linguistics, Best Practices

« Previous Page | Next Page »

Pages: 1 | ... | 10 | 11 | 12 | 13 | 14 | 15 | 16 | 17 | 18 | ... | 42

Language Assessment Quarterly	18
Language Testing	18
ETS Research Report Series	17
Educational Measurement:…	14
Grantee Submission	12
ProQuest LLC	12
Online Submission	11
Journal of Psychoeducational…	10
Assessment for Effective…	9
Educational and Psychological…	7
Computers in Human Behavior	6
Education and Information…	6
Journal of Educational…	6
Measurement and Evaluation in…	6
New York State Education…	6
Psychological Assessment	6
Applied Measurement in…	5
Journal of Computer Assisted…	5
Journal of Speech, Language,…	5
Turkish Online Journal of…	5
Advances in Health Sciences…	4
Assessment	4
Evaluation and the Health…	4
International Association for…	4
International Journal of…	4
More ▼

McKown, Clark	5
Petscher, Yaacov	5
Bulut, Okan	4
Garcia Laborda, Jesus	4
Wainer, Howard	4
Wise, Steven L.	4
Alonzo, Julie	3
Bejar, Isaac I.	3
Bennett, Randy Elliot	3
Cory, Charles H.	3
Ecalle, Jean	3
Federico, Pat-Anthony	3
He, Lianzhen	3
Larson, Jerry W.	3
Ling, Guangming	3
Magnan, Annie	3
Nese, Joseph F. T.	3
Or, Caleb	3
Rock, Donald A.	3
Russo-Ponsaran, Nicole M.	3
Tindal, Gerald	3
Tock, Jamie	3
Weiss, David J.	3
Xi, Xiaoming	3
More ▼

Journal Articles	446
Reports - Research	408
Reports - Evaluative	97
Reports - Descriptive	56
Speeches/Meeting Papers	51
Tests/Questionnaires	33
Information Analyses	21
Opinion Papers	17
Dissertations/Theses -…	14
Guides - Non-Classroom	9
Numerical/Quantitative Data	8
Books	7
Collected Works - General	7
Collected Works - Proceedings	6
Guides - General	5
Guides - Classroom - Teacher	2
Reports - General	2
Collected Works - Serials	1
ERIC Digests in Full Text	1
ERIC Publications	1
More ▼

Test of English as a Foreign…	33
Armed Services Vocational…	7
Gates MacGinitie Reading Tests	6
International English…	5
Peabody Picture Vocabulary…	5
Wechsler Intelligence Scale…	4
ACT Assessment	3
Armed Forces Qualification…	3
Minnesota Multiphasic…	3
National Assessment of…	3
Program for International…	3
SAT (College Admission Test)	3
Woodcock Johnson Tests of…	3
Autism Diagnostic Observation…	2
Behavior Assessment System…	2
Dynamic Indicators of Basic…	2
Graduate Record Examinations	2
Iowa Tests of Basic Skills	2
Measures of Academic Progress	2
Pediatric Evaluation of…	2
Raven Progressive Matrices	2
Stanford Achievement Tests	2
Vineland Adaptive Behavior…	2
Battelle Developmental…	1
Bayley Scales of Infant and…	1
More ▼