ERIC - Search Results

Publication Date

In 2025	0
Since 2024	1
Since 2021 (last 5 years)	4
Since 2016 (last 10 years)	5
Since 2006 (last 20 years)	7

Descriptor

Error of Measurement	9
Language Proficiency	9
Scores	9
Language Tests	6
English (Second Language)	5
Second Language Learning	5
Comparative Analysis	3
Foreign Countries	3
Test Construction	3
Test Format	3
Test Reliability	3
Test Validity	3
Culture Fair Tests	2
Reading Comprehension	2
Second Language Instruction	2
Test Interpretation	2
Test Items	2
Test Length	2
Academic Achievement	1
Animation	1
Bayesian Statistics	1
Bilingualism	1
College Entrance Examinations	1
Computer Assisted Testing	1
Correlation	1
More ▼

Source

Education and Information…	1
Educational Research and…	1
English Language Teaching	1
International Journal of…	1
International Online Journal…	1
Journal of Educational…	1
Online Submission	1

Author

Afghari, Akbar	1
Anwar, Samsul	1
Cantor, Nancy K.	1
Gelbal, Selahattin	1
Ghafournia, Narjes	1
Henning, Grant	1
Hoover, H. D.	1
Jeffry White	1
Karakolidis, Anastasios	1
Li, Min	1
Mustafa, Faisal	1
O'Leary, Michael	1
Ozdemir, Burhanettin	1
Polat, Murat	1
Scully, Darina	1
Solano-Flores, Guillermo	1
More ▼

Publication Type

Reports - Research	8
Journal Articles	7
Reports - Descriptive	1
Speeches/Meeting Papers	1

Education Level

Higher Education	4
Postsecondary Education	4
High Schools	1
Secondary Education	1

Audience

Researchers

Location

Greece	1
Iran	1
Ireland (Dublin)	1
Turkey	1

Laws, Policies, & Programs

Assessments and Surveys

Test of English as a Foreign…	3
Cognitive Abilities Test	1
Iowa Tests of Basic Skills	1
SAT (College Admission Test)	1

What Works Clearinghouse Rating

Showing all 9 results Save | Export

Measuring Language Ability of Students with Compensatory Multidimensional CAT: A Post-Hoc Simulation Study

Peer reviewed

Direct link

Ozdemir, Burhanettin; Gelbal, Selahattin – Education and Information Technologies, 2022

The computerized adaptive tests (CAT) apply an adaptive process in which the items are tailored to individuals' ability scores. The multidimensional CAT (MCAT) designs differ in terms of different item selection, ability estimation, and termination methods being used. This study aims at investigating the performance of the MCAT designs used to…

Descriptors: Scores, Computer Assisted Testing, Test Items, Language Proficiency

Comparison of Performance Measures Obtained from Foreign Language Tests According to Item Response Theory vs Classical Test Theory

Peer reviewed
PDF on ERIC

Download full text

Polat, Murat – International Online Journal of Education and Teaching, 2022

Foreign language testing is a multi-dimensional phenomenon and obtaining objective and error-free scores on learners' language skills is often problematic. While assessing foreign language performance on high-stakes tests, using different testing approaches including Classical Test Theory (CTT), Generalizability Theory (GT) and/or Item Response…

Descriptors: Second Language Learning, Second Language Instruction, Item Response Theory, Language Tests

A Rank-Order Alternative for Nonparametric Analysis with the General Linear Model

Peer reviewed
PDF on ERIC

Download full text

Jeffry White – Journal of Educational Research and Practice, 2024

Violations of normality and homogeneity are common in educational data. When this occurs, the use of parametric statistics may be inappropriate. A generalized form of nonparametric analyses based on the Puri and Sen L statistic provides an alternative approach. Using a chi-square distribution, this technique is easy to apply and has significant…

Descriptors: Nonparametric Statistics, Learning Analytics, Evaluation Methods, Guidance

Animated Videos in Assessment: Comparing Validity Evidence from and Test-Takers' Reactions to an Animated and a Text-Based Situational Judgment Test

Peer reviewed

Direct link

Karakolidis, Anastasios; O'Leary, Michael; Scully, Darina – International Journal of Testing, 2021

The linguistic complexity of many text-based tests can be a source of construct-irrelevant variance, as test-takers' performance may be affected by factors that are beyond the focus of the assessment itself, such as reading comprehension skills. This experimental study examined the extent to which the use of animated videos, as opposed to written…

Descriptors: Animation, Vignettes, Video Technology, Test Format

Distinguishing TOEFL Score: What Is the Lowest Score Considered a TOEFL Score?

Peer reviewed
PDF on ERIC

Download full text

Mustafa, Faisal; Anwar, Samsul – Online Submission, 2018

Paper-based TOEFL scores have been used to determine the level of English proficiency for EFL learners for various purposes. However, in repeat tests some lower scores fluctuate despite no additional classroom learning, thus they cannot be used to judge the English level of those taking the test. There is limited research into the lowest score…

Descriptors: English (Second Language), Language Tests, Second Language Learning, Scores

Generalizability Theory and the Fair and Valid Assessment of Linguistic Minorities

Peer reviewed

Direct link

Solano-Flores, Guillermo; Li, Min – Educational Research and Evaluation, 2013

We discuss generalizability (G) theory and the fair and valid assessment of linguistic minorities, especially emergent bilinguals. G theory allows examination of the relationship between score variation and language variation (e.g., variation of proficiency across languages, language modes, and social contexts). Studies examining score variation…

Descriptors: Measurement, Testing, Language Proficiency, Test Construction

The Interaction between Cognitive Test-Taking Strategies, Reading Ability, and Reading Comprehension Test Performance of Iranian EFL Learners

Peer reviewed
PDF on ERIC

Download full text

Ghafournia, Narjes; Afghari, Akbar – English Language Teaching, 2013

The study scrutinized the probable interaction between using cognitive test-taking strategies, reading proficiency, and reading comprehension test performance of Iranian postgraduate students, who studied English as a foreign language. The study also probed the extent to which the participants' test performance was related to the use of certain…

Descriptors: Foreign Countries, Reading Comprehension, Reading Tests, English (Second Language)

Test-Retest Analyses of the Test of English as a Foreign Language. TOEFL Research Reports Report 45.

Download full text

Henning, Grant – 1993

This study provides information about the total and component scores of the Test of English as a Foreign Language (TOEFL). First, the study provides comparative global and component estimates of test-retest, alternate-form, and internal-consistency reliability, controlling for sources of measurement error inherent in the examinees and the testing…

Descriptors: Difficulty Level, English (Second Language), Error of Measurement, Estimation (Mathematics)

The Reliability and Validity of Writing Assessment: An Investigation of Rater, Prompt within Mode, and Prompt between Mode Sources of Error.

Cantor, Nancy K.; Hoover, H. D. – 1986

This paper isolates and examines separately three distinct sources of error in essay scores: lack of agreement between raters; inconsistencies in performance within mode of discourse, and inconsistencies in performance between modes of discourse. Essay prompts in the Iowa Tests of Basic Skills (ITBS) Writing Supplement were designed to assess…

Descriptors: Academic Achievement, Cues, Elementary Secondary Education, Error of Measurement