ERIC - Search Results

Showing all 11 results

Does Modality Matter? Aural and Written Vocabulary in Second Language Listening and Reading Comprehension

Iizuka, Takehiro – ProQuest LLC, 2024

This study examined the significance of the mode of delivery--aural versus written--in second language (L2) vocabulary knowledge and L2 comprehension skills. One of the unique aspects of listening comprehension that sets it apart from reading comprehension is the mode of delivery--language input is delivered not visually but aurally. Somewhat…

Descriptors: Reading Comprehension, Listening Comprehension, Language Skills, Error of Measurement

Measuring Language Ability of Students with Compensatory Multidimensional CAT: A Post-Hoc Simulation Study

Peer reviewed

Ozdemir, Burhanettin; Gelbal, Selahattin – Education and Information Technologies, 2022

The computerized adaptive tests (CAT) apply an adaptive process in which the items are tailored to individuals' ability scores. The multidimensional CAT (MCAT) designs differ in terms of different item selection, ability estimation, and termination methods being used. This study aims at investigating the performance of the MCAT designs used to…

Descriptors: Scores, Computer Assisted Testing, Test Items, Language Proficiency

Comparison of Performance Measures Obtained from Foreign Language Tests According to Item Response Theory vs Classical Test Theory

Peer reviewed

Polat, Murat – International Online Journal of Education and Teaching, 2022

Foreign language testing is a multi-dimensional phenomenon and obtaining objective and error-free scores on learners' language skills is often problematic. While assessing foreign language performance on high-stakes tests, using different testing approaches including Classical Test Theory (CTT), Generalizability Theory (GT) and/or Item Response…

Descriptors: Second Language Learning, Second Language Instruction, Item Response Theory, Language Tests

Investigating the Impact of Rater Training on Rater Errors in the Process of Assessing Writing Skill

Peer reviewed

Sata, Mehmet; Karakaya, Ismail – International Journal of Assessment Tools in Education, 2022

In the process of measuring and assessing high-level cognitive skills, interference of rater errors in measurements brings about a constant concern and low objectivity. The main purpose of this study was to investigate the impact of rater training on rater errors in the process of assessing individual performance. The study was conducted with a…

Descriptors: Evaluators, Training, Comparative Analysis, Academic Language

A Rank-Order Alternative for Nonparametric Analysis with the General Linear Model

Peer reviewed

White, Jeffry – Journal of Educational Research and Practice, 2024

Violations of normality and homogeneity are common in educational data. When this occurs, the use of parametric statistics may be inappropriate. A generalized form of nonparametric analyses based on the Puri and Sen L statistic provides an alternative approach. Using a chi-square distribution, this technique is easy to apply and has significant…

Descriptors: Nonparametric Statistics, Learning Analytics, Evaluation Methods, Guidance

Measuring the Development of General Language Skills in English as a Foreign Language--Longitudinal Invariance of the C-Test

Peer reviewed

Schnoor, Birger; Hartig, Johannes; Klinger, Thorsten; Naumann, Alexander; Usanova, Irina – Language Testing, 2023

Research on assessing English as a foreign language (EFL) development has been growing recently. However, empirical evidence from longitudinal analyses based on substantial samples is still needed. In such settings, tests for measuring language development must meet high standards of test quality such as validity, reliability, and objectivity, as…

Descriptors: English (Second Language), Second Language Learning, Second Language Instruction, Longitudinal Studies

Equating in Small-Scale Language Testing Programs

Peer reviewed

LaFlair, Geoffrey T.; Isbell, Daniel; May, L. D. Nicolas; Gutierrez Arvizu, Maria Nelly; Jamieson, Joan – Language Testing, 2017

Language programs need multiple test forms for secure administrations and effective placement decisions, but can they have confidence that scores on alternate test forms have the same meaning? In large-scale testing programs, various equating methods are available to ensure the comparability of forms. The choice of equating method is informed by…

Descriptors: Language Tests, Equated Scores, Testing Programs, Comparative Analysis

Output-Based Instruction, Learning Styles and Vocabulary Learning in the EFL Context of Iran

Peer reviewed

Rastegar, Behnaz; Safari, Fatemeh – International Journal of Education and Literacy Studies, 2017

Language learners' productive role in teaching and learning processes has recently been the focus of attention. Therefore, this study aimed at investigating the effect of oral vs. written output-based instruction on English as a foreign language (EFL) learners' vocabulary learning with a focus on reflective vs. impulsive learning styles. To this…

Descriptors: Cognitive Style, English (Second Language), Second Language Learning, Foreign Countries

Comparing Yes/No Angoff and Bookmark Standard Setting Methods in the Context of English Assessment

Peer reviewed

Hsieh, Mingchuan – Language Assessment Quarterly, 2013

The Yes/No Angoff and Bookmark methods for setting standards on educational assessments are currently two of the most popular standard-setting methods. However, there is no research into the comparability of these two methods in the context of language assessment. This study compared results from the Yes/No Angoff and Bookmark methods as applied to…

Descriptors: Standard Setting (Scoring), Comparative Analysis, Language Tests, Multiple Choice Tests

Investigating the Justifiability of an Additional Test Use: An Application of Assessment Use Argument to an English as a Foreign Language Test

Wang, Huan – ProQuest LLC, 2010

Multiple uses of the same assessment may present challenges for both the design and use of an assessment. Little advice, however, has been given to assessment developers as to how to understand the phenomena of multiple assessment use and meet the challenges these present. Particularly problematic is the case in which an assessment is used for…

Descriptors: Test Use, Testing Programs, Program Effectiveness, Test Construction

Standard Error Estimation of 3PL IRT True Score Equating with an MCMC Method

Peer reviewed

Liu, Yuming; Schulz, E. Matthew; Yu, Lei – Journal of Educational and Behavioral Statistics, 2008

A Markov chain Monte Carlo (MCMC) method and a bootstrap method were compared in the estimation of standard errors of item response theory (IRT) true score equating. Three test form relationships were examined: parallel, tau-equivalent, and congeneric. Data were simulated based on Reading Comprehension and Vocabulary tests of the Iowa Tests of…

Descriptors: Reading Comprehension, Test Format, Markov Processes, Educational Testing
