ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	1
Since 2016 (last 10 years)	3
Since 2006 (last 20 years)	7

Descriptor

Item Response Theory	7
Language Proficiency	7
Test Theory	7
Language Tests	6
Foreign Countries	4
Test Reliability	4
Comparative Analysis	3
English (Second Language)	3
Item Analysis	3
Second Language Learning	3
Generalizability Theory	2
High Stakes Tests	2
Multiple Choice Tests	2
Scores	2
Second Language Instruction	2
Statistical Analysis	2
Test Validity	2
Testing	2
Trend Analysis	2
Accountability	1
Accuracy	1
Bilingualism	1
College Entrance Examinations	1
College Students	1
Computer Assisted Testing	1
More ▼

Source

International Online Journal…	1
Journal on Educational…	1
Language Testing	1
Online Submission	1
ProQuest LLC	1
RELC Journal: A Journal of…	1
Turkish Online Journal of…	1

Author

Salmani-Nodoushan, Mohammad…	2
Ellis, David P.	1
Ji, Xiaoli	1
Longabach, Tanya	1
Peyton, Vicki	1
Polat, Murat	1
Retnawati, Heri	1
Zhao, Ping	1

Publication Type

Journal Articles	6
Reports - Research	4
Reports - Descriptive	2
Dissertations/Theses -…	1

Education Level

Higher Education	3
Postsecondary Education	3
Elementary Education	1

Audience

Location

China	1
Indonesia	1
Japan	1
Turkey	1

Laws, Policies, & Programs

Assessments and Surveys

What Works Clearinghouse Rating

Showing all 7 results Save | Export

Comparison of Performance Measures Obtained from Foreign Language Tests According to Item Response Theory vs Classical Test Theory

Peer reviewed
PDF on ERIC

Download full text

Polat, Murat – International Online Journal of Education and Teaching, 2022

Foreign language testing is a multi-dimensional phenomenon and obtaining objective and error-free scores on learners' language skills is often problematic. While assessing foreign language performance on high-stakes tests, using different testing approaches including Classical Test Theory (CTT), Generalizability Theory (GT) and/or Item Response…

Descriptors: Second Language Learning, Second Language Instruction, Item Response Theory, Language Tests

A Comparison of Reliability and Precision of Subscore Reporting Methods for a State English Language Proficiency Assessment

Peer reviewed

Direct link

Longabach, Tanya; Peyton, Vicki – Language Testing, 2018

K-12 English language proficiency tests that assess multiple content domains (e.g., listening, speaking, reading, writing) often have subsections based on these content domains; scores assigned to these subsections are commonly known as subscores. Testing programs face increasing customer demands for the reporting of subscores in addition to the…

Descriptors: Comparative Analysis, Test Reliability, Second Language Learning, Language Proficiency

Validation of the Mandarin Version of the Vocabulary Size Test

Peer reviewed

Direct link

Zhao, Ping; Ji, Xiaoli – RELC Journal: A Journal of Language Teaching and Research, 2018

This article provides preliminary validity evidence for the shorter Mandarin version of the Vocabulary Size Test (VST) under the content aspect, technical quality, substantive and generalizability aspect of Messick's (1995) construct validity framework. The shorter version with 177 Chinese university students in three proficiency levels indicates…

Descriptors: Language Tests, Test Validity, Mandarin Chinese, Second Language Learning

The Comparison of Accuracy Scores on the Paper and Pencil Testing vs. Computer-Based Testing

Peer reviewed
PDF on ERIC

Download full text

Retnawati, Heri – Turkish Online Journal of Educational Technology - TOJET, 2015

This study aimed to compare the accuracy of the test scores as results of Test of English Proficiency (TOEP) based on paper and pencil test (PPT) versus computer-based test (CBT). Using the participants' responses to the PPT documented from 2008-2010 and data of CBT TOEP documented in 2013-2014 on the sets of 1A, 2A, and 3A for the Listening and…

Descriptors: Scores, Accuracy, Computer Assisted Testing, English (Second Language)

Item-Analysis Methods and Their Implications for the ILTA Guidelines for Practice: A Comparison of the Effects of Classical Test Theory and Item Response Theory Models on the Outcome of a High-Stakes Entrance Exam

Direct link

Ellis, David P. – ProQuest LLC, 2011

The current version of the International Language Testing Association (ILTA) Guidelines for Practice requires language testers to pretest items before including them on an exam, or when pretesting is not possible, to conduct post-hoc item analysis to ensure any malfunctioning items are excluded from scoring. However, the guidelines are devoid of…

Descriptors: Item Response Theory, High Stakes Tests, College Entrance Examinations, Item Analysis

Measurement Theory in Language Testing: Past Traditions and Current Trends

Peer reviewed
PDF on ERIC

Download full text

Salmani-Nodoushan, Mohammad Ali – Journal on Educational Psychology, 2009

A good test is one that has at least three qualities: reliability, or the precision with which a test measures what it is supposed to measure; validity, i.e., if the test really measures what it is supposed to measure, and practicality, or if the test, no matter how sound theoretically, is practicable in reality. These are the sine qua non for any…

Descriptors: Generalizability Theory, Testing, Language Tests, Item Response Theory

Measurement Theory in Language Testing: Past Traditions and Current Trends

Download full text

Salmani-Nodoushan, Mohammad Ali – Online Submission, 2009

A good test is one that has at least three qualities: reliability, or the precision with which a test measures what it is supposed to measure; validity, i.e., if the test really measures what it is supposed to measure; and practicality, or if the test, no matter how sound theoretically, is practicable in reality. These are the sine qua non for…

Descriptors: Generalizability Theory, Testing, Language Tests, Item Response Theory