ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	2
Since 2016 (last 10 years)	3
Since 2006 (last 20 years)	6

Descriptor

Language Tests	8
Scores	8
Test Theory	8
Comparative Analysis	5
English (Second Language)	5
Language Proficiency	5
Foreign Countries	4
Item Response Theory	3
Second Language Learning	3
Correlation	2
High Stakes Tests	2
Student Evaluation	2
Test Reliability	2
Testing	2
Accuracy	1
Achievement Gains	1
Achievement Tests	1
Computer Assisted Testing	1
Criterion Referenced Tests	1
Diagnostic Tests	1
Difficulty Level	1
Error of Measurement	1
Ethnicity	1
Evaluation Methods	1
Examiners	1
More ▼

Source

Language Testing	2
International Journal of…	1
International Online Journal…	1
Language, Speech, and Hearing…	1
Pegem Journal of Education…	1
Turkish Online Journal of…	1

Author

Polat, Murat	2
Brooks, Lindsay	1
Haberman, Shelby J.	1
Hua, Te-Fang	1
Papageorgiou, Spiros	1
Powers, Donald	1
Retnawati, Heri	1
Ross, Steven	1
Schedl, Mary	1
Sinharay, Sandip	1
Thomas, Jo Anne	1
Toraman, Cetin	1
Turhan, Nihan S.	1
More ▼

Publication Type

Journal Articles	7
Reports - Research	7
Reports - Evaluative	1
Speeches/Meeting Papers	1

Education Level

Higher Education	3
Postsecondary Education	2

Audience

Practitioners

Location

Canada	1
Europe	1
Indonesia	1
Turkey	1

Laws, Policies, & Programs

Assessments and Surveys

Test of English as a Foreign…

What Works Clearinghouse Rating

Showing all 8 results Save | Export

Comparison of Classical Test Theory vs. Multi-Facet Rasch Theory

Peer reviewed
PDF on ERIC

Download full text

Polat, Murat; Turhan, Nihan S.; Toraman, Cetin – Pegem Journal of Education and Instruction, 2022

Testing English writing skills could be multi-dimensional; thus, the study aimed to compare students' writing scores calculated according to Classical Test Theory (CTT) and Multi-Facet Rasch Model (MFRM). The research was carried out in 2019 with 100 university students studying at a foreign language preparatory class and four experienced…

Descriptors: Comparative Analysis, Test Theory, Item Response Theory, Student Evaluation

Comparison of Performance Measures Obtained from Foreign Language Tests According to Item Response Theory vs Classical Test Theory

Peer reviewed
PDF on ERIC

Download full text

Polat, Murat – International Online Journal of Education and Teaching, 2022

Foreign language testing is a multi-dimensional phenomenon and obtaining objective and error-free scores on learners' language skills is often problematic. While assessing foreign language performance on high-stakes tests, using different testing approaches including Classical Test Theory (CTT), Generalizability Theory (GT) and/or Item Response…

Descriptors: Second Language Learning, Second Language Instruction, Item Response Theory, Language Tests

Facilitating the Interpretation of English Language Proficiency Scores: Combining Scale Anchoring and Test Score Mapping Methodologies

Peer reviewed

Direct link

Powers, Donald; Schedl, Mary; Papageorgiou, Spiros – Language Testing, 2017

The aim of this study was to develop, for the benefit of both test takers and test score users, enhanced "TOEFL ITP"® test score reports that go beyond the simple numerical scores that are currently reported. To do so, we applied traditional scale anchoring (proficiency scaling) to item difficulty data in order to develop performance…

Descriptors: English (Second Language), Second Language Learning, Language Proficiency, Scores

The Comparison of Accuracy Scores on the Paper and Pencil Testing vs. Computer-Based Testing

Peer reviewed
PDF on ERIC

Download full text

Retnawati, Heri – Turkish Online Journal of Educational Technology - TOJET, 2015

This study aimed to compare the accuracy of the test scores as results of Test of English Proficiency (TOEP) based on paper and pencil test (PPT) versus computer-based test (CBT). Using the participants' responses to the PPT documented from 2008-2010 and data of CBT TOEP documented in 2013-2014 on the sets of 1A, 2A, and 3A for the Listening and…

Descriptors: Scores, Accuracy, Computer Assisted Testing, English (Second Language)

An Empirical Investigation of Population Invariance in the Value of Subscores

Peer reviewed

Direct link

Sinharay, Sandip; Haberman, Shelby J. – International Journal of Testing, 2014

Recently there has been an increasing level of interest in subtest scores, or subscores, for their potential diagnostic value. Haberman (2008) suggested a method to determine if a subscore has added value over the total score. Researchers have often been interested in the performance of subgroups--for example, those based on gender or…

Descriptors: Scores, Achievement Tests, Language Tests, English (Second Language)

Interacting in Pairs in a Test of Oral Proficiency: Co-Constructing a Better Performance

Peer reviewed

Direct link

Brooks, Lindsay – Language Testing, 2009

This study, framed within sociocultural theory, examines the interaction of adult ESL test-takers in two tests of oral proficiency: one in which they interacted with an examiner (the individual format) and one in which they interacted with another student (the paired format). The data for the eight pairs in this study were drawn from a larger…

Descriptors: Testing, Rating Scales, Program Effectiveness, Interaction

An Approach to Gain Score Dependability and Validity for Criterion-Referenced Language Tests.

Download full text

Ross, Steven; Hua, Te-Fang – 1994

A general issue related to language program development involves the empirical rationalization of cut score decisions in criterion-referenced language tests. Cut score dependability focuses on the consistency of the decisions in repeated testing or the assessment of language learner performances. In this case, the issue is to determine the optimal…

Descriptors: Achievement Gains, Criterion Referenced Tests, English (Second Language), Higher Education

A Standardized Method for Collecting and Analyzing Language Samples of Preschool and Primary Children in the Public Schools.

Peer reviewed

Thomas, Jo Anne – Language, Speech, and Hearing Services in Schools, 1989

To streamline the process of collecting spontaneous language samples, a modified format of the "Multilevel Informal Language Inventory" was administered to 150 public school children, aged four-six. Administration, scoring, and analysis required approximately 35 minutes per child. Scores obtained were used to compute normative data on a…

Descriptors: Diagnostic Tests, Evaluation Methods, Handicap Identification, Language Handicaps