ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	0
Since 2017 (last 10 years)	1
Since 2007 (last 20 years)	4

Descriptor

Language Proficiency	8
Test Reliability	8
Test Theory	8
Language Tests	7
Item Response Theory	4
English (Second Language)	3
Statistical Analysis	3
Comparative Analysis	2
Generalizability Theory	2
Higher Education	2
Item Analysis	2
Scores	2
Second Language Learning	2
Test Validity	2
Testing	2
Trend Analysis	2
Accountability	1
Accuracy	1
Achievement Gains	1
Annotated Bibliographies	1
College Students	1
Computer Assisted Testing	1
Correlation	1
Criterion Referenced Tests	1
Elementary School Students	1

Source

Annual Review of Applied…	1
Journal on Educational…	1
Language Testing	1
Online Submission	1
Turkish Online Journal of…	1

Author

Salmani-Nodoushan, Mohammad…	2
Bachman, Lyle F.	1
Douglas, Dan	1
Hua, Te-Fang	1
Longabach, Tanya	1
Moy, Raymond	1
Peyton, Vicki	1
Retnawati, Heri	1
Ross, Steven	1

Publication Type

Journal Articles	5
Reports - Research	5
Speeches/Meeting Papers	3
Reports - Descriptive	2
Information Analyses	1

Education Level

Elementary Education

Location

Indonesia

Showing all 8 results

A Comparison of Reliability and Precision of Subscore Reporting Methods for a State English Language Proficiency Assessment

Peer reviewed

Longabach, Tanya; Peyton, Vicki – Language Testing, 2018

K-12 English language proficiency tests that assess multiple content domains (e.g., listening, speaking, reading, writing) often have subsections based on these content domains; scores assigned to these subsections are commonly known as subscores. Testing programs face increasing customer demands for the reporting of subscores in addition to the…

Descriptors: Comparative Analysis, Test Reliability, Second Language Learning, Language Proficiency

The Comparison of Accuracy Scores on the Paper and Pencil Testing vs. Computer-Based Testing

Peer reviewed

Retnawati, Heri – Turkish Online Journal of Educational Technology - TOJET, 2015

This study aimed to compare the accuracy of Test of English Proficiency (TOEP) scores obtained from the paper and pencil test (PPT) versus the computer-based test (CBT). Using the participants' responses to the PPT documented from 2008-2010 and CBT TOEP data documented in 2013-2014 on sets 1A, 2A, and 3A for the Listening and…

Descriptors: Scores, Accuracy, Computer Assisted Testing, English (Second Language)

Measurement Theory in Language Testing: Past Traditions and Current Trends

Peer reviewed

Salmani-Nodoushan, Mohammad Ali – Journal on Educational Psychology, 2009

A good test is one that has at least three qualities: reliability, or the precision with which a test measures what it is supposed to measure; validity, i.e., if the test really measures what it is supposed to measure; and practicality, or if the test, no matter how sound theoretically, is practicable in reality. These are the sine qua non for any…

Descriptors: Generalizability Theory, Testing, Language Tests, Item Response Theory

Measurement Theory in Language Testing: Past Traditions and Current Trends

Salmani-Nodoushan, Mohammad Ali – Online Submission, 2009

A good test is one that has at least three qualities: reliability, or the precision with which a test measures what it is supposed to measure; validity, i.e., if the test really measures what it is supposed to measure; and practicality, or if the test, no matter how sound theoretically, is practicable in reality. These are the sine qua non for…

Descriptors: Generalizability Theory, Testing, Language Tests, Item Response Theory

Score Equating and Nominally Parallel Language Tests.

Moy, Raymond – 1982

Score equating requires that the forms to be equated are functionally parallel. That is, the two test forms should rank order examinees in a similar fashion. In language proficiency testing situations, this assumption is often put into doubt because of the numerous tests that have been proposed as measures of language proficiency and the…

Descriptors: Equated Scores, Language Proficiency, Language Tests, Latent Trait Theory

An Approach to Gain Score Dependability and Validity for Criterion-Referenced Language Tests.

Ross, Steven; Hua, Te-Fang – 1994

A general issue related to language program development involves the empirical rationalization of cut score decisions in criterion-referenced language tests. Cut score dependability focuses on the consistency of the decisions in repeated testing or the assessment of language learner performances. In this case, the issue is to determine the optimal…

Descriptors: Achievement Gains, Criterion Referenced Tests, English (Second Language), Higher Education

Developments in Language Testing.

Peer reviewed

Douglas, Dan – Annual Review of Applied Linguistics, 1995

Reviews recent theoretical, methodological, and analytical developments in language testing, focusing on more refined models of language ability, reliability and validity, performance testing, innovative test formats, and new applications of Item Response Theory and Generalizability Theory to test performance. An annotated bibliography discusses seven…

Descriptors: Annotated Bibliographies, Evaluation Methods, Language Proficiency, Language Tests

Investigating Variability in Tasks and Rater Judgments in a Performance Test of Foreign Language Speaking.

Bachman, Lyle F.; And Others – 1993

This paper outlines the development of a performance assessment measure of language speaking ability, the Language Ability Assessment System (LAAS), which is highly reliable and can be examined for reliability through modern measurement theories, such as generalizability theory (G-theory) and the many-facet Rasch theory. LAAS was developed to…

Descriptors: College Students, Higher Education, Interrater Reliability, Language Proficiency