Tenison, Caitlin; Ling, Guangming; McCulla, Laura – International Journal of Artificial Intelligence in Education, 2023
In this paper we use historic score-reporting records and test-taker metadata to inform data-driven recommendations that support international students in their choice of undergraduate institutions for study in the United States. We investigate the use of Structural Topic Modeling (STM) as a context-aware, probabilistic recommendation method that…
Descriptors: Foreign Students, Undergraduate Students, College Choice, Models
Haberman, Shelby J. – ETS Research Report Series, 2020
Best linear prediction (BLP) and penalized best linear prediction (PBLP) are techniques for combining sources of information to produce task scores, section scores, and composite test scores. The report examines issues to consider in operational implementation of BLP and PBLP in testing programs administered by ETS [Educational Testing Service].
Descriptors: Prediction, Scores, Tests, Testing Programs
Esfandiari, Mohammad Reza; Riasati, Mohammad Javad; Vaezian, Helia; Rahimi, Forough – Language Testing in Asia, 2018
Background: Validity is a notable concept in language testing which has concerned many researchers and scholars in the field due to its importance in the decision-making process. Test results always have consequences for test takers' lives, which emphasizes the need to ensure their validity. Detecting and delineating the…
Descriptors: Computer Assisted Testing, Test Validity, Language Tests, English (Second Language)
Liu, Ren; Huggins-Manley, Anne Corinne; Bulut, Okan – Educational and Psychological Measurement, 2018
Developing a diagnostic tool within the diagnostic measurement framework is the optimal approach to obtain multidimensional and classification-based feedback on examinees. However, end users may seek to obtain diagnostic feedback from existing item responses to assessments that have been designed under either the classical test theory or item…
Descriptors: Models, Item Response Theory, Psychometrics, Test Construction
Abdi Tabari, Mahmoud; Miller, Michol – Canadian Journal of Applied Linguistics / Revue canadienne de linguistique appliquée, 2021
Although several studies have explored the effects of task sequencing on second language (L2) production, there is no established set of criteria to sequence tasks for learners in L2 writing classrooms. This study examined the effect of simple-complex task sequencing manipulated along both resource-directing (± number of elements) and…
Descriptors: Language Fluency, Task Analysis, Second Language Learning, Second Language Instruction
Sawaki, Yasuyo; Sinharay, Sandip – Language Testing, 2018
The present study examined the reliability of the reading, listening, speaking, and writing section scores for the TOEFL iBT® test and their interrelationship in order to collect empirical evidence to support, respectively, the "generalization" inference and the "explanation" inference in the TOEFL iBT validity argument…
Descriptors: English (Second Language), Language Tests, Second Language Learning, Computer Assisted Testing
Getman, Edward Paul – Online Submission, 2020
Despite calls for engaging assessments targeting young language learners (YLLs) between 8 and 13 years old, what makes assessment tasks engaging and how such task characteristics affect measurement quality have not been well studied empirically. Furthermore, there has been a dearth of validity research about technology-enhanced speaking tests for…
Descriptors: English (Second Language), Language Tests, Second Language Learning, Learner Engagement
Yi, Yeon-Sook – Language Testing, 2017
The present study examines the relative importance of attributes within and across items by applying four cognitive diagnostic assessment models. The current study utilizes the function of the models that can indicate inter-attribute relationships that reflect the response behaviors of examinees to analyze scored test-taker responses to four forms…
Descriptors: Second Language Learning, Reading Comprehension, Listening Comprehension, Language Tests
Nizonkiza, Déogratias – Studies in Second Language Learning and Teaching, 2012
The present study explores the relationship between controlled productive knowledge of collocations and L2 proficiency, the role of frequency in controlled productive knowledge of collocations, and the quantifiability of controlled productive collocational knowledge growth alongside L2 proficiency and word frequency levels. A proficiency measure…
Descriptors: Word Frequency, Language Proficiency, Phrase Structure, Second Language Learning
Sawaki, Yasuyo; Sinharay, Sandip – ETS Research Report Series, 2013
This study investigates the value of reporting the reading, listening, speaking, and writing section scores for the TOEFL iBT® test, focusing on 4 related aspects of the psychometric quality of the TOEFL iBT section scores: reliability of the section scores, dimensionality of the test, presence of distinct score profiles, and the…
Descriptors: Scores, Computer Assisted Testing, Factor Analysis, Correlation
Way, Walter D.; Reese, Clyde M. – 1991
The use of two alternative item response theory (IRT) estimation models in the scaling and equating of the Test of English as a Foreign Language (TOEFL) was explored; and item scaling and test equating results based on these models were compared with results based on the three-parameter (3PL) model currently being used with the TOEFL. Models were…
Descriptors: Correlation, Equated Scores, Estimation (Mathematics), Goodness of Fit