Publication Date
| In 2026 | 0 |
| Since 2025 | 9 |
| Since 2022 (last 5 years) | 54 |
| Since 2017 (last 10 years) | 195 |
| Since 2007 (last 20 years) | 414 |
Author
| Attali, Yigal | 7 |
| Bridgeman, Brent | 6 |
| Petscher, Yaacov | 5 |
| Barkaoui, Khaled | 4 |
| Bennett, Randy Elliot | 3 |
| Breyer, F. Jay | 3 |
| Coniam, David | 3 |
| Kim, Young-Suk Grace | 3 |
| Lee, Yong-Won | 3 |
| Magliano, Joseph P. | 3 |
| Makransky, Guido | 3 |
Audience
| Researchers | 4 |
| Practitioners | 3 |
| Students | 2 |
| Teachers | 2 |
Location
| China | 19 |
| Germany | 17 |
| Japan | 12 |
| Australia | 10 |
| Canada | 10 |
| Turkey | 10 |
| United Kingdom | 9 |
| California | 7 |
| Netherlands | 7 |
| United States | 7 |
| Hong Kong | 6 |
Laws, Policies, & Programs
| No Child Left Behind Act 2001 | 1 |
Ramineni, Chaitanya; Trapani, Catherine S.; Williamson, David M.; Davey, Tim; Bridgeman, Brent – ETS Research Report Series, 2012
Scoring models for the e-rater® system were built and evaluated for the TOEFL® exam's independent and integrated writing prompts. Prompt-specific and generic scoring models were built, and evaluation statistics, such as weighted kappas, Pearson correlations, standardized differences in mean scores, and correlations with…
Descriptors: Scoring, Prompting, Evaluators, Computer Software
Cho, Yeonsuk; Bridgeman, Brent – Language Testing, 2012
This study examined the relationship between scores on the TOEFL Internet-Based Test (TOEFL iBT®) and academic performance in higher education, defined here in terms of grade point average (GPA). The academic records for 2594 undergraduate and graduate students were collected from 10 universities in the United States. The data consisted of…
Descriptors: Evidence, Academic Records, Graduate Students, Universities
Wilson, Damian Vergara – Heritage Language Journal, 2012
This paper illustrates a method of item analysis used to identify discriminating multiple-choice items in placement data. The data come from two rounds of pilots given to both Spanish as a Heritage Language (SHL) students and Spanish as a Second Language (SSL) students. In the first round, 104 items were administered to 507 students. After discarding poor items, the second round…
Descriptors: Heritage Education, Graphs, Item Analysis, Correlation
Bridgeman, Brent; Powers, Donald; Stone, Elizabeth; Mollaun, Pamela – Language Testing, 2012
Scores assigned by trained raters and by an automated scoring system (SpeechRater™) on the speaking section of the TOEFL iBT™ were validated against a communicative competence criterion. Specifically, a sample of 555 undergraduate students listened to speech samples from 184 examinees who took the Test of English as a Foreign Language…
Descriptors: Undergraduate Students, Speech Communication, Rating Scales, Scoring
Biber, Douglas; Gray, Bethany – ETS Research Report Series, 2013
One of the major innovations of the TOEFL iBT® test is the incorporation of integrated tasks complementing the independent tasks to which examinees respond. In addition, examinees must produce discourse in both modes (speech and writing). The validity argument for the TOEFL iBT includes the claim that examinees vary their discourse in…
Descriptors: Discourse Analysis, English (Second Language), Second Language Learning, Language Tests
Zhang, Mo; Breyer, F. Jay; Lorenz, Florian – ETS Research Report Series, 2013
In this research, we investigated the suitability of implementing e-rater® automated essay scoring in a high-stakes large-scale English language testing program. We examined the effectiveness of generic scoring and 2 variants of prompt-based scoring approaches. Effectiveness was evaluated on a number of dimensions, including agreement…
Descriptors: Computer Assisted Testing, Computer Software, Scoring, Language Tests
Atkins, Andrew – Research-publishing.net, 2013
This paper provides a discussion of the results of a cross-sectional examination of linguistic variables that are predicted to influence L2 reading fluency. This study is part of a larger, longitudinal mixed-methods study into reading fluency development using online Timed Reading (TR) with participants from a mid-to-high level private university…
Descriptors: Reading Fluency, Web Sites, Case Studies, Second Language Learning
Verbout, Mary F. – ProQuest LLC, 2013
Multiple-choice tests of punctuation and usage are used throughout the United States to assess the writing skills of new community college students in order to place them in either a basic writing course or first-year composition. To determine whether using the COMPASS Writing Test (CWT) is a valid placement at a community college, student test…
Descriptors: Predictive Validity, Multiple Choice Tests, Student Placement, Community Colleges
Haberman, Shelby J. – Educational Testing Service, 2011
Alternative approaches are discussed for use of e-rater® to score the TOEFL iBT® Writing test. These approaches involve alternate criteria. In the 1st approach, the predicted variable is the expected rater score of the examinee's 2 essays. In the 2nd approach, the predicted variable is the expected rater score of 2 essay responses by the…
Descriptors: Writing Tests, Scoring, Essays, Language Tests
Weigle, Sara Cushing – Language Testing, 2010
Automated scoring has the potential to dramatically reduce the time and costs associated with the assessment of complex skills such as writing, but its use must be validated against a variety of criteria for it to be accepted by test users and stakeholders. This study approaches validity by comparing human and automated scores on responses to…
Descriptors: Correlation, Validity, Writing Ability, English (Second Language)
Tahmasebi, Soheila; Rahimi, Ali – Teaching English with Technology, 2013
Due to the unquestionable role of technology in language classes, it might be necessary to use computers in assessing language knowledge. This study aimed to examine how computers may be used to assess the language ability of English for Specific Purposes (ESP) students. Sixty computer-major university students at Abadan University are the…
Descriptors: Computer Assisted Testing, Language Tests, Language Skills, Second Language Instruction
Liao, Chen-Huei; Kuo, Bor-Chen; Pai, Kai-Chih – Turkish Online Journal of Educational Technology - TOJET, 2012
Automated scoring by means of Latent Semantic Analysis (LSA) has been introduced lately to improve the traditional human scoring system. The purposes of the present study were to develop an LSA-based assessment system to evaluate children's Chinese sentence construction skills and to examine the effectiveness of LSA-based automated scoring function…
Descriptors: Foreign Countries, Program Effectiveness, Scoring, Personality
Lau, Sing; Cheung, Ping Chung – Thinking Skills and Creativity, 2010
With a sample of Grade 4 Chinese students, the present study examined whether the electronic version was comparable to the paper-and-pencil version of the Wallach-Kogan Creativity Tests (WKCT). It was found that the two versions generated similar patterns of reliability coefficients and inter-correlation coefficients for the eight creativity…
Descriptors: Foreign Countries, Creativity, Grade 4, Test Reliability
Sawaki, Yasuyo; Sinharay, Sandip – ETS Research Report Series, 2013
This study investigates the value of reporting the reading, listening, speaking, and writing section scores for the TOEFL iBT® test, focusing on 4 related aspects of the psychometric quality of the TOEFL iBT section scores: reliability of the section scores, dimensionality of the test, presence of distinct score profiles, and the…
Descriptors: Scores, Computer Assisted Testing, Factor Analysis, Correlation
Oberle, Eva; Schonert-Reichl, Kimberly A.; Lawlor, Molly Stewart; Thomson, Kimberly C. – Journal of Early Adolescence, 2012
This study examined the relationship between the executive control process of inhibition and self-reported dispositional mindfulness, controlling for gender, grade, and cortisol levels in 99 (43% female) fourth- and fifth-graders (M = 10.23 years, SD = 0.53). Students completed a measure of mindful attention awareness and a computerized…
Descriptors: Intervention, Early Adolescents, Inhibition, Cognitive Development

