José Manuel Arencibia Alemán; Astrid Marie Jorde Sandsør; Henrik Daae Zachrisson; Sigrid Blömeke – Assessment in Education: Principles, Policy & Practice, 2024
Modest correlations between teacher-assigned grades and external assessments of academic achievement (r = 0.40-0.60) have led many educational stakeholders to deem grades subjective and unreliable. However, theoretical and methodological challenges, such as construct misalignment, data unavailability and sample unrepresentativeness, limit the…
Descriptors: Grades (Scholastic), Grading, Achievement Tests, Test Validity
Haerim Hwang; Hyunwoo Kim – Language Testing, 2024
Given the lack of computational tools available for assessing second language (L2) production in Korean, this study introduces a novel automated tool called the Korean Syntactic Complexity Analyzer (KOSCA) for measuring syntactic complexity in L2 Korean production. As an open-source graphical user interface (GUI) developed in Python, KOSCA provides…
Descriptors: Korean, Natural Language Processing, Syntax, Computer Graphics
Romig, John Elwood; Miller, Alexandra A.; Therrien, William J.; Lloyd, John W. – Exceptionality, 2021
Researchers studying curriculum-based measurement of written expression have used a variety of writing prompt types and durations when establishing criterion validity of these tools. The purpose of this study was to determine through meta-analytic procedures whether any prompt type or duration was superior to others in terms of criterion validity.…
Descriptors: Curriculum Based Assessment, Writing Evaluation, Prompting, Meta Analysis
Pablo Robles-García; Stuart McLean; Jeffrey Stewart; Ji-young Shin; Claudia Helena Sánchez-Gutiérrez – Language Assessment Quarterly, 2024
Recent literature in the field of L2 vocabulary assessment has advocated for the development of written receptive vocabulary tests such as Vocabulary Levels Tests (VLTs) that use: (a) meaning-recall item formats, (b) a minimum of 40 item counts per 1,000-frequency band to improve level estimates, and (c) lemmas (not word-families) as the lexical…
Descriptors: Spanish, Test Validity, Test Construction, Vocabulary Development
Mekonnen, Abebayehu Messele – Educational Psychology in Practice, 2023
This exploratory study aimed to develop a dyslexia assessment tool in the Amharic language and to collect initial reliability and validity data on the tool, which is designed to identify dyslexia in Grade 3. The developed battery consists of 10 tests. Data were collected from 121 Amharic-speaking children, aged 9-12 years. Evidence of construct validity…
Descriptors: Dyslexia, Screening Tests, Identification, Grade 3
Kyle, Kristopher; Eguchi, Masaki; Choe, Ann Tai; LaFlair, Geoff – Language Testing, 2022
In the realm of language proficiency assessments, the domain description inference and the extrapolation inference are key components of a validity argument. Biber et al.'s description of the lexicogrammatical features of the spoken and written registers in the T2K-SWAL corpus has served as support for the TOEFL iBT test's domain description and…
Descriptors: Language Variation, Written Language, Speech Communication, Inferences
Romig, John Elwood; Therrien, William J.; Lloyd, John W. – Journal of Special Education, 2017
We used meta-analysis to examine the criterion validity of four scoring procedures used in curriculum-based measurement of written language. A total of 22 articles representing 21 studies (N = 21) met the inclusion criteria. Results indicated that two scoring procedures, correct word sequences and correct minus incorrect sequences, have acceptable…
Descriptors: Meta Analysis, Curriculum Based Assessment, Written Language, Scoring Formulas
Wilkins, Jesse L. M.; Norton, Anderson; Boyce, Steven J. – Mathematics Educator, 2013
Previous research has documented schemes and operations that undergird students' understanding of fractions. This prior research was based, in large part, on small-group teaching experiments. However, written assessments are needed in order for teachers and researchers to assess students' ways of operating on a whole-class scale. In this study,…
Descriptors: Test Validity, Mathematics Instruction, Mathematical Concepts, Evaluation Methods
Ahangari, Saeideh; Barghi, Ali Hamed – Language Testing in Asia, 2012
Almost no language test is void of grammar items, and the reason is probably the assumption that there exists a positive correlation between examinees' grammar knowledge and the actual demonstrable level of accuracy in communication. Meanwhile, examples abound where many examinees do relatively well on grammar knowledge tests despite failing to…
Descriptors: English (Second Language), Language Tests, Second Language Learning, Grammar
Biber, Douglas; Gray, Bethany – ETS Research Report Series, 2013
One of the major innovations of the TOEFL iBT® test is the incorporation of integrated tasks complementing the independent tasks to which examinees respond. In addition, examinees must produce discourse in both modes (speech and writing). The validity argument for the TOEFL iBT includes the claim that examinees vary their discourse in…
Descriptors: Discourse Analysis, English (Second Language), Second Language Learning, Language Tests
McMaster, Kristen L.; Du, Xiaoqing; Petursdottir, Anna-Lind – Journal of Learning Disabilities, 2009
The purpose of the two studies reported in this article was to examine technical features of curriculum-based measures for beginning writers. In Study 1, 50 first graders responded to word copying, sentence copying, and story prompts. In Study 2, 50 additional first graders responded to letter, picture-word, picture-theme, and photo prompts. In…
Descriptors: Curriculum Based Assessment, Grade 1, Writing Tests, Cues

Burns, Matthew K.; Symington, Todd – Assessment for Effective Intervention, 2003
A study compared the Spontaneous Writing Quotient (SWQ) of the Test of Written Language-3 to teacher progress ratings of 147 students (grades 3-5) in the local writing curriculum. Corrected coefficients ranged from .39 to .48, offering only low to moderate support for the criterion-related validity relative to the local curriculum. (Contains…
Descriptors: Elementary Education, Evaluation Methods, Learning Disabilities, Student Evaluation
Marzano, Robert J. – 1975
The purpose of this study was to examine the reliability of the analytical method of grading essays in relation to the holistic method. It was hypothesized that the use of the analytic method to rate college composition papers produces high rater reliability at the expense of biasing the raters and thus lowering the validity of the grades. Six…
Descriptors: Educational Research, English Instruction, Evaluation Methods, Higher Education

Buck, Gary – ELT Journal, 1989
Examination of the reliability and validity of paper-and-pencil pronunciation tests of English as a second language in Osaka (Japan) showed very low reliability. Correlations with more direct measures of pronunciation showed very low validity of written pronunciation tests. Sample tests are appended. (Author/CB)
Descriptors: English (Second Language), Foreign Countries, Higher Education, Language Tests

Heefner, Deanna L.; Shaw, Pamela Carson – Volta Review, 1996
A study of 943 personal narratives of students with deafness evaluated the effectiveness of the Six-Trait Analytical Scale in assessing students' writing. The scale was found to be reliable and valid in evaluating written narratives of students with and without hearing losses and is recommended as a valuable diagnostic instrument and instructional…
Descriptors: Deafness, Elementary Secondary Education, Evaluation Methods, Personal Narratives