ERIC - Search Results

Showing 1 to 15 of 23 results

Teacher-Assigned Grades and External Exams: Sources of Discrepancy

Peer reviewed

José Manuel Arencibia Alemán; Astrid Marie Jorde Sandsør; Henrik Daae Zachrisson; Sigrid Blömeke – Assessment in Education: Principles, Policy & Practice, 2024

Modest correlations between teacher-assigned grades and external assessments of academic achievement (r = 0.40-0.60) have led many educational stakeholders to deem grades subjective and unreliable. However, theoretical and methodological challenges, such as construct misalignment, data unavailability and sample unrepresentativeness, limit the…

Descriptors: Grades (Scholastic), Grading, Achievement Tests, Test Validity

Korean Syntactic Complexity Analyzer (KOSCA): An NLP Application for the Analysis of Syntactic Complexity in Second Language Production

Peer reviewed

Haerim Hwang; Hyunwoo Kim – Language Testing, 2024

Given the lack of computational tools available for assessing second language (L2) production in Korean, this study introduces a novel automated tool called the Korean Syntactic Complexity Analyzer (KOSCA) for measuring syntactic complexity in L2 Korean production. As an open-source graphic user interface (GUI) developed in Python, KOSCA provides…

Descriptors: Korean, Natural Language Processing, Syntax, Computer Graphics

Meta-Analysis of Prompt and Duration for Curriculum-Based Measurement of Written Language

Peer reviewed

Romig, John Elwood; Miller, Alexandra A.; Therrien, William J.; Lloyd, John W. – Exceptionality, 2021

Researchers studying curriculum-based measurement of written expression have used a variety of writing prompt types and durations when establishing criterion validity of these tools. The purpose of this study was to determine through meta-analytic procedures whether any prompt type or duration was superior to others in terms of criterion validity.…

Descriptors: Curriculum Based Assessment, Writing Evaluation, Prompting, Meta Analysis

The Development and Initial Validation of O-WSVLT, a Meaning-Recall Online L2 Spanish Vocabulary Levels Test

Peer reviewed

Pablo Robles-García; Stuart McLean; Jeffrey Stewart; Ji-young Shin; Claudia Helena Sánchez-Gutiérrez – Language Assessment Quarterly, 2024

Recent literature in the field of L2 vocabulary assessment has advocated for the development of written receptive vocabulary tests such as Vocabulary Levels Tests (VLTs) that use: (a) meaning-recall item formats, (b) a minimum of 40 item counts per 1,000-frequency band to improve level estimates, and (c) lemmas (not word-families) as the lexical…

Descriptors: Spanish, Test Validity, Test Construction, Vocabulary Development

Dyslexia Assessment in Amharic

Peer reviewed

Mekonnen, Abebayehu Messele – Educational Psychology in Practice, 2023

This exploratory study aimed to develop a dyslexia assessment tool in the Amharic language and to collect initial reliability and validity data on the tool, which was designed to identify dyslexia in Grade 3. The developed battery consists of 10 tests. Data were collected from 121 Amharic-speaking children, aged 9-12 years. Evidence of construct validity…

Descriptors: Dyslexia, Screening Tests, Identification, Grade 3

Register Variation in Spoken and Written Language Use across Technology-Mediated and Non-Technology-Mediated Learning Environments

Peer reviewed

Kyle, Kristopher; Eguchi, Masaki; Choe, Ann Tai; LaFlair, Geoff – Language Testing, 2022

In the realm of language proficiency assessments, the domain description inference and the extrapolation inference are key components of a validity argument. Biber et al.'s description of the lexicogrammatical features of the spoken and written registers in the T2K-SWAL corpus has served as support for the TOEFL iBT test's domain description and…

Descriptors: Language Variation, Written Language, Speech Communication, Inferences

Meta-Analysis of Criterion Validity for Curriculum-Based Measurement in Written Language

Peer reviewed

Romig, John Elwood; Therrien, William J.; Lloyd, John W. – Journal of Special Education, 2017

We used meta-analysis to examine the criterion validity of four scoring procedures used in curriculum-based measurement of written language. A total of 22 articles representing 21 studies (N = 21) met the inclusion criteria. Results indicated that two scoring procedures, correct word sequences and correct minus incorrect sequences, have acceptable…

Descriptors: Meta Analysis, Curriculum Based Assessment, Written Language, Scoring Formulas

Validating a Written Instrument for Assessing Students' Fractions Schemes and Operations

Peer reviewed

Wilkins, Jesse L. M.; Norton, Anderson; Boyce, Steven J. – Mathematics Educator, 2013

Previous research has documented schemes and operations that undergird students' understanding of fractions. This prior research was based, in large part, on small-group teaching experiments. However, written assessments are needed in order for teachers and researchers to assess students' ways of operating on a whole-class scale. In this study,…

Descriptors: Test Validity, Mathematics Instruction, Mathematical Concepts, Evaluation Methods

Consistency of Measured Accuracy in Grammar Knowledge Tests and Writing: TOEFL PBT

Peer reviewed

Ahangari, Saeideh; Barghi, Ali Hamed – Language Testing in Asia, 2012

Almost no language test is void of grammar items, and the reason is probably the assumption that there exists a positive correlation between examinees' grammar knowledge and the actual demonstrable level of accuracy in communication. Meanwhile, examples abound where many examinees do relatively well on grammar knowledge tests despite failing to…

Descriptors: English (Second Language), Language Tests, Second Language Learning, Grammar

Discourse Characteristics of Writing and Speaking Task Types on the "TOEFL iBT"® Test: A Lexico-Grammatical Analysis. "TOEFL iBT"® Research Report. TOEFL iBT-19. Research Report. RR-13-04

Peer reviewed

Biber, Douglas; Gray, Bethany – ETS Research Report Series, 2013

One of the major innovations of the "TOEFL iBT"® test is the incorporation of integrated tasks complementing the independent tasks to which examinees respond. In addition, examinees must produce discourse in both modes (speech and writing). The validity argument for the TOEFL iBT includes the claim that examinees vary their discourse in…

Descriptors: Discourse Analysis, English (Second Language), Second Language Learning, Language Tests

Technical Features of Curriculum-Based Measures for Beginning Writers

Peer reviewed

McMaster, Kristen L.; Du, Xiaoqing; Petursdottir, Anna-Lind – Journal of Learning Disabilities, 2009

The purpose of the two studies reported in this article was to examine technical features of curriculum-based measures for beginning writers. In Study 1, 50 first graders responded to word copying, sentence copying, and story prompts. In Study 2, 50 additional first graders responded to letter, picture-word, picture-theme, and photo prompts. In…

Descriptors: Curriculum Based Assessment, Grade 1, Writing Tests, Cues

A Comparison of the Spontaneous Writing Quotient of the Test of Written Language (3rd ed.) and Teacher Ratings of Writing Progress.

Peer reviewed

Burns, Matthew K.; Symington, Todd – Assessment for Effective Intervention, 2003

A study compared the Spontaneous Writing Quotient (SWQ) of the Test of Written Language-3 to teacher progress ratings of 147 students (grades 3-5) in the local writing curriculum. Corrected coefficients ranged from .39 to .48, offering only low to moderate support for the criterion-related validity relative to the local curriculum. (Contains…

Descriptors: Elementary Education, Evaluation Methods, Learning Disabilities, Student Evaluation

On the Validity of Analytic Ratings.

Marzano, Robert J. – 1975

The purpose of this study was to examine the reliability of the analytical method of grading essays in relation to the holistic method. It was hypothesized that the use of the analytic method to rate college composition papers produces high rater reliability at the expense of biasing the raters and thus lowering the validity of the grades. Six…

Descriptors: Educational Research, English Instruction, Evaluation Methods, Higher Education

Written Tests of Pronunciation: Do They Work?

Peer reviewed

Buck, Gary – ELT Journal, 1989

Examination of the reliability and validity of paper-and-pencil pronunciation tests of English as a second language in Osaka (Japan) showed very low reliability. Correlations with more direct measures of pronunciation showed very low validity of written pronunciation tests. Sample tests are appended. (Author/CB)

Descriptors: English (Second Language), Foreign Countries, Higher Education, Language Tests

Assessing the Written Narratives of Deaf Students Using the Six-Trait Analytical Scale.

Peer reviewed

Heefner, Deanna L.; Shaw, Pamela Carson – Volta Review, 1996

A study of 943 personal narratives of students with deafness evaluated the effectiveness of the Six-Trait Analytical Scale in assessing students' writing. The scale was found to be reliable and valid in evaluating written narratives of students with and without hearing losses and is recommended as a valuable diagnostic instrument and instructional…

Descriptors: Deafness, Elementary Secondary Education, Evaluation Methods, Personal Narratives
