ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	1
Since 2017 (last 10 years)	7
Since 2007 (last 20 years)	20

Descriptor

Statistical Analysis	25
Writing Tests	25
Test Reliability	11
English (Second Language)	9
Correlation	8
Foreign Countries	8
Interrater Reliability	8
Reliability	8
Scores	8
Test Validity	7
College Entrance Examinations	5
Comparative Analysis	5
Scoring	5
Writing Skills	5
Academic Achievement	4
Elementary School Students	4
Essays	4
Language Tests	4
Mathematics Tests	4
Reading Tests	4
Scoring Rubrics	4
Second Language Learning	4
Writing Evaluation	4
College Students	3
Computer Assisted Testing	3
More ▼

Publication Type

Reports - Research	22
Journal Articles	18
Tests/Questionnaires	3
Books	1
Collected Works - General	1
Dissertations/Theses -…	1
Numerical/Quantitative Data	1
Reports - Evaluative	1

Education Level

Elementary Education	6
Higher Education	6
Secondary Education	6
High Schools	4
Postsecondary Education	4
Early Childhood Education	3
Intermediate Grades	3
Primary Education	3
Elementary Secondary Education	2
Grade 10	2
Grade 11	2
Grade 3	2
Grade 4	2
Middle Schools	2
Grade 1	1
Grade 2	1
Grade 5	1
Grade 6	1
Grade 7	1
Grade 9	1
Two Year Colleges	1
More ▼

Audience

Location

Australia	1
California	1
Canada	1
Egypt	1
France	1
Georgia	1
India	1
Italy	1
Lebanon	1
New York	1
Norway	1
South Korea	1
Utah	1
More ▼

Laws, Policies, & Programs

Assessments and Surveys

SAT (College Admission Test)	4
ACT Assessment	1
National Merit Scholarship…	1
Preliminary Scholastic…	1
Test of English as a Foreign…	1
Test of English for…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 25 results Save | Export

A Nonparametric Procedure for Exploring Differences in Rating Quality across Test-Taker Subgroups in Rater-Mediated Writing Assessments

Peer reviewed

Direct link

Wind, Stefanie A. – Language Testing, 2019

Differences in rater judgments that are systematically related to construct-irrelevant characteristics threaten the fairness of rater-mediated writing assessments. Accordingly, it is essential that researchers and practitioners examine the degree to which the psychometric quality of rater judgments is comparable across test-taker subgroups.…

Descriptors: Nonparametric Statistics, Interrater Reliability, Differences, Writing Tests

(In)Stability of Test Scores

Peer reviewed
PDF on ERIC

Download full text

Merchant, Stefan; Rich, Jessica; Klinger, Don A. – Canadian Journal of Educational Administration and Policy, 2022

Both school and district administrators use the results of standardized, large-scale tests to inform decisions about the need for, or success of, educational programs and interventions. However, test results at the school level are subject to random fluctuations due to changes in cohort, test items, and other factors outside of the school's…

Descriptors: Standardized Tests, Foreign Countries, Generalizability Theory, Scores

Exploring Incomplete Rating Designs with Mokken Scale Analysis

Peer reviewed

Direct link

Wind, Stefanie A.; Patil, Yogendra J. – Educational and Psychological Measurement, 2018

Recent research has explored the use of models adapted from Mokken scale analysis as a nonparametric approach to evaluating rating quality in educational performance assessments. A potential limiting factor to the widespread use of these techniques is the requirement for complete data, as practical constraints in operational assessment systems…

Descriptors: Scaling, Data, Interrater Reliability, Writing Tests

Development and Validation of the Written Communication Assessment of the "HEIghten"® Outcomes Assessment Suite. Research Report. ETS RR-17-53

Peer reviewed
PDF on ERIC

Download full text

Rios, Joseph A.; Sparks, Jesse R.; Zhang, Mo; Liu, Ou Lydia – ETS Research Report Series, 2017

Proficiency with written communication (WC) is critical for success in college and careers. As a result, institutions face a growing challenge to accurately evaluate their students' writing skills to obtain data that can support demands of accreditation, accountability, or curricular improvement. Many current standardized measures, however, lack…

Descriptors: Test Construction, Test Validity, Writing Tests, College Outcomes Assessment

Evaluating the Stability of Test Score Means for the "TOEIC"® Speaking and Writing Tests. Research Report. ETS RR-17-50

Peer reviewed
PDF on ERIC

Download full text

Qu, Yanxuan; Huo, Yan; Chan, Eric; Shotts, Matthew – ETS Research Report Series, 2017

For educational tests, it is critical to maintain consistency of score scales and to understand the sources of variation in score means over time. This practice helps to ensure that interpretations about test takers' abilities are comparable from one administration (or one form) to another. This study examines the consistency of reported scores…

Descriptors: Scores, English (Second Language), Language Tests, Second Language Learning

Handwriting in Lebanese Bigraphic Children: Standardization of the BHK Scale

Peer reviewed

Direct link

Matta Abizeid, Carla; Tabsh Nakib, Amira; Younès Harb, Céleste; Ghantous Faddoul, Shereen; Albaret, Jean-Michel – Journal of Occupational Therapy, Schools & Early Intervention, 2017

Educational systems in Lebanon are bilingual. They simultaneously impose two handwriting systems in Arabic and Latin. This historically driven situation could constitute a significant impact on the process and development of handwriting skills. Using an accurate and valid handwriting evaluation tool standardized for the Lebanese population is a…

Descriptors: Foreign Countries, Handwriting, Writing Skills, Writing Tests

Consistency and Stability of Italian Children's Spelling in Dictation versus Composition Assessments

Peer reviewed

Direct link

Bigozzi, Lucia; Tarchi, Christian; Pinto, Giuliana – Reading & Writing Quarterly, 2017

The purpose of this study was to investigate consistency in spelling skills across 2 different tasks of written production (dictation vs. composition) and stability of performance across 4 different grades. We assessed 2nd, 3rd, 4th, and 5th graders' spelling performance through 4 tasks: 2 dictation tasks (passage and sentences) and 2 composition…

Descriptors: Foreign Countries, Spelling, Reliability, Verbal Communication

Evaluating Comparative Judgment as an Approach to Essay Scoring

Peer reviewed

Direct link

Steedle, Jeffrey T.; Ferrara, Steve – Applied Measurement in Education, 2016

As an alternative to rubric scoring, comparative judgment generates essay scores by aggregating decisions about the relative quality of the essays. Comparative judgment eliminates certain scorer biases and potentially reduces training requirements, thereby allowing a large number of judges, including teachers, to participate in essay evaluation.…

Descriptors: Essays, Scoring, Comparative Analysis, Evaluators

Examining the Technical Adequacy of Progress Monitoring Using Early Writing Curriculum-Based Measures

Peer reviewed

Direct link

Hampton, David D.; Lembke, Erica S. – Reading & Writing Quarterly, 2016

The purpose of this study was to examine 4 early writing measures used to monitor the early writing progress of 1st-grade students. We administered the measures to 23 1st-grade students biweekly for a total of 16 weeks. We obtained 3-min samples and conducted analyses for each 1-min increment. We scored samples using 2 different methods: correct…

Descriptors: Progress Monitoring, Curriculum Based Assessment, Writing Tests, Outcome Measures

Analysis of Rater Severity on Written Expression Exam Using Many Faceted Rasch Measurement

Peer reviewed
PDF on ERIC

Download full text

Prieto, Gerardo; Nieto, Eloísa – Psicologica: International Journal of Methodology and Experimental Psychology, 2014

This paper describes how a Many Faceted Rasch Measurement (MFRM) approach can be applied to performance assessment focusing on rater analysis. The article provides an introduction to MFRM, a description of MFRM analysis procedures, and an example to illustrate how to examine the effects of various sources of variability on test takers' performance…

Descriptors: Item Response Theory, Interrater Reliability, Rating Scales, Error of Measurement

Performance Assessment of High and Low Income Families through "Online RAW Achievement Battery Test" of Primary Grade Students

Peer reviewed
PDF on ERIC

Download full text

Ahmed, Tamim; Hanif, Maria – Journal of Education and Practice, 2016

This study is intended to investigate student's achievement capability among two families i.e. Low and High income families and designed for primary level learners. A Reading, Arithmetic and Writing (RAW) Achievement test that was developed as a part of another research study (Tamim Ahmed Khan, 2015) was adopted for this study. Both English medium…

Descriptors: Low Income, Performance Based Assessment, Elementary School Students, Achievement Tests

Measuring Essay Assessment: Intra-Rater and Inter-Rater Reliability

Peer reviewed
PDF on ERIC

Download full text

Kayapinar, Ulas – Eurasian Journal of Educational Research, 2014

Problem Statement: There have been many attempts to research the effective assessment of writing ability, and many proposals for how this might be done. In this sense, rater reliability plays a crucial role for making vital decisions about testees in different turning points of both educational and professional life. Intra-rater and inter-rater…

Descriptors: Interrater Reliability, Essay Tests, Writing Tests, Grading

Fostering the Memoir Writing Skills as a Creative Non-Fiction Genre Using a WebQuest Model

Download full text

Al-Sayed, Rania Kamal Muhammad; Abdel-Haq, Eman Muhammad; El-Deeb, Mervat Abou-Bakr; Ali, Mahsoub Abdel-Sadeq – Online Submission, 2016

The present study aimed at developing the memoir writing skills as a creative non-fiction genre of second year distinguished governmental language preparatory school pupils using the a WebQuest model. Fifty participants from second year at Hassan Abu-Bakr Distinguished Governmental Language School at Al-Qanater Al-Khairia(Qalubia Governorate) were…

Descriptors: Foreign Countries, Autobiographies, Personal Narratives, Writing Skills

Using the Method of Pairwise Comparison to Obtain Reliable Teacher Assessments

Peer reviewed
PDF on ERIC

Download full text

Heldsinger, Sandra; Humphry, Stephen – Australian Educational Researcher, 2010

Demands for accountability have seen the implementation of large scale testing programs in Australia and internationally. There is, however, a growing body of evidence to show that externally imposed testing programs do not have a sustained impact on student achievement. It has been argued that teacher assessment is more effective in raising…

Descriptors: Testing Programs, Testing, Academic Achievement, Measures (Individuals)

The Inter-Rater Reliability in Scoring Composition

Peer reviewed
PDF on ERIC

Download full text

Wang, Ping – English Language Teaching, 2009

This paper makes a study of the rater reliability in scoring composition in the test of English as a foreign language (EFL) and focuses on the inter-rater reliability as well as several interactions between raters and the other facets involved (that is examinees, rating criteria and rating methods). Results showed that raters were fairly…

Descriptors: Interrater Reliability, Scoring, Writing (Composition), English (Second Language)

Previous Page | Next Page »

Pages: 1 | 2

ETS Research Report Series	3
College Board	2
Reading & Writing Quarterly	2
Applied Measurement in…	1
Australian Educational…	1
Canadian Journal of…	1
College Entrance Examination…	1
Educational and Psychological…	1
English Language Teaching	1
Eurasian Journal of…	1
International Journal of…	1
Journal of Education and…	1
Journal of Occupational…	1
Language Testing	1
Language and Literacy Spectrum	1
National Center for Education…	1
Online Submission	1
ProQuest LLC	1
Psicologica: International…	1
Routledge, Taylor & Francis…	1
Scandinavian Journal of…	1
More ▼

Wind, Stefanie A.	2
Abdel-Haq, Eman Muhammad	1
Ahmed, Tamim	1
Al-Sayed, Rania Kamal Muhammad	1
Albaret, Jean-Michel	1
Ali, Mahsoub Abdel-Sadeq	1
Allen, Nancy	1
Andrews, Melissa	1
Bennett, Randy Elliot	1
Berge, Kjell Lars	1
Bigozzi, Lucia	1
Braswell, James	1
Breland, Hunter	1
Chan, Eric	1
Denison, D. Brian, Ed.	1
Dunn, David E.	1
El-Deeb, Mervat Abou-Bakr	1
Evensen, Lars Sigfred	1
Ewing, Maureen	1
Fasting, Rolf B.	1
Ferrara, Steve	1
Gentile, Claudia	1
Ghantous Faddoul, Shereen	1
Gomes, Hilary	1
Guastello, E. Francine	1
More ▼