Wind, Stefanie A. – Language Testing, 2019
Differences in rater judgments that are systematically related to construct-irrelevant characteristics threaten the fairness of rater-mediated writing assessments. Accordingly, it is essential that researchers and practitioners examine the degree to which the psychometric quality of rater judgments is comparable across test-taker subgroups.…
Descriptors: Nonparametric Statistics, Interrater Reliability, Differences, Writing Tests
Merchant, Stefan; Rich, Jessica; Klinger, Don A. – Canadian Journal of Educational Administration and Policy, 2022
Both school and district administrators use the results of standardized, large-scale tests to inform decisions about the need for, or success of, educational programs and interventions. However, test results at the school level are subject to random fluctuations due to changes in cohort, test items, and other factors outside of the school's…
Descriptors: Standardized Tests, Foreign Countries, Generalizability Theory, Scores
Wind, Stefanie A.; Patil, Yogendra J. – Educational and Psychological Measurement, 2018
Recent research has explored the use of models adapted from Mokken scale analysis as a nonparametric approach to evaluating rating quality in educational performance assessments. A potential limiting factor to the widespread use of these techniques is the requirement for complete data, as practical constraints in operational assessment systems…
Descriptors: Scaling, Data, Interrater Reliability, Writing Tests
Rios, Joseph A.; Sparks, Jesse R.; Zhang, Mo; Liu, Ou Lydia – ETS Research Report Series, 2017
Proficiency with written communication (WC) is critical for success in college and careers. As a result, institutions face a growing challenge to accurately evaluate their students' writing skills to obtain data that can support demands of accreditation, accountability, or curricular improvement. Many current standardized measures, however, lack…
Descriptors: Test Construction, Test Validity, Writing Tests, College Outcomes Assessment
Qu, Yanxuan; Huo, Yan; Chan, Eric; Shotts, Matthew – ETS Research Report Series, 2017
For educational tests, it is critical to maintain consistency of score scales and to understand the sources of variation in score means over time. This practice helps to ensure that interpretations about test takers' abilities are comparable from one administration (or one form) to another. This study examines the consistency of reported scores…
Descriptors: Scores, English (Second Language), Language Tests, Second Language Learning
Matta Abizeid, Carla; Tabsh Nakib, Amira; Younès Harb, Céleste; Ghantous Faddoul, Shereen; Albaret, Jean-Michel – Journal of Occupational Therapy, Schools & Early Intervention, 2017
Educational systems in Lebanon are bilingual. They simultaneously impose two handwriting systems in Arabic and Latin. This historically driven situation could constitute a significant impact on the process and development of handwriting skills. Using an accurate and valid handwriting evaluation tool standardized for the Lebanese population is a…
Descriptors: Foreign Countries, Handwriting, Writing Skills, Writing Tests
Consistency and Stability of Italian Children's Spelling in Dictation versus Composition Assessments
Bigozzi, Lucia; Tarchi, Christian; Pinto, Giuliana – Reading & Writing Quarterly, 2017
The purpose of this study was to investigate consistency in spelling skills across 2 different tasks of written production (dictation vs. composition) and stability of performance across 4 different grades. We assessed 2nd, 3rd, 4th, and 5th graders' spelling performance through 4 tasks: 2 dictation tasks (passage and sentences) and 2 composition…
Descriptors: Foreign Countries, Spelling, Reliability, Verbal Communication
Steedle, Jeffrey T.; Ferrara, Steve – Applied Measurement in Education, 2016
As an alternative to rubric scoring, comparative judgment generates essay scores by aggregating decisions about the relative quality of the essays. Comparative judgment eliminates certain scorer biases and potentially reduces training requirements, thereby allowing a large number of judges, including teachers, to participate in essay evaluation.…
Descriptors: Essays, Scoring, Comparative Analysis, Evaluators
Hampton, David D.; Lembke, Erica S. – Reading & Writing Quarterly, 2016
The purpose of this study was to examine 4 early writing measures used to monitor the early writing progress of 1st-grade students. We administered the measures to 23 1st-grade students biweekly for a total of 16 weeks. We obtained 3-min samples and conducted analyses for each 1-min increment. We scored samples using 2 different methods: correct…
Descriptors: Progress Monitoring, Curriculum Based Assessment, Writing Tests, Outcome Measures
Prieto, Gerardo; Nieto, Eloísa – Psicologica: International Journal of Methodology and Experimental Psychology, 2014
This paper describes how a Many Faceted Rasch Measurement (MFRM) approach can be applied to performance assessment focusing on rater analysis. The article provides an introduction to MFRM, a description of MFRM analysis procedures, and an example to illustrate how to examine the effects of various sources of variability on test takers' performance…
Descriptors: Item Response Theory, Interrater Reliability, Rating Scales, Error of Measurement
Ahmed, Tamim; Hanif, Maria – Journal of Education and Practice, 2016
This study investigates students' achievement capability across two groups of families, i.e., low- and high-income families, and is designed for primary-level learners. A Reading, Arithmetic and Writing (RAW) Achievement test that was developed as part of another research study (Tamim Ahmed Khan, 2015) was adopted for this study. Both English medium…
Descriptors: Low Income, Performance Based Assessment, Elementary School Students, Achievement Tests
Kayapinar, Ulas – Eurasian Journal of Educational Research, 2014
Problem Statement: There have been many attempts to research the effective assessment of writing ability, and many proposals for how this might be done. In this sense, rater reliability plays a crucial role in making vital decisions about testees at different turning points of both educational and professional life. Intra-rater and inter-rater…
Descriptors: Interrater Reliability, Essay Tests, Writing Tests, Grading
Al-Sayed, Rania Kamal Muhammad; Abdel-Haq, Eman Muhammad; El-Deeb, Mervat Abou-Bakr; Ali, Mahsoub Abdel-Sadeq – Online Submission, 2016
The present study aimed at developing memoir writing skills, as a creative non-fiction genre, among second-year distinguished governmental language preparatory school pupils using a WebQuest model. Fifty participants from the second year at Hassan Abu-Bakr Distinguished Governmental Language School at Al-Qanater Al-Khairia (Qalubia Governorate) were…
Descriptors: Foreign Countries, Autobiographies, Personal Narratives, Writing Skills
Heldsinger, Sandra; Humphry, Stephen – Australian Educational Researcher, 2010
Demands for accountability have seen the implementation of large scale testing programs in Australia and internationally. There is, however, a growing body of evidence to show that externally imposed testing programs do not have a sustained impact on student achievement. It has been argued that teacher assessment is more effective in raising…
Descriptors: Testing Programs, Testing, Academic Achievement, Measures (Individuals)
Wang, Ping – English Language Teaching, 2009
This paper studies rater reliability in scoring compositions in the test of English as a foreign language (EFL), focusing on inter-rater reliability as well as several interactions between raters and the other facets involved (that is, examinees, rating criteria, and rating methods). Results showed that raters were fairly…
Descriptors: Interrater Reliability, Scoring, Writing (Composition), English (Second Language)
