Deniz, Kaan Zulfikar; Ilican, Emel – International Journal of Assessment Tools in Education, 2021
This study aims to compare the G and Phi coefficients as estimated by D studies for a measurement tool with the G and Phi coefficients obtained from real cases in which items of differing difficulty levels were added and also to determine the conditions under which the D studies estimated reliability coefficients closer to reality. The study group…
Descriptors: Generalizability Theory, Test Items, Difficulty Level, Test Reliability
Huebner, Alan; Skar, Gustaf B. – Practical Assessment, Research & Evaluation, 2021
Writing assessments often consist of students responding to multiple prompts, which are judged by more than one rater. To establish the reliability of these assessments, there exist different methods to disentangle variation due to prompts and raters, including classical test theory, Many Facet Rasch Measurement (MFRM), and Generalizability Theory…
Descriptors: Error of Measurement, Test Theory, Generalizability Theory, Item Response Theory
New York State Education Department, 2024
The New York State Education Department (NYSED) has a partnership with NWEA for the development of the 2024 Grades 3-8 English Language Arts Tests. Teachers from across the State work with NYSED in a variety of activities to ensure the validity and reliability of the New York State Testing Program (NYSTP). The 2024 Grades 6 and 7 English Language…
Descriptors: Language Tests, Test Format, Language Arts, English Instruction
Ling, Guangming – International Journal of Testing, 2016
To investigate possible iPad-related mode effect, we tested 403 8th graders in Indiana, Maryland, and New Jersey under three mode conditions through random assignment: a desktop computer, an iPad alone, and an iPad with an external keyboard. All students had used an iPad or computer for six months or longer. The 2-hour test included reading, math,…
Descriptors: Educational Testing, Computer Assisted Testing, Handheld Devices, Computers
Gu, Lin; Turkan, Sultan; Gomez, Pablo Garcia – ETS Research Report Series, 2015
ELTeach is an online professional development program developed by Educational Testing Service (ETS) in collaboration with National Geographic Learning. The ELTeach program consists of two courses: English-for-Teaching and Professional Knowledge for English Language Teaching (ELT). Each course includes a coordinated assessment leading to a score…
Descriptors: Item Analysis, Test Items, English (Second Language), Second Language Instruction
Merrigan, Teresa E. – ProQuest LLC, 2012
The purpose of the current study was to evaluate the psychometric properties of alternative approaches to administering and scoring curriculum-based measurement for written expression. Specifically, three response durations (3, 5, and 7 minutes) and six score types (total words written, words spelled correctly, percent of words spelled correctly,…
Descriptors: Curriculum Based Assessment, Testing, Scoring, Writing Tests