Lenz, A. Stephen; Ault, Haley; Balkin, Richard S.; Barrio Minton, Casey; Erford, Bradley T.; Hays, Danica G.; Kim, Bryan S. K.; Li, Chi – Measurement and Evaluation in Counseling and Development, 2022
In April 2021, the Association for Assessment and Research in Counseling Executive Council commissioned a time-referenced task group to revise the Responsibilities of Users of Standardized Tests (RUST) Statement (3rd edition) published by the Association for Assessment in Counseling (AAC) in 2003. The task group developed a work plan to implement…
Descriptors: Responsibility, Standardized Tests, Counselor Training, Ethics
Wendler, Cathy; Glazer, Nancy; Cline, Frederick – ETS Research Report Series, 2019
One of the challenges in scoring constructed-response (CR) items and tasks is ensuring that rater drift does not occur during or across scoring windows. Rater drift reflects changes in how raters interpret and use established scoring criteria to assign essay scores. Calibration is a process used to help control rater drift and, as such, serves as…
Descriptors: College Entrance Examinations, Graduate Study, Accuracy, Test Reliability
New York State Education Department, 2024
The New York State Education Department (NYSED) has a partnership with NWEA for the development of the 2024 Grades 3-8 English Language Arts Tests. Teachers from across the State work with NYSED in a variety of activities to ensure the validity and reliability of the New York State Testing Program (NYSTP). The 2024 Grades 6 and 7 English Language…
Descriptors: Language Tests, Test Format, Language Arts, English Instruction
New York State Education Department, 2022
The instructions in this manual explain the responsibilities of school administrators for the New York State Testing Program (NYSTP) Grades 3-8 English Language Arts and Mathematics Paper-Based Field Tests. School administrators must be thoroughly familiar with the contents of the manual, and the policies and procedures must be followed as written…
Descriptors: Testing Programs, Mathematics Tests, Test Format, Computer Assisted Testing
New York State Education Department, 2022
The instructions in this manual explain the responsibilities of school administrators for the New York State Testing Program (NYSTP) Grades 3-8 English Language Arts and Mathematics Field Tests, and the Elementary-level (Grade 5) and Intermediate-level (Grade 8) Science Field Tests. School administrators must be thoroughly familiar with the…
Descriptors: Testing Programs, Mathematics Tests, Test Format, Computer Assisted Testing
New York State Education Department, 2021
The instructions in this manual explain the responsibilities of school administrators for the New York State Testing Program (NYSTP) Grades 3-8 English Language Arts and Mathematics Tests. School administrators must be thoroughly familiar with the contents of the manual, and the policies and procedures must be followed as written so that testing…
Descriptors: Testing Programs, Mathematics Tests, Test Format, Computer Assisted Testing
New York State Education Department, 2020
The instructions in this manual explain the responsibilities of school administrators for the New York State Testing Program (NYSTP) Grades 3-8 English Language Arts and Mathematics Tests. School administrators must be thoroughly familiar with the contents of the manual, and the policies and procedures must be followed as written so that testing…
Descriptors: Testing Programs, Mathematics Tests, Test Format, Computer Assisted Testing
New York State Education Department, 2019
The instructions in this manual explain the responsibilities of school administrators for the New York State Testing Program (NYSTP) Grades 3-8 English Language Arts and Mathematics Tests. School administrators must be thoroughly familiar with the contents of the manual, and the policies and procedures must be followed as written so that testing…
Descriptors: Testing Programs, Mathematics Tests, Test Format, Computer Assisted Testing
International Journal of Testing, 2019
These guidelines describe considerations relevant to the assessment of test takers in or across countries or regions that are linguistically or culturally diverse. The guidelines were developed by a committee of experts to help inform test developers, psychometricians, test users, and test administrators about fairness issues in support of the…
Descriptors: Test Bias, Student Diversity, Cultural Differences, Language Usage
Macqueen, Susy; Harding, Luke – Language Testing, 2009
In 2002 the University of Cambridge Local Examinations Syndicate (UCLES) implemented a revised version of the Certificate of Proficiency in English (CPE). CPE, which is the highest level of the Main Suite of Cambridge ESOL exams, comprises five modules, "Reading," "Writing," "Use of English," "Listening" and "Speaking," the latter of which is the…
Descriptors: Speech Communication, Test Reviews, Examiners, English (Second Language)

Rothman, Carole – Psychology in the Schools, 1974
The purpose of this study is to determine (1) whether the previously observed vulnerability of WISC subtests to tester effects appeared under ordinary testing conditions, and (2) which subtests were most susceptible to these effects. Results support the presence of both general and differential vulnerability of subtests. (Author)
Descriptors: Examiners, School Psychologists, Statistical Bias, Test Reliability
Powell, Thomas W. – Clinical Linguistics & Phonetics, 2006
The third edition of the "Boston Diagnostic Aphasia Examination" (Goodglass, Kaplan, and Barresi) introduced standardized procedures for coding discourse samples elicited using the well-known Cookie Theft illustration. To evaluate the reliability of this discourse coding procedure, a transcribed sample was coded by 14 novice examiners…
Descriptors: Examiners, Interrater Reliability, Test Reliability, Aphasia

Murphy, R. J. L. – British Journal of Educational Psychology, 1978
Eight recent General Certificate of Education (GCE) examinations, containing mainly free-response questions, were investigated in terms of their marking reliability. The tests of 200 randomly selected candidates from each subject were re-marked by a senior GCE examiner, and these marks were compared with the marks awarded previously as a result of…
Descriptors: Educational Psychology, Examiners, Grading, Item Analysis
Baldauf, Richard B., Jr.; Bisazza, John A. – 1981
One hundred eighty limited-English-speaking college students from the Philippines, Hong Kong, India, Japan, Thailand, and Taiwan were administered the Michigan Test of Aural Comprehension (MTAC). An analysis of the MTAC indicated that the three unrestricted forms of the test show satisfactory internal consistency reliability comparable to that…
Descriptors: English (Second Language), Examiners, Experimenter Characteristics, Higher Education

Wood, R.; Quinn, B. – Educational Review, 1976
Impression marking of English Language essay and summary questions by pairs of examiners is shown, as expected, to be more reliable than single marking. Given the limited statistical information available, it is concluded that pairing of examiners can as well be done by random or quasi-random means as by attempts at calculated matching. (Editor/RK)
Descriptors: Bias, Educational Research, Essay Tests, Examiners