ERIC - Search Results

Showing 1 to 15 of 35 results

Establishing a Physics Concept Inventory Using Computer Marked Free-Response Questions

Peer reviewed

Parker, Mark A. J.; Hedgeland, Holly; Jordan, Sally E.; Braithwaite, Nicholas St. J. – European Journal of Science and Mathematics Education, 2023

The study covers the development and testing of the alternative mechanics survey (AMS), a modified force concept inventory (FCI), which used automatically marked free-response questions. Data were collected over a period of three academic years from 611 participants who were taking physics classes at high school and university level. A total of…

Descriptors: Test Construction, Scientific Concepts, Physics, Test Reliability

The Use of Open-Ended Questions in Large-Scale Tests for Selection: Generalizability and Dependability

Peer reviewed

Atilgan, Hakan; Demir, Elif Kübra; Ogretmen, Tuncay; Basokcu, Tahsin Oguz – International Journal of Progressive Education, 2020

It has become a critical question what the reliability level would be when open-ended questions are used in large-scale selection tests. One of the aims of the present study is to determine what the reliability would be in the event that the answers given by test-takers are scored by experts when open-ended short answer questions are used in…

Descriptors: Foreign Countries, Secondary School Students, Test Items, Test Reliability

The Development of a Test to Explore the Students' Mental Models and External Representation Patterns of Hanging Objects

Peer reviewed

Kaharu, Sarintan N.; Mansyur, Jusman – Pegem Journal of Education and Instruction, 2021

This study aims to develop a test that can be used to explore mental models and representation patterns of objects in liquid fluid. The test developed by adapting the Reeves's Development Model was carried out in several stages, namely: determining the orientation and test segments; initial survey; preparation of the initial draft; try out;…

Descriptors: Test Construction, Schemata (Cognition), Scientific Concepts, Water

Development of a Comprehensive Tool for School Health Policy Evaluation: The WellSAT WSCC

Peer reviewed

Koriakin, Taylor A.; McKee, Sarah L.; Schwartz, Marlene B.; Chafouleas, Sandra M. – Journal of School Health, 2020

Background: Stakeholders increasingly recognize the role of policy in implementing Whole School, Whole Community, Whole Child (WSCC) frameworks in schools; however, few tools are currently available to assess alignment between district policies and WSCC concepts. The purpose of this study was to expand the Wellness School Assessment Tool (WellSAT)…

Descriptors: School Policy, Health Services, Health Promotion, Wellness

Development and Validation of a Survey Instrument for Measuring Pre-Service Teachers' Pedagogical Content Knowledge

Peer reviewed

Martin, David; Jamieson-Proctor, Romina – International Journal of Research & Method in Education, 2020

In Australia, one of the key findings of the Teacher Education Ministerial Advisory Group was that not all graduating pre-service teachers possess adequate pedagogical content knowledge (PCK) to teach effectively. The concern is that higher education providers working with pre-service teachers are using pedagogical practices and assessments which…

Descriptors: Test Construction, Preservice Teachers, Pedagogical Content Knowledge, Foreign Countries

Inter-Rater Agreement in Assigning Cognitive Demand to Life Sciences Examination Questions

Peer reviewed

Dempster, Edith R.; Kirby, Nicola F. – Perspectives in Education, 2018

Taxonomies of cognitive demand are frequently used to ensure that assessment tasks include questions ranging from low to high cognitive demand. This paper investigates inter-rater agreement among four evaluators on the cognitive demand of the South African National Senior Certificate Life Sciences examinations after training, practice and…

Descriptors: Interrater Reliability, Biological Sciences, Cognitive Processes, Test Items

Inter-Rater Agreement in Assigning Levels of Difficulty to Examination Questions in Life Sciences

Peer reviewed

Dempster, Edith R.; Kirby, Nicki F. – South African Journal of Education, 2018

Public perception of "declining standards" in school-leaving examinations often accompanies increases in pass rates in school-leaving examinations. "Declining standards" to the public means easier examination papers. The present study evaluates a South African attempt to estimate the level of difficulty, as distinct from…

Descriptors: Foreign Countries, Interrater Reliability, Difficulty Level, Science Tests

Autism at a Glance: A Pilot Study Optimizing Thin-Slice Observations

Peer reviewed

Hampton, Lauren H.; Curtis, Philip R.; Roberts, Megan Y. – Autism: The International Journal of Research and Practice, 2019

Borrowing from a clinical psychology observational methodology, thin-slice observations were used to assess autism characteristics in toddlers. Thin-slices are short observations taken from a longer behavior stream which are assigned ratings by multiple raters using a 5-point scale. The raters' observations are averaged together to assign a…

Descriptors: Autism, Pervasive Developmental Disorders, Observation, Toddlers

Validation of Sub-Constructs in Reading Comprehension Tests Using Teachers' Classification of Cognitive Targets

Peer reviewed

Tengberg, Michael – Language Assessment Quarterly, 2018

Reading comprehension is often treated as a multidimensional construct. In many reading tests, items are distributed over reading process categories to represent the subskills expected to constitute comprehension. This study explores (a) the extent to which specified subskills of reading comprehension tests are conceptually conceivable to…

Descriptors: Reading Tests, Reading Comprehension, Scores, Test Results

Development and Validation of the Written Communication Assessment of the "HEIghten"® Outcomes Assessment Suite. Research Report. ETS RR-17-53

Peer reviewed

Rios, Joseph A.; Sparks, Jesse R.; Zhang, Mo; Liu, Ou Lydia – ETS Research Report Series, 2017

Proficiency with written communication (WC) is critical for success in college and careers. As a result, institutions face a growing challenge to accurately evaluate their students' writing skills to obtain data that can support demands of accreditation, accountability, or curricular improvement. Many current standardized measures, however, lack…

Descriptors: Test Construction, Test Validity, Writing Tests, College Outcomes Assessment

Smarter Balanced Assessment Consortium: Alignment Study Report. Revised

Smarter Balanced Assessment Consortium, 2016

The goal of this study was to gather comprehensive evidence about the alignment of the Smarter Balanced summative assessments to the Common Core State Standards (CCSS). Alignment of the Smarter Balanced summative assessments to the CCSS is a critical piece of evidence regarding the validity of inferences students, teachers and policy makers can…

Descriptors: Alignment (Education), Summative Evaluation, Common Core State Standards, Test Content

ITC Guidelines for the Large-Scale Assessment of Linguistically and Culturally Diverse Populations

Peer reviewed

International Journal of Testing, 2019

These guidelines describe considerations relevant to the assessment of test takers in or across countries or regions that are linguistically or culturally diverse. The guidelines were developed by a committee of experts to help inform test developers, psychometricians, test users, and test administrators about fairness issues in support of the…

Descriptors: Test Bias, Student Diversity, Cultural Differences, Language Usage

Comparison of Integrated Testlet and Constructed-Response Question Formats

Peer reviewed

Slepkov, Aaron D.; Shiell, Ralph C. – Physical Review Special Topics - Physics Education Research, 2014

Constructed-response (CR) questions are a mainstay of introductory physics textbooks and exams. However, because of the time, cost, and scoring reliability constraints associated with this format, CR questions are being increasingly replaced by multiple-choice (MC) questions in formal exams. The integrated testlet (IT) is a recently developed…

Descriptors: Science Tests, Physics, Responses, Multiple Choice Tests

Internal Structure of the Reflective Functioning Scale

Peer reviewed

Taubner, Svenja; Horz, Susanne; Fischer-Kern, Melitta; Doering, Stephan; Buchheim, Anna; Zimmermann, Johannes – Psychological Assessment, 2013

The Reflective Functioning Scale (RFS) was developed to assess individual differences in the ability to mentalize attachment relationships. The RFS assesses mentalization from transcripts of the Adult Attachment Interview (AAI). A global score is given by trained coders on an 11-point scale ranging from antireflective to exceptionally reflective.…

Descriptors: Measures (Individuals), Attachment Behavior, Individual Differences, Adults

Developing an Observation Instrument to Support Authentic Independent Reading Time during School in a Data-Driven World

Peer reviewed

Williams, Lunetta M.; Hall, Katrina W.; Hedrick, Wanda B.; Lamkin, Marcia; Abendroth, Jennifer – Journal of Language and Literacy Education, 2013

The purpose of the present study was to develop an instrument to measure reading during in-school independent reading (ISIR). Procedures to establish validity and reliability of the instrument included videotaping and observing students during ISIR, gathering feedback from literacy experts, establishing interrater reliability, crosschecking…

Descriptors: Test Construction, Test Validity, Test Reliability, Video Technology

