ERIC - Search Results

Showing 1 to 15 of 71 results

Establishing a Physics Concept Inventory Using Computer Marked Free-Response Questions

Peer reviewed

Parker, Mark A. J.; Hedgeland, Holly; Jordan, Sally E.; Braithwaite, Nicholas St. J. – European Journal of Science and Mathematics Education, 2023

The study covers the development and testing of the alternative mechanics survey (AMS), a modified force concept inventory (FCI), which used automatically marked free-response questions. Data were collected over a period of three academic years from 611 participants who were taking physics classes at high school and university level. A total of…

Descriptors: Test Construction, Scientific Concepts, Physics, Test Reliability

Evaluating Mathematics Lessons for Cognitive Demand: Applying a Discursive Lens to the Process of Achieving Inter-Rater Reliability

Peer reviewed

Weingarden, Merav; Heyd-Metzuyanim, Einat – Journal of Mathematics Teacher Education, 2023

In this study, we examine "what went wrong" in our professional development program for encouraging cognitively demanding instruction, focusing on the difficulties we encountered in using an observational tool for evaluating this type of instruction and reaching inter-rater reliability. We do so through the lens of a discursive theory of…

Descriptors: Mathematics Instruction, Interrater Reliability, Cognitive Processes, Difficulty Level

Examination of Map Reading Skills with Orienteering Activity: An Example of Many Facet Rasch Model

Peer reviewed

Uyar, Seyma; Yayla, Onur; Zunber, Hidayet – International Journal of Assessment Tools in Education, 2022

The purpose of the current study is to examine the map reading skills of Social Studies pre-service teachers with orienteering, which is an activity-based and more active practice. To this end, a total of 10 students attending the Department of Social Studies Teaching in the Education Faculty of Burdur Mehmet Akif Ersoy University and taking the…

Descriptors: Map Skills, Navigation, Item Response Theory, Social Studies

Beyond Percent Correct: Measuring Change in Individual Picture Naming Ability

Peer reviewed

Walker, Grant M.; Basilakos, Alexandra; Fridriksson, Julius; Hickok, Gregory – Journal of Speech, Language, and Hearing Research, 2022

Purpose: Meaningful changes in picture naming responses may be obscured when measuring accuracy instead of quality. A statistic that incorporates information about the severity and nature of impairments may be more sensitive to the effects of treatment. Method: We analyzed data from repeated administrations of a naming test to 72 participants with…

Descriptors: Naming, Change, Aphasia, Severity (of Disability)

Inter-Rater Agreement in Assigning Levels of Difficulty to Examination Questions in Life Sciences

Peer reviewed

Dempster, Edith R.; Kirby, Nicki F. – South African Journal of Education, 2018

Public perception of "declining standards" in school-leaving examinations often accompanies increases in pass rates in school-leaving examinations. "Declining standards" to the public means easier examination papers. The present study evaluates a South African attempt to estimate the level of difficulty, as distinct from…

Descriptors: Foreign Countries, Interrater Reliability, Difficulty Level, Science Tests

Computer-Based and Paper-and-Pencil Tests: A Study in Calculus for STEM Majors

Peer reviewed

Smolinsky, Lawrence; Marx, Brian D.; Olafsson, Gestur; Ma, Yanxia A. – Journal of Educational Computing Research, 2020

Computer-based testing is an expanding use of technology offering advantages to teachers and students. We studied Calculus II classes for science, technology, engineering, and mathematics majors using different testing modes. Three sections with 324 students employed: paper-and-pencil testing, computer-based testing, and both. Computer tests gave…

Descriptors: Test Format, Computer Assisted Testing, Paper (Material), Calculus

Does Comparative Judgement of Scripts Provide an Effective Means of Maintaining Standards in Mathematics? Research Report

Benton, Tom; Leech, Tony; Hughes, Sarah – Cambridge Assessment, 2020

In the context of examinations, the phrase "maintaining standards" usually refers to any activity designed to ensure that it is no easier (or harder) to achieve a given grade in one year than in another. Specifically, it tends to mean activities associated with setting examination grade boundaries. Benton et al (2020) describes a method…

Descriptors: Mathematics Tests, Equated Scores, Comparative Analysis, Difficulty Level

Measurement Properties of a Standardized Elicited Imitation Test: An Integrative Data Analysis

Peer reviewed

Isbell, Daniel R.; Son, Young-A – Studies in Second Language Acquisition, 2022

Elicited Imitation Tests (EITs) are commonly used in second language acquisition (SLA)/bilingualism research contexts to assess the general oral proficiency of study participants. While previous studies have provided valuable EIT construct-related validity evidence, some key gaps remain. This study uses an integrative data analysis to further…

Descriptors: Bilingualism, Imitation, Language Tests, Second Language Learning

Validation of Sub-Constructs in Reading Comprehension Tests Using Teachers' Classification of Cognitive Targets

Peer reviewed

Tengberg, Michael – Language Assessment Quarterly, 2018

Reading comprehension is often treated as a multidimensional construct. In many reading tests, items are distributed over reading process categories to represent the subskills expected to constitute comprehension. This study explores (a) the extent to which specified subskills of reading comprehension tests are conceptually conceivable to…

Descriptors: Reading Tests, Reading Comprehension, Scores, Test Results

Rater Judgments and Word Difficulty: Conceptualizing the Substantive Validity of the VST

Peer reviewed

Canning, Derek N.; McLean, Stuart; Vitta, Joseph P. – Vocabulary Learning and Instruction, 2022

The substantive component of construct validity requires a confrontation between empirical test results and content relevance. The Vocabulary Size Test (VST) has been extensively validated in terms of empirical results. Less is known, however, about expert judgments of content relevance. The VST was constructed and validated according to the…

Descriptors: Foreign Countries, Undergraduate Students, College Faculty, Vocabulary Skills

Low Inter-Rater Reliability of a High Stakes Performance Assessment of Teacher Candidates

Peer reviewed

Lyness, Scott A.; Peterson, Kent; Yates, Kenneth – Education Sciences, 2021

The Performance Assessment for California Teachers (PACT) is a high stakes summative assessment that was designed to measure pre-service teacher readiness. We examined the inter-rater reliability (IRR) of trained PACT evaluators who rated 19 candidates. As measured by Cohen's weighted kappa, the overall IRR estimate was 0.17 (poor strength of…

Descriptors: High Stakes Tests, Performance Based Assessment, Teacher Effectiveness, Academic Language

Smarter Balanced Assessment Consortium: Alignment Study Report. Revised

Smarter Balanced Assessment Consortium, 2016

The goal of this study was to gather comprehensive evidence about the alignment of the Smarter Balanced summative assessments to the Common Core State Standards (CCSS). Alignment of the Smarter Balanced summative assessments to the CCSS is a critical piece of evidence regarding the validity of inferences students, teachers and policy makers can…

Descriptors: Alignment (Education), Summative Evaluation, Common Core State Standards, Test Content

Development, Reliability, and Validity of the Oral Reading Assessment for Mandarin-Speaking Children with Hearing Loss

Peer reviewed

Hung, Yu-Chen; Chan, Yi-Chih – Deafness & Education International, 2020

Unlike their peers with typical hearing, reading and speech challenges observed among children with hearing loss may not only be caused by developmental issues but also hearing-related problems. Although conventional oral reading assessments are useful for identifying children at risk of reading difficulties, they do not help examiners identify…

Descriptors: Test Construction, Test Validity, Oral Reading, Reading Tests

Developing an Instrument to Detect Science Misconception of an Elementary School Teacher

Peer reviewed

Desstya, Anatri; Prasetyo, Zuhdan Kun; Suyanta; Susila, Ihwan; Irwanto – International Journal of Instruction, 2019

This study aims to report the development of an instrument that is standardized (reviewed by validity, reliability, and difficulty index) to detect science misconception in an elementary school teacher. This study used a 4-D model: defining, designing, developing, and disseminating. First, it was prepared with 47 open-ended questions, and then it…

Descriptors: Elementary School Teachers, Misconceptions, Evaluation Methods, Teacher Evaluation

Regression Effects in Angoff Ratings: Examples from Credentialing Exams

Peer reviewed

Wyse, Adam E. – Applied Measurement in Education, 2018

This article discusses regression effects that are commonly observed in Angoff ratings where panelists tend to think that hard items are easier than they are and easy items are more difficult than they are in comparison to estimated item difficulties. Analyses of data from two credentialing exams illustrate these regression effects and the…

Descriptors: Regression (Statistics), Test Items, Difficulty Level, Licensing Examinations (Professions)
