ERIC - Search Results

Showing 1 to 15 of 80 results

There Are Many Greater Lower Bounds than Cronbach's α: A Monte Carlo Simulation Study

Peer reviewed

Novak, Josip; Rebernjak, Blaž – Measurement: Interdisciplinary Research and Perspectives, 2023

A Monte Carlo simulation study was conducted to examine the performance of the α, λ2, λ_4, λ_2, ω_T, GLB_MRFA, and GLB_Algebraic coefficients. Population reliability, distribution shape, sample size, test length, and number of response categories were varied…

Descriptors: Monte Carlo Methods, Evaluation Methods, Reliability, Simulation

Relating Pictorial and Verbal Forms of Assessments of the Particle Model of Matter in Two Communities of Students

Peer reviewed

Langbeheim, Elon; Akaygun, Sevil; Adadan, Emine; Hlatshwayo, Manzini; Ramnarain, Umesh – International Journal of Science and Mathematics Education, 2023

Linking assessment and curriculum in science education, particularly within the topic of matter and its changes, is often taken for granted. Some of the fundamental elements of the assessment, such as the choice of wording and visual representations, as well as its relation to the curricular sequence, remain understudied. In addition, very few…

Descriptors: Student Evaluation, Evaluation Methods, Science Education, Test Items

The Concurrent Validity of Comparative Judgement Outcomes Compared with Marks

Gill, Tim – Research Matters, 2022

In Comparative Judgement (CJ) exercises, examiners are asked to look at a selection of candidate scripts (with marks removed) and order them in terms of which they believe display the best quality. By including scripts from different examination sessions, the results of these exercises can be used to help with maintaining standards. Results from…

Descriptors: Comparative Analysis, Decision Making, Scripts, Standards

Variation in Assembling Assessments Using Automated Test Assembly Methodologies: Item-Pool Constraints and Response-Time Targets

Aaron McVay – ProQuest LLC, 2021

As assessments move towards computerized testing and make continuous testing available, the need for rapid assembly of forms is increasing. The objective of this study was to investigate variability in assembled forms through the lens of first- and second-order equity properties of equating, by examining three factors and their interactions. Two…

Descriptors: Automation, Computer Assisted Testing, Test Items, Reaction Time

Assessment of Basic Competencies in Adults: Item Pool Validity and Reliability Study

Peer reviewed

Toker, Turker – International Journal of Curriculum and Instruction, 2023

Achievement tests are among the most widely used data collection tools to measure the knowledge and skill levels of individuals. For this reason, the existence of valid and reliable achievement tests that can perfectly reveal the competencies that a person should have in any discipline is of great importance. The purpose of this research is to…

Descriptors: Basic Skills, Evaluation Methods, Test Items, Test Validity

A Study on the Assessment Methods and Experiences of Teachers at an Ethiopian University

Peer reviewed

Sewagegn, Abatihun A. – International Journal of Instruction, 2019

Assessment plays a significant role in determining the quality of education. This is particularly so when students are properly assessed using various appropriate methods of assessment. This study investigates teachers' assessment methods and the challenges they encounter in assessing learning in an Ethiopian university. A convergent parallel…

Descriptors: Evaluation Methods, Teaching Experience, College Faculty, Foreign Countries

Investigation of Rater Tendencies and Reliability in Different Assessment Methods with Many Facet Rasch Model

Peer reviewed

Koçak, Duygu – International Electronic Journal of Elementary Education, 2020

One of the most commonly used methods for measuring higher-order thinking skills such as problem-solving or written expression is open-ended items. Three main approaches are used to evaluate responses to open-ended items: general evaluation, rating scales, and rubrics. In order to measure and improve problem-solving skills of students, firstly, an…

Descriptors: Interrater Reliability, Item Response Theory, Test Items, Rating Scales

A Review of Subscore Estimation Methods. ETS RR-18-17

Peer reviewed

Fu, Jianbin; Qu, Yanxuan – ETS Research Report Series, 2018

Various subscore estimation methods that use auxiliary information to improve subscore accuracy and stability have been developed. This report provides a review of various subscore estimation methods described in the literature. The methodology of each method is described, then research studies on these subscore estimation methods are summarized.…

Descriptors: Scores, Evaluation Methods, Item Response Theory, Test Items

Comprehensive Assessment of a Project Based Learning Application in a Project Management Course

Peer reviewed

Torres, Anthony; Sriraman, Vedaraman; Ortiz, Araceli – International Journal of Instruction, 2021

The focus of this study is to implement multiple assessment methods in order to comprehensively assess the impact of a Project Based Learning (PrBL) application in a construction project management course. The assessment methods include various direct (objective) and indirect (subjective) evaluation methods. These methods included a pre and post…

Descriptors: Active Learning, Student Projects, Construction Management, Student Attitudes

Exploring the Appropriateness of Test Accommodations for Chinese University Students with Hearing Impairment

Peer reviewed

Sanyin Cheng – Journal of Developmental and Physical Disabilities, 2020

This research evaluates the appropriateness of test accommodations among Chinese university students with hearing impairment, using reliability estimates and exploratory factor analysis. Study 1 explores the appropriateness of test directions accommodation for one nonverbal assessment (Group Embedded Figures Test) and two verbal assessments with…

Descriptors: Testing Accommodations, College Students, Deafness, Hearing Impairments

Calibrated Parsing Items Evaluation: A Step towards Objectifying the Translation Assessment

Peer reviewed

Akbari, Alireza; Shahnazari, Mohammadtaghi – Language Testing in Asia, 2019

The present research paper introduces a translation evaluation method called Calibrated Parsing Items Evaluation (CPIE hereafter). This evaluation method maximizes translators' performance through identifying the parsing items with an optimal p-docimology and d-index (item discrimination). This method checks all the possible parses (annotations)…

Descriptors: Test Items, Translation, Computer Software, Evaluators

Exploration of Factors Affecting the Added Value of Test Subscores

Peer reviewed

Wang, Xiaolin; Svetina, Dubravka; Dai, Shenghai – Journal of Experimental Education, 2019

Recently, interest in test subscore reporting for diagnosis purposes has been growing rapidly. The two simulation studies here examined factors (sample size, number of subscales, correlation between subscales, and three factors affecting subscore reliability: number of items per subscale, item parameter distribution, and data generating model)…

Descriptors: Value Added Models, Scores, Sample Size, Correlation

ITC Guidelines for the Large-Scale Assessment of Linguistically and Culturally Diverse Populations

Peer reviewed

International Journal of Testing, 2019

These guidelines describe considerations relevant to the assessment of test takers in or across countries or regions that are linguistically or culturally diverse. The guidelines were developed by a committee of experts to help inform test developers, psychometricians, test users, and test administrators about fairness issues in support of the…

Descriptors: Test Bias, Student Diversity, Cultural Differences, Language Usage

Modification and Validation of the Mixed-Format Engineering Concept Assessment for Middle School Students Using Many-Facet Rasch Measurement

Peer reviewed

Koskey, Kristin L. K.; Makki, Nidaa; Ahmed, Wondimu; Garafolo, Nicholas G.; Visco, Donald P., Jr. – School Science and Mathematics, 2020

Integrating engineering into the K-12 science curriculum continues to be a focus in national reform efforts in science education. Although there is an increasing interest in research in and practice of integrating engineering in K-12 science education, to date only a few studies have focused on the development of an assessment tool to measure…

Descriptors: Middle School Students, Engineering, Design, Science Education

Developing an Instrument to Detect Science Misconception of an Elementary School Teacher

Peer reviewed

Desstya, Anatri; Prasetyo, Zuhdan Kun; Suyanta; Susila, Ihwan; Irwanto – International Journal of Instruction, 2019

This study aims to report the development of an instrument that is standardized (reviewed by validity, reliability, and difficulty index) to detect science misconception in an elementary school teacher. This study used a 4-D model: defining, designing, developing, and disseminating. First, it was prepared with 47 open-ended questions, and then it…

Descriptors: Elementary School Teachers, Misconceptions, Evaluation Methods, Teacher Evaluation
