ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	1
Since 2016 (last 10 years)	6
Since 2006 (last 20 years)	17

Descriptor

Test Theory	18
Grade 8	16
Item Response Theory	13
Test Items	11
Difficulty Level	9
Foreign Countries	7
Grade 6	7
Grade 4	6
Grade 7	6
Grade 5	5
Mathematics Tests	5
Comparative Analysis	4
Statistical Analysis	4
Test Reliability	4
Benchmarking	3
Computation	3
Generalizability Theory	3
Models	3
Public Schools	3
Reading Tests	3
Test Construction	3
Academic Achievement	2
Achievement Tests	2
Alignment (Education)	2
Correlation	2
More ▼

Source

Behavioral Research and…	4
Educational Sciences: Theory…	2
Applied Measurement in…	1
Assessment for Effective…	1
ETS Research Report Series	1
EURASIA Journal of…	1
Educational Research and…	1
Eurasian Journal of…	1
Gifted Child Quarterly	1
Journal of Pedagogical…	1
Malaysian Online Journal of…	1
Practical Assessment,…	1
ProQuest LLC	1
School Science and Mathematics	1
More ▼

Publication Type

Reports - Research	15
Journal Articles	13
Numerical/Quantitative Data	4
Dissertations/Theses -…	1
Reports - Descriptive	1
Reports - Evaluative	1

Education Level

Grade 8	18
Middle Schools	14
Secondary Education	14
Elementary Education	13
Junior High Schools	13
Grade 4	8
Grade 6	8
Grade 7	8
Grade 5	6
Intermediate Grades	6
Grade 3	4
High Schools	3
Grade 10	2
Grade 9	2
More ▼

Audience

Location

Turkey	3
Tennessee	2
United States	2
Colorado	1
Cyprus	1
Florida	1
New York	1
North Carolina	1
Norway	1
South Africa	1
South Korea	1
Texas	1
Turkey (Ankara)	1
More ▼

Laws, Policies, & Programs

Assessments and Surveys

Trends in International…	3
National Assessment of…	2
Progress in International…	1
Strengths and Difficulties…	1
Writing Apprehension Test	1

What Works Clearinghouse Rating

Showing 1 to 15 of 18 results Save | Export

Conditional Standard Error of Measurement: Classical Test Theory, Generalizability Theory and Many-Facet Rasch Measurement with Applications to Writing Assessment

Peer reviewed
PDF on ERIC

Download full text

Huebner, Alan; Skar, Gustaf B. – Practical Assessment, Research & Evaluation, 2021

Writing assessments often consist of students responding to multiple prompts, which are judged by more than one rater. To establish the reliability of these assessments, there exist different methods to disentangle variation due to prompts and raters, including classical test theory, Many Facet Rasch Measurement (MFRM), and Generalizability Theory…

Descriptors: Error of Measurement, Test Theory, Generalizability Theory, Item Response Theory

Differentiating among High-Achieving Learners: A Comparison of Classical Test Theory and Item Response Theory on Above-Level Testing

Direct link

LeBeau, Brandon; Assouline, Susan G.; Mahatmya, Duhita; Lupkowski-Shoplik, Ann – Gifted Child Quarterly, 2020

This study investigated the application of item response theory (IRT) to expand the range of ability estimates for gifted (hereinafter referred to as high-achieving) students' performance on an above-level test. Using a sample of fourth- to sixth-grade high-achieving students (N = 1,893), we conducted a study to compare estimates from two…

Descriptors: Item Response Theory, Test Theory, Academically Gifted, High Achievement

A Comparison of Difficulty Indices Calculated for Open-Ended Items According to Classical Test Theory and Many Facet Rasch Model

Peer reviewed
PDF on ERIC

Download full text

Ilhan, Mustafa; Guler, Nese – Eurasian Journal of Educational Research, 2018

Purpose: This study aimed to compare difficulty indices calculated for open-ended items in accordance with the classical test theory (CTT) and the Many-Facet Rasch Model (MFRM). Although theoretical differences between CTT and MFRM occupy much space in the literature, the number of studies empirically comparing the two theories is quite limited.…

Descriptors: Difficulty Level, Test Items, Test Theory, Item Response Theory

Determination of Differential Item Functioning (DIF) According to SIBTEST, Lord's [Chi-squared], Raju's Area Measurement and Breslow-Day Methods

Peer reviewed
PDF on ERIC

Download full text

Ayva Yörü, Fatma Gökçen; Atar, Hakan Yavuz – Journal of Pedagogical Research, 2019

The aim of this study is to examine whether the items in the mathematics subtest of the Centralized High School Entrance Placement Test [HSEPT] administered in 2012 by the Ministry of National Education in Turkey show DIF according to gender and type of school. For this purpose, SIBTEST, Breslow-Day, Lord's [chi-squared] and Raju's area…

Descriptors: Test Bias, Mathematics Tests, Test Items, Gender Differences

An Evaluation of the Psychometric Properties of Three Different Forms of Daly and Miller's Writing Apprehension Test through Rasch Analysis

Peer reviewed
PDF on ERIC

Download full text

Güler, Nese; Ilhan, Mustafa; Güneyli, Ahmet; Demir, Süleyman – Educational Sciences: Theory and Practice, 2017

This study evaluates the psychometric properties of three different forms of the Writing Apprehension Test (WAT; Daly & Miller, 1975) through Rasch analysis. For this purpose, the fit statistics and correlation coefficients, and the reliability, separation ratio, and chi-square values for the facets of item and person calculated for the…

Descriptors: Writing Apprehension, Psychometrics, Item Response Theory, Tests

Common Core State Standards Benchmark Assessments: Item Alignment to the Shifts in Tennessee

Direct link

Stugart, Melissa – ProQuest LLC, 2016

Our nation is in the midst of one of the largest education reforms in decades centered on the adoption of the Common Core State Standards (CCSS) and aligned assessments. In an era of rising accountability measures and declining literacy proficiency, it is vital to ensure that educational resources, such as benchmark assessments, are appropriately…

Descriptors: Common Core State Standards, Benchmarking, Educational Assessment, Test Items

What CDM Can Tell about What Students Have Learned: An Analysis of TIMSS Eighth Grade Mathematics

Peer reviewed

Direct link

Choi, Kyong Mi; Lee, Young-Sun; Park, Yoon Soo – EURASIA Journal of Mathematics, Science & Technology Education, 2015

International trended assessments have long attempted to provide instructional information to educational researchers and classroom teachers. Studies have shown that traditional methods of item analysis have not provided specific information that can be directly applicable to improve student performance. To this end, cognitive diagnosis models…

Descriptors: International Assessment, Mathematics Tests, Grade 8, Models

The Relationship between CTT and IRT Approaches in Analyzing Item Characteristics

Peer reviewed
PDF on ERIC

Download full text

Abedalaziz, Nabeel; Leng, Chin Hai – Malaysian Online Journal of Educational Sciences, 2013

Most of the tests and inventories used by counseling psychologists have been developed using CTT; IRT derives from what is called latent trait theory. A number of important differences exist between CTT- versus IRT-based approaches to both test development and evaluation, as well as the process of scoring the response profiles of individual…

Descriptors: Test Theory, Item Response Theory, Difficulty Level, Models

A Comparison of Teacher Effectiveness Measures Calculated Using Three Multilevel Models for Raters Effects

Peer reviewed

Direct link

Murphy, Daniel L.; Beretvas, S. Natasha – Applied Measurement in Education, 2015

This study examines the use of cross-classified random effects models (CCrem) and cross-classified multiple membership random effects models (CCMMrem) to model rater bias and estimate teacher effectiveness. Effect estimates are compared using CTT versus item response theory (IRT) scaling methods and three models (i.e., conventional multilevel…

Descriptors: Teacher Effectiveness, Comparative Analysis, Hierarchical Linear Modeling, Test Theory

Initial Evidence for the Reliability and Validity of the Student Risk Screening Scale for Internalizing and Externalizing Behaviors at the Middle School Level

Peer reviewed

Direct link

Lane, Kathleen Lynne; Oakes, Wendy Peia; Carter, Erik W.; Lambert, Warren E.; Jenkins, Abbie B. – Assessment for Effective Intervention, 2013

We reported findings of an exploratory validation study of a revised universal screening instrument: the Student Risk Screening Scale--Internalizing and Externalizing (SRSS-IE) for use with middle school students. Tested initially for use with elementary-age students, the SRSS-IE was adapted to include seven additional items reflecting…

Descriptors: Test Reliability, Test Validity, Screening Tests, Middle School Students

Study of the Reliability of CCSS-Aligned Math Measures (2012 Research Version): Grades 6-8. Technical Report #1312

Download full text

Anderson, Daniel; Alonzo, Julie; Tindal, Gerald – Behavioral Research and Teaching, 2012

In this technical report, we describe the results of a study of mathematics items written to align with the Common Core State Standards (CCSS) in grades 6-8. In each grade, CCSS items were organized into forms, and the reliability of these forms was evaluated along with an experimental form including items aligned with the National Council of…

Descriptors: Curriculum Based Assessment, Mathematics Tests, Academic Standards, State Standards

Constructing Benchmarks for Monitoring Purposes: Evidence from South Africa

Peer reviewed

Direct link

Scherman, Vanessa; Howie, Sarah J.; Bosker, Roel J. – Educational Research and Evaluation, 2011

In information-rich environments, schools are often presented with a myriad of data from which decisions need to be made. The use of the information on a classroom level may be facilitated if performance could be described in terms of levels of proficiency or benchmarks. The aim of this article is to explore benchmarks using data from a monitoring…

Descriptors: Standard Setting, Foreign Countries, Grade 8, Ability

Studying Reliability of Open Ended Mathematics Items According to the Classical Test Theory and Generalizability Theory

Peer reviewed
PDF on ERIC

Download full text

Guler, Nese; Gelbal, Selahattin – Educational Sciences: Theory and Practice, 2010

In this study, the Classical test theory and generalizability theory were used for determination to reliability of scores obtained from measurement tool of mathematics success. 24 open-ended mathematics question of the TIMSS-1999 was applied to 203 students in 2007-spring semester. Internal consistency of scores was found as 0.92. For…

Descriptors: Generalizability Theory, Test Theory, Test Reliability, Interrater Reliability

Re-Examining Test Item Issues in the TIMSS Mathematics and Science Assessments

Peer reviewed

Direct link

Wang, Jianjun – School Science and Mathematics, 2011

As the largest international study ever taken in history, the Trend in Mathematics and Science Study (TIMSS) has been held as a benchmark to measure U.S. student performance in the global context. In-depth analyses of the TIMSS project are conducted in this study to examine key issues of the comparative investigation: (1) item flaws in mathematics…

Descriptors: Test Items, Figurative Language, Item Response Theory, Benchmarking

Instrument Development Procedures for Mathematics Measures. Technical Report Number 08-02

Download full text

Jung, Eunju; Liu, Kimy; Ketterlin-Geller, Leanne R.; Tindal, Gerald – Behavioral Research and Teaching, 2008

The purpose of this study was to develop general outcome measures (GOM) in mathematics so that teachers could focus their instruction on needed prerequisite skills. We describe in detail, the manner in which content-related evidence was established and then present a number of statistical analyses conducted to evaluate the technical adequacy of…

Descriptors: Item Analysis, Test Construction, Test Theory, Mathematics Tests

Previous Page | Next Page »

Pages: 1 | 2

Tindal, Gerald	4
Liu, Kimy	3
Guler, Nese	2
Ilhan, Mustafa	2
Ketterlin-Geller, Leanne R.	2
Abedalaziz, Nabeel	1
Alonzo, Julie	1
Anderson, Daniel	1
Assouline, Susan G.	1
Atar, Hakan Yavuz	1
Ayva Yörü, Fatma Gökçen	1
Beretvas, S. Natasha	1
Bosker, Roel J.	1
Carling, Kristy	1
Carter, Erik W.	1
Choi, Kyong Mi	1
Demir, Süleyman	1
Gelbal, Selahattin	1
Geller, Leanne Ketterlin	1
Güler, Nese	1
Güneyli, Ahmet	1
Howie, Sarah J.	1
Huebner, Alan	1
Jenkins, Abbie B.	1
Jung, Eunju	1
More ▼