Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 1 |
Since 2016 (last 10 years) | 6 |
Since 2006 (last 20 years) | 17 |
Descriptor
Test Theory | 18 |
Grade 8 | 16 |
Item Response Theory | 13 |
Test Items | 11 |
Difficulty Level | 9 |
Foreign Countries | 7 |
Grade 6 | 7 |
Grade 4 | 6 |
Grade 7 | 6 |
Grade 5 | 5 |
Mathematics Tests | 5 |
More ▼ |
Source
Author
Tindal, Gerald | 4 |
Liu, Kimy | 3 |
Guler, Nese | 2 |
Ilhan, Mustafa | 2 |
Ketterlin-Geller, Leanne R. | 2 |
Abedalaziz, Nabeel | 1 |
Alonzo, Julie | 1 |
Anderson, Daniel | 1 |
Assouline, Susan G. | 1 |
Atar, Hakan Yavuz | 1 |
Ayva Yörü, Fatma Gökçen | 1 |
More ▼ |
Publication Type
Reports - Research | 15 |
Journal Articles | 13 |
Numerical/Quantitative Data | 4 |
Dissertations/Theses -… | 1 |
Reports - Descriptive | 1 |
Reports - Evaluative | 1 |
Education Level
Grade 8 | 18 |
Middle Schools | 14 |
Secondary Education | 14 |
Elementary Education | 13 |
Junior High Schools | 13 |
Grade 4 | 8 |
Grade 6 | 8 |
Grade 7 | 8 |
Grade 5 | 6 |
Intermediate Grades | 6 |
Grade 3 | 4 |
More ▼ |
Audience
Location
Turkey | 3 |
Tennessee | 2 |
United States | 2 |
Colorado | 1 |
Cyprus | 1 |
Florida | 1 |
New York | 1 |
North Carolina | 1 |
Norway | 1 |
South Africa | 1 |
South Korea | 1 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
Trends in International… | 3 |
National Assessment of… | 2 |
Progress in International… | 1 |
Strengths and Difficulties… | 1 |
Writing Apprehension Test | 1 |
What Works Clearinghouse Rating
Huebner, Alan; Skar, Gustaf B. – Practical Assessment, Research & Evaluation, 2021
Writing assessments often consist of students responding to multiple prompts, which are judged by more than one rater. To establish the reliability of these assessments, there exist different methods to disentangle variation due to prompts and raters, including classical test theory, Many Facet Rasch Measurement (MFRM), and Generalizability Theory…
Descriptors: Error of Measurement, Test Theory, Generalizability Theory, Item Response Theory
LeBeau, Brandon; Assouline, Susan G.; Mahatmya, Duhita; Lupkowski-Shoplik, Ann – Gifted Child Quarterly, 2020
This study investigated the application of item response theory (IRT) to expand the range of ability estimates for gifted (hereinafter referred to as high-achieving) students' performance on an above-level test. Using a sample of fourth- to sixth-grade high-achieving students (N = 1,893), we conducted a study to compare estimates from two…
Descriptors: Item Response Theory, Test Theory, Academically Gifted, High Achievement
Ilhan, Mustafa; Guler, Nese – Eurasian Journal of Educational Research, 2018
Purpose: This study aimed to compare difficulty indices calculated for open-ended items in accordance with the classical test theory (CTT) and the Many-Facet Rasch Model (MFRM). Although theoretical differences between CTT and MFRM occupy much space in the literature, the number of studies empirically comparing the two theories is quite limited.…
Descriptors: Difficulty Level, Test Items, Test Theory, Item Response Theory
Ayva Yörü, Fatma Gökçen; Atar, Hakan Yavuz – Journal of Pedagogical Research, 2019
The aim of this study is to examine whether the items in the mathematics subtest of the Centralized High School Entrance Placement Test [HSEPT] administered in 2012 by the Ministry of National Education in Turkey show DIF according to gender and type of school. For this purpose, SIBTEST, Breslow-Day, Lord's [chi-squared] and Raju's area…
Descriptors: Test Bias, Mathematics Tests, Test Items, Gender Differences
Güler, Nese; Ilhan, Mustafa; Güneyli, Ahmet; Demir, Süleyman – Educational Sciences: Theory and Practice, 2017
This study evaluates the psychometric properties of three different forms of the Writing Apprehension Test (WAT; Daly & Miller, 1975) through Rasch analysis. For this purpose, the fit statistics and correlation coefficients, and the reliability, separation ratio, and chi-square values for the facets of item and person calculated for the…
Descriptors: Writing Apprehension, Psychometrics, Item Response Theory, Tests
Stugart, Melissa – ProQuest LLC, 2016
Our nation is in the midst of one of the largest education reforms in decades centered on the adoption of the Common Core State Standards (CCSS) and aligned assessments. In an era of rising accountability measures and declining literacy proficiency, it is vital to ensure that educational resources, such as benchmark assessments, are appropriately…
Descriptors: Common Core State Standards, Benchmarking, Educational Assessment, Test Items
Choi, Kyong Mi; Lee, Young-Sun; Park, Yoon Soo – EURASIA Journal of Mathematics, Science & Technology Education, 2015
International trended assessments have long attempted to provide instructional information to educational researchers and classroom teachers. Studies have shown that traditional methods of item analysis have not provided specific information that can be directly applicable to improve student performance. To this end, cognitive diagnosis models…
Descriptors: International Assessment, Mathematics Tests, Grade 8, Models
Abedalaziz, Nabeel; Leng, Chin Hai – Malaysian Online Journal of Educational Sciences, 2013
Most of the tests and inventories used by counseling psychologists have been developed using CTT; IRT derives from what is called latent trait theory. A number of important differences exist between CTT- versus IRT-based approaches to both test development and evaluation, as well as the process of scoring the response profiles of individual…
Descriptors: Test Theory, Item Response Theory, Difficulty Level, Models
Murphy, Daniel L.; Beretvas, S. Natasha – Applied Measurement in Education, 2015
This study examines the use of cross-classified random effects models (CCrem) and cross-classified multiple membership random effects models (CCMMrem) to model rater bias and estimate teacher effectiveness. Effect estimates are compared using CTT versus item response theory (IRT) scaling methods and three models (i.e., conventional multilevel…
Descriptors: Teacher Effectiveness, Comparative Analysis, Hierarchical Linear Modeling, Test Theory
Lane, Kathleen Lynne; Oakes, Wendy Peia; Carter, Erik W.; Lambert, Warren E.; Jenkins, Abbie B. – Assessment for Effective Intervention, 2013
We reported findings of an exploratory validation study of a revised universal screening instrument: the Student Risk Screening Scale--Internalizing and Externalizing (SRSS-IE) for use with middle school students. Tested initially for use with elementary-age students, the SRSS-IE was adapted to include seven additional items reflecting…
Descriptors: Test Reliability, Test Validity, Screening Tests, Middle School Students
Anderson, Daniel; Alonzo, Julie; Tindal, Gerald – Behavioral Research and Teaching, 2012
In this technical report, we describe the results of a study of mathematics items written to align with the Common Core State Standards (CCSS) in grades 6-8. In each grade, CCSS items were organized into forms, and the reliability of these forms was evaluated along with an experimental form including items aligned with the National Council of…
Descriptors: Curriculum Based Assessment, Mathematics Tests, Academic Standards, State Standards
Scherman, Vanessa; Howie, Sarah J.; Bosker, Roel J. – Educational Research and Evaluation, 2011
In information-rich environments, schools are often presented with a myriad of data from which decisions need to be made. The use of the information on a classroom level may be facilitated if performance could be described in terms of levels of proficiency or benchmarks. The aim of this article is to explore benchmarks using data from a monitoring…
Descriptors: Standard Setting, Foreign Countries, Grade 8, Ability
Guler, Nese; Gelbal, Selahattin – Educational Sciences: Theory and Practice, 2010
In this study, the Classical test theory and generalizability theory were used for determination to reliability of scores obtained from measurement tool of mathematics success. 24 open-ended mathematics question of the TIMSS-1999 was applied to 203 students in 2007-spring semester. Internal consistency of scores was found as 0.92. For…
Descriptors: Generalizability Theory, Test Theory, Test Reliability, Interrater Reliability
Wang, Jianjun – School Science and Mathematics, 2011
As the largest international study ever taken in history, the Trend in Mathematics and Science Study (TIMSS) has been held as a benchmark to measure U.S. student performance in the global context. In-depth analyses of the TIMSS project are conducted in this study to examine key issues of the comparative investigation: (1) item flaws in mathematics…
Descriptors: Test Items, Figurative Language, Item Response Theory, Benchmarking
Jung, Eunju; Liu, Kimy; Ketterlin-Geller, Leanne R.; Tindal, Gerald – Behavioral Research and Teaching, 2008
The purpose of this study was to develop general outcome measures (GOM) in mathematics so that teachers could focus their instruction on needed prerequisite skills. We describe in detail, the manner in which content-related evidence was established and then present a number of statistical analyses conducted to evaluate the technical adequacy of…
Descriptors: Item Analysis, Test Construction, Test Theory, Mathematics Tests
Previous Page | Next Page »
Pages: 1 | 2