ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	2
Since 2006 (last 20 years)	12

Descriptor

Hierarchical Linear Modeling	13
Item Response Theory	13
Scores	7
Comparative Analysis	6
Computation	6
Foreign Countries	5
Grade 8	5
Measurement	5
Mathematics Achievement	4
Mathematics Tests	4
Pretests Posttests	4
Regression (Statistics)	4
Student Evaluation	4
Academic Achievement	3
Educational Assessment	3
Grade 6	3
Grade 9	3
Instructional Effectiveness	3
Intervention	3
Statistical Analysis	3
Test Bias	3
Test Items	3
Achievement Tests	2
Control Groups	2
Correlation	2
More ▼

Source

Applied Measurement in…	2
ETS Research Report Series	2
Grantee Submission	2
Educational Policy	1
Educational and Psychological…	1
International Journal of…	1
Journal of Early Adolescence	1
Journal of Educational and…	1
National Center for Research…	1
ProQuest LLC	1

Publication Type

Journal Articles	11
Reports - Research	11
Dissertations/Theses -…	1
Reports - Descriptive	1
Tests/Questionnaires	1

Education Level

Middle Schools	13
Junior High Schools	9
Secondary Education	9
Elementary Education	8
Grade 8	5
Intermediate Grades	5
Grade 6	3
Grade 9	3
High Schools	3
Grade 4	2
Grade 5	2
Grade 7	2
Early Childhood Education	1
Elementary Secondary Education	1
Primary Education	1
More ▼

Audience

Location

Canada	1
Colorado	1
Florida	1
Germany	1
Italy	1
Netherlands	1
New York	1
North Carolina	1
Qatar	1
Tennessee	1
Texas	1
More ▼

Laws, Policies, & Programs

No Child Left Behind Act 2001

Assessments and Surveys

National Assessment of…	2
Early Childhood Longitudinal…	1
Program for International…	1
Trends in International…	1

What Works Clearinghouse Rating

Meets WWC Standards without Reservations	1
Meets WWC Standards with or without Reservations	1

Showing all 13 results Save | Export

Differential Item Functioning for Accommodated Students with Disabilities: Effect of Differences in Proficiency Distributions

Peer reviewed

Direct link

Quesen, Sarah; Lane, Suzanne – Applied Measurement in Education, 2019

This study examined the effect of similar vs. dissimilar proficiency distributions on uniform DIF detection on a statewide eighth grade mathematics assessment. Results from the similar- and dissimilar-ability reference groups with an SWD focal group were compared for four models: logistic regression, hierarchical generalized linear model (HGLM),…

Descriptors: Test Items, Mathematics Tests, Grade 8, Item Response Theory

Introduction to Multilevel Item Response Theory Analysis: Descriptive and Explanatory Models

Peer reviewed

Direct link

Sulis, Isabella; Toland, Michael D. – Journal of Early Adolescence, 2017

Item response theory (IRT) models are the main psychometric approach for the development, evaluation, and refinement of multi-item instruments and scaling of latent traits, whereas multilevel models are the primary statistical method when considering the dependence between person responses when primary units (e.g., students) are nested within…

Descriptors: Hierarchical Linear Modeling, Item Response Theory, Psychometrics, Evaluation Methods

Effects of Design Properties on Parameter Estimation in Large-Scale Assessments

Peer reviewed

Direct link

Hecht, Martin; Weirich, Sebastian; Siegle, Thilo; Frey, Andreas – Educational and Psychological Measurement, 2015

The selection of an appropriate booklet design is an important element of large-scale assessments of student achievement. Two design properties that are typically optimized are the "balance" with respect to the positions the items are presented and with respect to the mutual occurrence of pairs of items in the same booklet. The purpose…

Descriptors: Measurement, Computation, Test Format, Test Items

Multilevel Multidimensional Item Response Model with a Multilevel Latent Covariate

Peer reviewed
PDF on ERIC

Download full text

Direct link

Cho, Sun-Joo; Bottge, Brian A. – Grantee Submission, 2015

In a pretest-posttest cluster-randomized trial, one of the methods commonly used to detect an intervention effect involves controlling pre-test scores and other related covariates while estimating an intervention effect at post-test. In many applications in education, the total post-test and pre-test scores that ignores measurement error in the…

Descriptors: Item Response Theory, Hierarchical Linear Modeling, Pretests Posttests, Scores

Measuring Student Ability, Classifying Schools, and Detecting Item Bias at School Level, Based on Student-Level Dichotomous Items

Peer reviewed

Direct link

Bennink, Margot; Croon, Marcel A.; Keuning, Jos; Vermunt, Jeroen K. – Journal of Educational and Behavioral Statistics, 2014

In educational measurement, responses of students on items are used not only to measure the ability of students, but also to evaluate and compare the performance of schools. Analysis should ideally account for the multilevel structure of the data, and school-level processes not related to ability, such as working climate and administration…

Descriptors: Academic Ability, Educational Assessment, Educational Testing, Test Bias

The Effects of Math Video Games on Learning: A Randomized Evaluation Study with Innovative Impact Estimation Techniques. CRESST Report 841

Download full text

Chung, Gregory K. W. K.; Choi, Kilchan; Baker, Eva L.; Cai, Li – National Center for Research on Evaluation, Standards, and Student Testing (CRESST), 2014

A large-scale randomized controlled trial tested the effects of researcher-developed learning games on a transfer measure of fractions knowledge. The measure contained items similar to standardized assessments. Thirty treatment and 29 control classrooms (~1500 students, 9 districts, 26 schools) participated in the study. Students in treatment…

Descriptors: Video Games, Educational Games, Mathematics Instruction, Mathematics

Explore the Usefulness of Person-Fit Analysis on Large-Scale Assessment

Peer reviewed

Direct link

Cui, Ying; Mousavi, Amin – International Journal of Testing, 2015

The current study applied the person-fit statistic, l[subscript z], to data from a Canadian provincial achievement test to explore the usefulness of conducting person-fit analysis on large-scale assessments. Item parameter estimates were compared before and after the misfitting student responses, as identified by l[subscript z], were removed. The…

Descriptors: Measurement, Achievement Tests, Comparative Analysis, Test Items

A Comparison of Teacher Effectiveness Measures Calculated Using Three Multilevel Models for Raters Effects

Peer reviewed

Direct link

Murphy, Daniel L.; Beretvas, S. Natasha – Applied Measurement in Education, 2015

This study examines the use of cross-classified random effects models (CCrem) and cross-classified multiple membership random effects models (CCMMrem) to model rater bias and estimate teacher effectiveness. Effect estimates are compared using CTT versus item response theory (IRT) scaling methods and three models (i.e., conventional multilevel…

Descriptors: Teacher Effectiveness, Comparative Analysis, Hierarchical Linear Modeling, Test Theory

Multilevel Linkages between State Standards, Teacher Standards, and Student Achievement: Testing External versus Internal Standards-Based Education Models

Peer reviewed

Direct link

Lee, Jaekyung; Liu, Xiaoyan; Amo, Laura Casey; Wang, Weichun Leilani – Educational Policy, 2014

Drawing on national and state assessment datasets in reading and math, this study tested "external" versus "internal" standards-based education models. The goal was to understand whether and how student performance standards work in multilayered school systems under No Child Left Behind Act of 2001 (NCLB). Under the…

Descriptors: State Standards, Academic Standards, Student Evaluation, Academic Achievement

Detecting Intervention Effects Using a Multilevel Latent Transition Analysis with a Mixture IRT Model

Peer reviewed
PDF on ERIC

Download full text

Direct link

Cho, Sun-Joo; Cohen, Allan S.; Bottge, Brian – Grantee Submission, 2013

A multilevel latent transition analysis (LTA) with a mixture IRT measurement model (MixIRTM) is described for investigating the effectiveness of an intervention. The addition of a MixIRTM to the multilevel LTA permits consideration of both potential heterogeneity in students' response to instructional intervention as well as a methodology for…

Descriptors: Intervention, Item Response Theory, Statistical Analysis, Models

Evaluating Academic Progress without a Vertical Scale. Research Report. ETS RR-12-07

Peer reviewed
PDF on ERIC

Download full text

Yen, Wendy M.; Lall, Venessa F.; Monfils, Lora – ETS Research Report Series, 2012

Alternatives to vertical scales are compared for measuring longitudinal academic growth and for producing school-level growth measures. The alternatives examined were empirical cross-grade regression, ordinary least squares and logistic regression, and multilevel models. The student data used for the comparisons were Arabic Grades 4 to 10 in…

Descriptors: Foreign Countries, Scaling, Item Response Theory, Test Interpretation

Evaluation of the Effect of a Digital Mathematics Game on Academic Achievement

Direct link

Wale, Christine M. – ProQuest LLC, 2013

Digital games are widely popular and interest has increased for their use in education. Digital games are thought to be powerful instructional tools because they promote active learning and feedback, provide meaningful contexts to situate knowledge, create engagement and intrinsic motivation, and have the ability individualize instruction.…

Descriptors: Academic Achievement, Mathematics, Mathematics Instruction, Mathematical Aptitude

A Bayesian Hierarchical Model for Large-Scale Educational Surveys: An Application to the National Assessment of Educational Progress. Research Report. ETS RR-04-38

Peer reviewed
PDF on ERIC

Download full text

Johnson, Matthew S.; Jenkins, Frank – ETS Research Report Series, 2005

Large-scale educational assessments such as the National Assessment of Educational Progress (NAEP) sample examinees to whom an exam will be administered. In most situations the sampling design is not a simple random sample and must be accounted for in the estimating model. After reviewing the current operational estimation procedure for NAEP, this…

Descriptors: Bayesian Statistics, Hierarchical Linear Modeling, National Competency Tests, Sampling

Cho, Sun-Joo	2
Amo, Laura Casey	1
Baker, Eva L.	1
Bennink, Margot	1
Beretvas, S. Natasha	1
Bottge, Brian	1
Bottge, Brian A.	1
Cai, Li	1
Choi, Kilchan	1
Chung, Gregory K. W. K.	1
Cohen, Allan S.	1
Croon, Marcel A.	1
Cui, Ying	1
Frey, Andreas	1
Hecht, Martin	1
Jenkins, Frank	1
Johnson, Matthew S.	1
Keuning, Jos	1
Lall, Venessa F.	1
Lane, Suzanne	1
Lee, Jaekyung	1
Liu, Xiaoyan	1
Monfils, Lora	1
Mousavi, Amin	1
Murphy, Daniel L.	1
More ▼