Publication Date: In 2025 (0); Since 2024 (0); Since 2021, last 5 years (1); Since 2016, last 10 years (5); Since 2006, last 20 years (9)
Descriptor: Hierarchical Linear Modeling (9); Scores (4); Test Items (4); Comparative Analysis (3); Grade 8 (3); Item Response Theory (3); Regression (Statistics) (3); Computation (2); Elementary School Teachers (2); Goodness of Fit (2); Grade 5 (2)
Source: Applied Measurement in… (9)
Publication Type: Journal Articles (9); Reports - Research (9)
Assessments and Surveys: Trends in International… (2); SAT (College Admission Test) (1)
van Alphen, Thijmen; Jak, Suzanne; Jansen in de Wal, Joost; Schuitema, Jaap; Peetsma, Thea – Applied Measurement in Education, 2022
Intensive longitudinal data is increasingly used to study state-like processes such as changes in daily stress. Measures aimed at collecting such data require the same level of scrutiny regarding scale reliability as traditional questionnaires. The most prevalent methods used to assess reliability of intensive longitudinal measures are based on…
Descriptors: Test Reliability, Measures (Individuals), Anxiety, Data Collection
Quesen, Sarah; Lane, Suzanne – Applied Measurement in Education, 2019
This study examined the effect of similar vs. dissimilar proficiency distributions on uniform DIF detection on a statewide eighth grade mathematics assessment. Results from the similar- and dissimilar-ability reference groups with an SWD focal group were compared for four models: logistic regression, hierarchical generalized linear model (HGLM),…
Descriptors: Test Items, Mathematics Tests, Grade 8, Item Response Theory
Lee, HyeSun – Applied Measurement in Education, 2018
The current simulation study examined the effects of Item Parameter Drift (IPD) occurring in a short scale on parameter estimates in multilevel models where scores from a scale were employed as a time-varying predictor to account for outcome scores. Five factors, including three decisions about IPD, were considered for simulation conditions. It…
Descriptors: Test Items, Hierarchical Linear Modeling, Predictor Variables, Scores
Oliveri, Maria; McCaffrey, Daniel; Ezzo, Chelsea; Holtzman, Steven – Applied Measurement in Education, 2017
The assessment of noncognitive traits is challenging due to possible response biases, "subjectivity" and "faking." Standardized third-party evaluations where an external evaluator rates an applicant on their strengths and weaknesses on various noncognitive traits are a promising alternative. However, accurate score-based…
Descriptors: Factor Analysis, Decision Making, College Admission, Likert Scales
Walker, A. Adrienne; Engelhard, George, Jr. – Applied Measurement in Education, 2015
The idea that test scores may not be valid representations of what students know, can do, and should learn next is well known. Person fit provides an important aspect of validity evidence. Person fit analyses at the individual student level are not typically conducted and person fit information is not communicated to educational stakeholders. In…
Descriptors: Test Validity, Goodness of Fit, Educational Assessment, Hierarchical Linear Modeling
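The entry above argues that person fit is validity evidence rarely reported at the student level. One common index is the standardized log-likelihood statistic l_z (Drasgow, Levine, & Williams, 1985); a minimal sketch under a Rasch model with assumed (not estimated) item difficulties and ability, on fabricated response patterns:

```python
import numpy as np
from scipy.special import expit

rng = np.random.default_rng(3)

b = np.linspace(-2, 2, 20)   # assumed item difficulties
theta = 0.5                  # one examinee's assumed ability
p = expit(theta - b)         # Rasch probabilities of a correct response

def lz(x, p):
    """Standardized log-likelihood person-fit statistic l_z."""
    l0 = np.sum(x * np.log(p) + (1 - x) * np.log(1 - p))   # observed
    e = np.sum(p * np.log(p) + (1 - p) * np.log(1 - p))    # expected
    v = np.sum(p * (1 - p) * np.log(p / (1 - p)) ** 2)     # variance
    return (l0 - e) / np.sqrt(v)

normal = rng.binomial(1, p)            # model-consistent responses
aberrant = 1 - (b < 0).astype(int)     # misses easy items, solves hard ones
print(f"lz normal:   {lz(normal, p):+.2f}")
print(f"lz aberrant: {lz(aberrant, p):+.2f}")
```

Large negative l_z values flag response patterns (like the reversed one above) whose scores may not validly represent what the student knows, which is the kind of student-level evidence the entry says is seldom communicated to stakeholders.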
Cheong, Yuk Fai; Kamata, Akihito – Applied Measurement in Education, 2013
In this article, we discuss and illustrate two centering and anchoring options available in differential item functioning (DIF) detection studies based on the hierarchical generalized linear and generalized linear mixed modeling frameworks. We compared and contrasted the assumptions of the two options, and examined the properties of their DIF…
Descriptors: Test Bias, Hierarchical Linear Modeling, Comparative Analysis, Test Items
Allen, Jeff – Applied Measurement in Education, 2017
Using a sample of schools testing annually in grades 9-11 with a vertically linked series of assessments, a latent growth curve model is used to model test scores with student intercepts and slopes nested within school. Missed assessments can occur because of student mobility, student dropout, absenteeism, and other reasons. Missing data…
Descriptors: Achievement Gains, Academic Achievement, Growth Models, Scores
Murphy, Daniel L.; Beretvas, S. Natasha – Applied Measurement in Education, 2015
This study examines the use of cross-classified random effects models (CCrem) and cross-classified multiple membership random effects models (CCMMrem) to model rater bias and estimate teacher effectiveness. Effect estimates are compared using classical test theory (CTT) versus item response theory (IRT) scaling methods and three models (i.e., conventional multilevel…
Descriptors: Teacher Effectiveness, Comparative Analysis, Hierarchical Linear Modeling, Test Theory
Welsh, Megan E.; Eastwood, Melissa; D'Agostino, Jerome V. – Applied Measurement in Education, 2014
Teacher and school accountability systems based on high-stakes tests are ubiquitous throughout the United States and appear to be growing as a catalyst for reform. As a result, educators have increased the proportion of instructional time devoted to test preparation. Although guidelines for what constitutes appropriate and inappropriate test…
Descriptors: High Stakes Tests, Instruction, Test Preparation, Grade 3