Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 2 |
Since 2006 (last 20 years) | 15 |
Descriptor
Source
Author
Cho, Sun-Joo | 2 |
Quesen, Sarah | 2 |
Amo, Laura Casey | 1 |
Baker, Eva L. | 1 |
Barkaoui, Khaled | 1 |
Bottge, Brian | 1 |
Bottge, Brian A. | 1 |
Cai, Li | 1 |
Carstensen, Claus H. | 1 |
Chavez, Oscar | 1 |
Choi, Kilchan | 1 |
More ▼ |
Publication Type
Reports - Research | 11 |
Journal Articles | 9 |
Dissertations/Theses -… | 4 |
Education Level
Middle Schools | 7 |
Secondary Education | 6 |
Elementary Education | 5 |
Grade 8 | 4 |
Junior High Schools | 4 |
Intermediate Grades | 3 |
Grade 6 | 2 |
High Schools | 2 |
Elementary Secondary Education | 1 |
Grade 12 | 1 |
Grade 4 | 1 |
More ▼ |
Audience
Location
Qatar | 1 |
Laws, Policies, & Programs
No Child Left Behind Act 2001 | 1 |
Assessments and Surveys
Early Childhood Longitudinal… | 1 |
Iowa Tests of Educational… | 1 |
National Assessment of… | 1 |
Test of English as a Foreign… | 1 |
What Works Clearinghouse Rating
Meets WWC Standards without Reservations | 1 |
Meets WWC Standards with or without Reservations | 1 |
Quesen, Sarah; Lane, Suzanne – Applied Measurement in Education, 2019
This study examined the effect of similar vs. dissimilar proficiency distributions on uniform DIF detection on a statewide eighth grade mathematics assessment. Results from the similar- and dissimilar-ability reference groups with an SWD focal group were compared for four models: logistic regression, hierarchical generalized linear model (HGLM),…
Descriptors: Test Items, Mathematics Tests, Grade 8, Item Response Theory
Kelcey, Ben – Society for Research on Educational Effectiveness, 2014
Valid and reliable measurement of teaching is essential to evaluating and improving teacher effectiveness and advancing large-scale policy-relevant research in education (Raudenbush & Sadoff, 2008). One increasingly common component of teaching evaluations is the direct observation of teachers in their classrooms. Classroom observations have…
Descriptors: Teacher Effectiveness, Teacher Evaluation, Measurement, Psychometrics
Shin, Hyo Jeong – ProQuest LLC, 2015
This dissertation is comprised of three papers that propose and apply psychometric models to deal with complexities and challenges in large-scale assessments, focusing on modeling rater effects and complex learning progressions. In particular, three papers investigate extensions and applications of multilevel and multidimensional item response…
Descriptors: Item Response Theory, Psychometrics, Models, Measurement
Cho, Sun-Joo; Bottge, Brian A. – Grantee Submission, 2015
In a pretest-posttest cluster-randomized trial, one of the methods commonly used to detect an intervention effect involves controlling pre-test scores and other related covariates while estimating an intervention effect at post-test. In many applications in education, the total post-test and pre-test scores that ignores measurement error in the…
Descriptors: Item Response Theory, Hierarchical Linear Modeling, Pretests Posttests, Scores
Chung, Gregory K. W. K.; Choi, Kilchan; Baker, Eva L.; Cai, Li – National Center for Research on Evaluation, Standards, and Student Testing (CRESST), 2014
A large-scale randomized controlled trial tested the effects of researcher-developed learning games on a transfer measure of fractions knowledge. The measure contained items similar to standardized assessments. Thirty treatment and 29 control classrooms (~1500 students, 9 districts, 26 schools) participated in the study. Students in treatment…
Descriptors: Video Games, Educational Games, Mathematics Instruction, Mathematics
Quesen, Sarah – ProQuest LLC, 2016
When studying differential item functioning (DIF) with students with disabilities (SWD) focal groups typically suffer from small sample size, whereas the reference group population is usually large. This makes it possible for a researcher to select a sample from the reference population to be similar to the focal group on the ability scale. Doing…
Descriptors: Test Items, Academic Accommodations (Disabilities), Testing Accommodations, Disabilities
Barkaoui, Khaled – Language Assessment Quarterly, 2013
This article critiques traditional single-level statistical approaches (e.g., multiple regression analysis) to examining relationships between language test scores and variables in the assessment setting. It highlights the conceptual, methodological, and statistical problems associated with these techniques in dealing with multilevel or nested…
Descriptors: Hierarchical Linear Modeling, Statistical Analysis, Multiple Regression Analysis, Generalizability Theory
Lee, Jaekyung; Liu, Xiaoyan; Amo, Laura Casey; Wang, Weichun Leilani – Educational Policy, 2014
Drawing on national and state assessment datasets in reading and math, this study tested "external" versus "internal" standards-based education models. The goal was to understand whether and how student performance standards work in multilayered school systems under No Child Left Behind Act of 2001 (NCLB). Under the…
Descriptors: State Standards, Academic Standards, Student Evaluation, Academic Achievement
Grouws, Douglas A.; Tarr, James E.; Chavez, Oscar; Sears, Ruthmae; Soria, Victor M.; Taylan, Rukiye D. – Journal for Research in Mathematics Education, 2013
This study examined the effect of 2 types of mathematics content organization on high school students' mathematics learning while taking account of curriculum implementation and student prior achievement. Hierarchical linear modeling with 3 levels showed that students who studied from the integrated curriculum were significantly advantaged over…
Descriptors: Secondary School Mathematics, Curriculum Implementation, Integrated Curriculum, High School Students
Cho, Sun-Joo; Cohen, Allan S.; Bottge, Brian – Grantee Submission, 2013
A multilevel latent transition analysis (LTA) with a mixture IRT measurement model (MixIRTM) is described for investigating the effectiveness of an intervention. The addition of a MixIRTM to the multilevel LTA permits consideration of both potential heterogeneity in students' response to instructional intervention as well as a methodology for…
Descriptors: Intervention, Item Response Theory, Statistical Analysis, Models
Thomas, Matthew – ProQuest LLC, 2013
This dissertation examines the relationship between an instructional style called Interactive-Engagement (IE) and gains on a measure of conceptual knowledge called the Calculus Concept Inventory (CCI). The data comes from two semesters of introductory calculus courses (Fall 2010 and Spring 2011), consisting of a total of 482 students from the…
Descriptors: Introductory Courses, Calculus, Mathematics Instruction, Teaching Styles
Yen, Wendy M.; Lall, Venessa F.; Monfils, Lora – ETS Research Report Series, 2012
Alternatives to vertical scales are compared for measuring longitudinal academic growth and for producing school-level growth measures. The alternatives examined were empirical cross-grade regression, ordinary least squares and logistic regression, and multilevel models. The student data used for the comparisons were Arabic Grades 4 to 10 in…
Descriptors: Foreign Countries, Scaling, Item Response Theory, Test Interpretation
Wale, Christine M. – ProQuest LLC, 2013
Digital games are widely popular and interest has increased for their use in education. Digital games are thought to be powerful instructional tools because they promote active learning and feedback, provide meaningful contexts to situate knowledge, create engagement and intrinsic motivation, and have the ability individualize instruction.…
Descriptors: Academic Achievement, Mathematics, Mathematics Instruction, Mathematical Aptitude
Vaughn, Brandon K. – Journal on School Educational Technology, 2008
This study considers the importance of contextual effects on the quality of assessments on item bias and differential item functioning (DIF) in measurement. Often, in educational studies, students are clustered in teachers or schools, and the clusters could impact psychometric issues yet are largely ignored by traditional item analyses. A…
Descriptors: Test Bias, Educational Assessment, Educational Quality, Context Effect
von Davier, Alina A.; Carstensen, Claus H.; von Davier, Matthias – ETS Research Report Series, 2006
Measuring and linking competencies require special instruments, special data collection designs, and special statistical models. The measurement instruments are tests or tests forms, which can be used in the following situations: The same test can be given repeatedly; two or more parallel tests forms (i.e., forms intended to be similar in…
Descriptors: Scores, Measurement Techniques, Competence, Comparative Analysis