Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 0 |
Since 2006 (last 20 years) | 3 |
Descriptor
Source
Journal of Educational… | 3 |
Applied Measurement in… | 2 |
Asia Pacific Education Review | 2 |
Educational and Psychological… | 1 |
Author
Lee, Guemin | 13 |
Fitzpatrick, Anne R. | 4 |
Frisbie, David A. | 2 |
Lewis, Daniel M. | 2 |
Gao, Furong | 1 |
Hwang, Jeong-Won | 1 |
Ito, Kyoko | 1 |
Jeon, Min-Jeong | 1 |
Kang, Sang-Jin | 1 |
Park, In-Yong | 1 |
Publication Type
Journal Articles | 8 |
Reports - Research | 8 |
Reports - Evaluative | 5 |
Speeches/Meeting Papers | 5 |
Numerical/Quantitative Data | 3 |
Education Level
Audience
Location
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Lee, Guemin; Park, In-Yong – Asia Pacific Education Review, 2012
Previous assessments of the reliability of test scores for testlet-composed tests have indicated that item-based estimation methods overestimate reliability. This study was designed to address issues related to the extent to which item-based estimation methods overestimate the reliability of test scores composed of testlets and to compare several…
Descriptors: Generalizability Theory, Simulation, Computation, Item Response Theory
Jeon, Min-Jeong; Lee, Guemin; Hwang, Jeong-Won; Kang, Sang-Jin – Asia Pacific Education Review, 2009
The purpose of this study was to investigate the methods of estimating the reliability of school-level scores using generalizability theory and multilevel models. Two approaches, "student within schools" and "students within schools and subject areas," were conceptualized and implemented in this study. Four methods resulting from the combination…
Descriptors: Generalizability Theory, Scores, Reliability, Statistical Analysis
Lee, Guemin; Lewis, Daniel M. – Educational and Psychological Measurement, 2008
The bookmark standard-setting procedure is an item response theory-based method that is widely implemented in state testing programs. This study estimates standard errors for cut scores resulting from bookmark standard settings under a generalizability theory model and investigates the effects of different universes of generalization and error…
Descriptors: Generalizability Theory, Testing Programs, Error of Measurement, Cutting Scores

Lee, Guemin; Fitzpatrick, Anne R. – Journal of Educational Measurement, 2003
Studied three procedures for estimating the standard errors of school passing rates using a generalizability theory model and considered the effects of student sample size. Results show that procedures differ in terms of assumptions about the populations from which students were sampled, and student sample size was found to have a large effect on…
Descriptors: Error of Measurement, Estimation (Mathematics), Generalizability Theory, Sampling

Lee, Guemin; Frisbie, David A. – Applied Measurement in Education, 1999
Studied the appropriateness and implications of using a generalizability theory approach to estimating the reliability of scores from tests composed of testlets. Analyses of data from two national standardization samples suggest that manipulating the number of passages is a more productive way to obtain efficient measurement than manipulating the…
Descriptors: Generalizability Theory, Models, National Surveys, Reliability
Lee, Guemin; Frisbie, David A. – 1997
Previous studies have indicated that the reliability of test scores composed of testlets might be overestimated by conventional item-based reliability estimation methods (R. Thorndike, 1953; A. Anastasi, 1988; S. Sireci, D. Thissen, and H. Wainer, 1991; H. Wainer and D. Thissen, 1996). This study used generalizability theory to investigate the…
Descriptors: Estimation (Mathematics), Generalizability Theory, Reliability, Scores

Lee, Guemin – Journal of Educational Measurement, 2002
Studied the effects of items, passages, contents, themes, and types of passages on the reliability and standard errors of measurement for complex reading comprehension tests using seven different generalizability theory models. Results suggest that passages and themes should be taken into account when evaluating the reliability of test scores for…
Descriptors: Error of Measurement, Generalizability Theory, Models, Reading Comprehension

Lee, Guemin – Journal of Educational Measurement, 2000
Studied the appropriateness and implications of incorporating a testlet definition into the estimation of procedures of the conditional standard error of measurement (SEM) for tests composed of testlets. Simulation results for several methods show that an item-based method using a generalizability theory model provided good estimates of the…
Descriptors: Comparative Analysis, Error of Measurement, Estimation (Mathematics), Generalizability Theory
Lee, Guemin – 2000
The purpose of this study was to investigate the relative appropriateness of several procedures for estimating reliability and standard errors of measurement of complex reading comprehension tests. Seven generalizability theory models were conceptualized by incorporating one or several factors of items, passages, themes, contents, and types of…
Descriptors: Error of Measurement, Estimation (Mathematics), Generalizability Theory, Models

Fitzpatrick, Anne R.; Lee, Guemin; Gao, Furong – Applied Measurement in Education, 2001
Used generalizability theory to assess the variation in school scores across very short test forms that measured mathematics scores in grades 4 and 8. More than 25,000 students took each form of the 3 tests for each grade. Results demonstrate the lack of comparability in school scores across short, nonparallel tests forms and the importance of…
Descriptors: Comparative Analysis, Elementary School Students, Generalizability Theory, Institutional Characteristics
Lee, Guemin; Fitzpatrick, Anne R. – 2001
The percentage of students at/above a cut point (PAAC) is one of the most common measures used for reporting school-level performance relative to a proficiency standard (L. Cronbach, N. Bradburn, and D. Horvitz, 1994). The two purposes of this study were to introduce procedures for estimating standard errors for school PAACs under a…
Descriptors: Academic Achievement, Cutting Scores, Elementary Education, Elementary School Students
Lee, Guemin; Lewis, Daniel M. – 2001
The Bookmark Standard Setting Procedure (Lewis, Mitzel, and Green, 1996) is an item-response-theory-based standard setting method that has been widely implemented by state testing programs. The primary purposes of this study were to: (1) estimate standard errors for cutscores that result from Bookmark standard settings under a generalizability…
Descriptors: Cutting Scores, Elementary School Students, Elementary Secondary Education, Error of Measurement
Lee, Guemin; Fitzpatrick, Anne R.; Ito, Kyoko – 2001
School test performance is commonly summarized in terms of the percentage of students at or above a cut score (PAAC) that has been set on a test. Two approaches to estimating the standard errors for school PAACs were examined in this study: conditional standard errors and overall standard errors. The tests used were English language arts and…
Descriptors: Academic Achievement, Cutting Scores, Elementary Education, Elementary School Students