Publication Date
In 2025 | 0 |
Since 2024 | 1 |
Since 2021 (last 5 years) | 1 |
Since 2016 (last 10 years) | 2 |
Since 2006 (last 20 years) | 9 |
Descriptor
Testing Programs | 37 |
State Programs | 22 |
Scores | 8 |
Educational Assessment | 7 |
Achievement Tests | 6 |
Evaluation Methods | 6 |
High Schools | 6 |
Item Response Theory | 6 |
Mathematics Tests | 6 |
Test Construction | 6 |
Test Use | 6 |
More ▼ |
Source
Applied Measurement in… | 37 |
Author
Buckendahl, Chad W. | 2 |
Holland, Paul W. | 2 |
Miller, G. Edward | 2 |
Pomplun, Mark | 2 |
Twing, Jon S. | 2 |
Wainer, Howard | 2 |
Albano, Anthony D. | 1 |
Anderson, David W. | 1 |
Bolt, Sara E. | 1 |
Brian F. French | 1 |
Chen, Wen-Hung | 1 |
More ▼ |
Publication Type
Journal Articles | 37 |
Reports - Evaluative | 15 |
Reports - Research | 13 |
Reports - Descriptive | 9 |
Information Analyses | 2 |
Historical Materials | 1 |
Legal/Legislative/Regulatory… | 1 |
Speeches/Meeting Papers | 1 |
Education Level
Elementary Secondary Education | 3 |
Elementary Education | 2 |
Grade 3 | 2 |
Secondary Education | 2 |
Early Childhood Education | 1 |
Grade 11 | 1 |
Grade 2 | 1 |
Grade 4 | 1 |
Grade 5 | 1 |
Grade 6 | 1 |
Grade 7 | 1 |
More ▼ |
Audience
Laws, Policies, & Programs
No Child Left Behind Act 2001 | 2 |
Assessments and Surveys
Texas Assessment of Academic… | 5 |
SAT (College Admission Test) | 3 |
National Assessment of… | 2 |
Iowa Tests of Basic Skills | 1 |
What Works Clearinghouse Rating
Traditional vs Intersectional DIF Analysis: Considerations and a Comparison Using State Testing Data
Tony Albano; Brian F. French; Thao Thu Vo – Applied Measurement in Education, 2024
Recent research has demonstrated an intersectional approach to the study of differential item functioning (DIF). This approach expands DIF to account for the interactions between what have traditionally been treated as separate grouping variables. In this paper, we compare traditional and intersectional DIF analyses using data from a state testing…
Descriptors: Test Items, Item Analysis, Data Use, Standardized Tests
Keller, Lisa A.; Keller, Robert; Cook, Robert J.; Colvin, Kimberly F. – Applied Measurement in Education, 2016
The equating of tests is an essential process in high-stakes, large-scale testing conducted over multiple forms or administrations. By adjusting for differences in difficulty and placing scores from different administrations of a test on a common scale, equating allows scores from these different forms and administrations to be directly compared…
Descriptors: Item Response Theory, Equated Scores, Test Format, Testing Programs
Wyse, Adam E.; Albano, Anthony D. – Applied Measurement in Education, 2015
This article used several data sets from a large-scale state testing program to examine the feasibility of combining general and modified assessment items in computerized adaptive testing (CAT) for different groups of students. Results suggested that several of the assumptions made when employing this type of mixed-item CAT may not be met for…
Descriptors: Adaptive Testing, Computer Assisted Testing, Test Items, Testing Programs
Kim, Sooyeon; von Davier, Alina A.; Haberman, Shelby – Applied Measurement in Education, 2011
The synthetic function is a weighted average of the identity (the linking function for forms that are known to be completely parallel) and a traditional equating method. The purpose of the present study was to investigate the benefits of the synthetic function on small-sample equating using various real data sets gathered from different…
Descriptors: Testing Programs, Equated Scores, Investigations, Data Analysis
Phillips, Gary W. – Applied Measurement in Education, 2015
This article proposes that sampling design effects have potentially huge unrecognized impacts on the results reported by large-scale district and state assessments in the United States. When design effects are unrecognized and unaccounted for they lead to underestimating the sampling error in item and test statistics. Underestimating the sampling…
Descriptors: State Programs, Sampling, Research Design, Error of Measurement
Buckendahl, Chad W.; Plake, Barbara S.; Davis, Susan L. – Applied Measurement in Education, 2009
The National Assessment of Educational Progress (NAEP) program is a series of periodic assessments administered nationally to samples of students and designed to measure different content areas. This article describes a multi-year study that focused on the breadth of the development, administration, maintenance, and renewal of the assessments in…
Descriptors: National Competency Tests, Audits (Verification), Testing Programs, Program Evaluation
Meyers, Jason L.; Miller, G. Edward; Way, Walter D. – Applied Measurement in Education, 2009
In operational testing programs using item response theory (IRT), item parameter invariance is threatened when an item appears in a different location on the live test than it did when it was field tested. This study utilizes data from a large state's assessments to model change in Rasch item difficulty (RID) as a function of item position change,…
Descriptors: Test Items, Test Content, Testing Programs, Simulation
Puhan, Gautam – Applied Measurement in Education, 2009
The purpose of this study is to determine the extent of scale drift on a test that employs cut scores. It was essential to examine scale drift for this testing program because new forms in this testing program are often put on scale through a series of intermediate equatings (known as equating chains). This process may cause equating error to…
Descriptors: Testing Programs, Testing, Measurement Techniques, Item Response Theory

Goldberg, Gail Lynn; Roswell, Barbara Sherr – Applied Measurement in Education, 2001
To determine the factors that contribute to or compromise the effectiveness of multiscored items, this study combined analysis of statewide score data from the 1996 Maryland School Performance Assessment Program tests with systematic analyses of 60 activities providing measures of writing, language usage, or both, and one or more content areas.…
Descriptors: Performance Based Assessment, Scores, State Programs, Testing Programs
Miller, G. Edward; Yoes, Michael E.; Twing, Jon S. – Applied Measurement in Education, 2004
Two models are presented in this article for estimating the proportion of students who would pass all of three or more content area tests given that none have actually been tested in more than two of the content areas. The first model allows one to estimate the proportion of students who would pass all of three or more content area tests from the…
Descriptors: Scores, Standardized Tests, Student Evaluation, Testing Programs

Ercikan, Kadriye – Applied Measurement in Education, 1997
Linking scores from the National Assessment of Educational Progress (NAEP) to statewide test results was studied. Results based on an equipercentile procedure suggest that such a link does not provide precise information. Information from a linking study should be limited to rough estimates of students in each NAEP achievement level. (SLD)
Descriptors: Equated Scores, Estimation (Mathematics), National Surveys, State Programs

Sireci, Stephen G.; Robin, Frederic; Patelis, Thanos – Applied Measurement in Education, 1999
Presents a procedure for standard setting that involves the cluster analysis of test takers to discover examinee groups that are useful for envisioning marginally competent performance or defining borderline or contrasting groups. Illustrates use of the procedure with a statewide mathematics test, and concludes that cluster analysis is useful in…
Descriptors: Cluster Analysis, Mathematics Tests, Standard Setting (Scoring), Standards

Holland, Paul W.; Wainer, Howard – Applied Measurement in Education, 1990
The attempt by D.Edwards and C. B. Cummings to adjust state mean Scholastic Aptitude Test Scores for differential participation rates with a "fuzzy truncation model" satisfies three criteria the authors previously defined but falls short for two. Omission of sensitivity studies mars the otherwise exemplary study. (SLD)
Descriptors: College Entrance Examinations, Criteria, Higher Education, Participation

Engelhard, George, Jr.; Anderson, David W. – Applied Measurement in Education, 1998
A new approach for examining the quality of judgments from standard-setting judges using a Binomial Trials Model (BTM) is presented and illustrated with 26 judges from the Georgia High School Graduation Test. Results suggest that the BTM provides information not available from other methods. (SLD)
Descriptors: Graduation Requirements, High Schools, Judges, Standard Setting (Scoring)

Mehrens, William A. – Applied Measurement in Education, 2000
Presents conclusions of an independent measurement expert that the Texas Assessment of Academic Skills (TAAS) was constructed according to acceptable professional standards and tests curricular material considered by the Texas Board of Education important for graduates to have mastered. Also supports the validity and reliability of the TAAS and…
Descriptors: Curriculum, Psychometrics, Reliability, Standards