ERIC - Search Results

Showing 1 to 15 of 37 results

Traditional vs Intersectional DIF Analysis: Considerations and a Comparison Using State Testing Data

Peer reviewed

Albano, Tony; French, Brian F.; Vo, Thao Thu – Applied Measurement in Education, 2024

Recent research has demonstrated an intersectional approach to the study of differential item functioning (DIF). This approach expands DIF to account for the interactions between what have traditionally been treated as separate grouping variables. In this paper, we compare traditional and intersectional DIF analyses using data from a state testing…

Descriptors: Test Items, Item Analysis, Data Use, Standardized Tests

Considering the Use of General and Modified Assessment Items in Computerized Adaptive Testing

Peer reviewed

Wyse, Adam E.; Albano, Anthony D. – Applied Measurement in Education, 2015

This article used several data sets from a large-scale state testing program to examine the feasibility of combining general and modified assessment items in computerized adaptive testing (CAT) for different groups of students. Results suggested that several of the assumptions made when employing this type of mixed-item CAT may not be met for…

Descriptors: Adaptive Testing, Computer Assisted Testing, Test Items, Testing Programs

Impact of Accumulated Error on Item Response Theory Pre-Equating with Mixed Format Tests

Peer reviewed

Keller, Lisa A.; Keller, Robert; Cook, Robert J.; Colvin, Kimberly F. – Applied Measurement in Education, 2016

The equating of tests is an essential process in high-stakes, large-scale testing conducted over multiple forms or administrations. By adjusting for differences in difficulty and placing scores from different administrations of a test on a common scale, equating allows scores from these different forms and administrations to be directly compared…

Descriptors: Item Response Theory, Equated Scores, Test Format, Testing Programs

Impact of Design Effects in Large-Scale District and State Assessments

Peer reviewed

Phillips, Gary W. – Applied Measurement in Education, 2015

This article proposes that sampling design effects have potentially huge unrecognized impacts on the results reported by large-scale district and state assessments in the United States. When design effects are unrecognized and unaccounted for they lead to underestimating the sampling error in item and test statistics. Underestimating the sampling…

Descriptors: State Programs, Sampling, Research Design, Error of Measurement

Practical Application of a Synthetic Linking Function on Small-Sample Equating

Peer reviewed

Kim, Sooyeon; von Davier, Alina A.; Haberman, Shelby – Applied Measurement in Education, 2011

The synthetic function is a weighted average of the identity (the linking function for forms that are known to be completely parallel) and a traditional equating method. The purpose of the present study was to investigate the benefits of the synthetic function on small-sample equating using various real data sets gathered from different…

Descriptors: Testing Programs, Equated Scores, Investigations, Data Analysis

Item Position and Item Difficulty Change in an IRT-Based Common Item Equating Design

Peer reviewed

Meyers, Jason L.; Miller, G. Edward; Way, Walter D. – Applied Measurement in Education, 2009

In operational testing programs using item response theory (IRT), item parameter invariance is threatened when an item appears in a different location on the live test than it did when it was field tested. This study utilizes data from a large state's assessments to model change in Rasch item difficulty (RID) as a function of item position change,…

Descriptors: Test Items, Test Content, Testing Programs, Simulation

Detecting and Correcting Scale Drift in Test Equating: An Illustration from a Large Scale Testing Program

Peer reviewed

Puhan, Gautam – Applied Measurement in Education, 2009

The purpose of this study is to determine the extent of scale drift on a test that employs cut scores. It was essential to examine scale drift for this testing program because new forms in this testing program are often put on scale through a series of intermediate equatings (known as equating chains). This process may cause equating error to…

Descriptors: Testing Programs, Testing, Measurement Techniques, Item Response Theory

Conducting a Lifecycle Audit of the National Assessment of Educational Progress

Peer reviewed

Buckendahl, Chad W.; Plake, Barbara S.; Davis, Susan L. – Applied Measurement in Education, 2009

The National Assessment of Educational Progress (NAEP) program is a series of periodic assessments administered nationally to samples of students and designed to measure different content areas. This article describes a multi-year study that focused on the breadth of the development, administration, maintenance, and renewal of the assessments in…

Descriptors: National Competency Tests, Audits (Verification), Testing Programs, Program Evaluation

Comparing DIF across Math and Reading/Language Arts Tests for Students Receiving a Read-Aloud Accommodation

Peer reviewed

Bolt, Sara E.; Ysseldyke, James E. – Applied Measurement in Education, 2006

Although testing accommodations are commonly provided to students with disabilities within large-scale testing programs, research findings on how well accommodations allow for comparable measurement of student knowledge and skill remain inconclusive. The purpose of this study was to examine the extent to which 1 commonly held belief about testing…

Descriptors: Oral Reading, Testing Accommodations, Disabilities, Special Needs Students

Stability of School-Level Scores from Large-Scale Student Assessment.

Peer reviewed

Sicoly, Fiore – Applied Measurement in Education, 2002

Calculated year-1 to year-2 stability of assessment data from 21 states and 2 Canadian provinces. The median stability coefficient was 0.78 in mathematics and reading, and lower in writing. A stability coefficient of 0.80 is recommended as the standard for large-scale assessments of student performance. (SLD)

Descriptors: Educational Testing, Elementary Secondary Education, Foreign Countries, Mathematics

Sources of Uncertainty Often Ignored in Adjusting State Mean SAT Scores for Differential Participation Rates: The Rules of the Game.

Peer reviewed

Holland, Paul W.; Wainer, Howard – Applied Measurement in Education, 1990

Two attempts to adjust state mean Scholastic Aptitude Test (SAT) scores for differential participation rates are examined. Both attempts are rejected, and five rules for performing adjustments are outlined to foster follow-up checks on untested assumptions. National Assessment of Educational Progress state data are determined to be more accurate.…

Descriptors: College Applicants, College Entrance Examinations, Estimation (Mathematics), Item Bias

Are Multiple Measures Meaningful?: Lessons from a Statewide Performance Assessment.

Peer reviewed

Goldberg, Gail Lynn; Roswell, Barbara Sherr – Applied Measurement in Education, 2001

To determine the factors that contribute to or compromise the effectiveness of multiscored items, this study combined analysis of statewide score data from the 1996 Maryland School Performance Assessment Program tests with systematic analyses of 60 activities providing measures of writing, language usage, or both, and one or more content areas.…

Descriptors: Performance Based Assessment, Scores, State Programs, Testing Programs

Testing the Basic Skills in the High School--What's in the Future?

Peer reviewed

Fisher, Thomas H. – Applied Measurement in Education, 1988

Future trends in high school basic skills testing programs are discussed. Topics include subject area testing, national comparison testing, test security concerns, and technology's impact. The future is likely to bring more testing, rather than less, but there will be significant changes in the ways tests are implemented. (SLD)

Descriptors: Basic Skills, Educational Change, Educational Testing, Educational Trends

Estimation of the All Tests Pass Rate When No Examinee Took All of the Tests

Peer reviewed

Miller, G. Edward; Yoes, Michael E.; Twing, Jon S. – Applied Measurement in Education, 2004

Two models are presented in this article for estimating the proportion of students who would pass all of three or more content area tests given that none have actually been tested in more than two of the content areas. The first model allows one to estimate the proportion of students who would pass all of three or more content area tests from the…

Descriptors: Scores, Standardized Tests, Student Evaluation, Testing Programs

Linking Statewide Tests to the National Assessment of Educational Progress: Accuracy of Combining Test Results across States.

Peer reviewed

Ercikan, Kadriye – Applied Measurement in Education, 1997

Linking scores from the National Assessment of Educational Progress (NAEP) to statewide test results was studied. Results based on an equipercentile procedure suggest that such a link does not provide precise information. Information from a linking study should be limited to rough estimates of students in each NAEP achievement level. (SLD)

Descriptors: Equated Scores, Estimation (Mathematics), National Surveys, State Programs
