Showing 1 to 15 of 59 results
Peer reviewed
Direct link
Yusuf Kara; Akihito Kamata; Xin Qiao; Cornelis J. Potgieter; Joseph F. T. Nese – Educational and Psychological Measurement, 2024
Words read correctly per minute (WCPM) is the reporting score metric in oral reading fluency (ORF) assessments and is widely used in curriculum-based measurement to screen at-risk readers and to monitor the progress of students receiving interventions. Just like other types of assessments with multiple forms, equating would be…
Descriptors: Oral Reading, Reading Fluency, Models, Reading Rate
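As background for the equating question this abstract raises, linking alternate ORF passages can start with simple linear equating of WCPM scores; the sketch below is generic and the scores invented, not the authors' method or data.

import numpy as np

# Hypothetical WCPM scores from the same students on two alternate passages.
form_a = np.array([92.0, 110.0, 75.0, 130.0, 88.0, 101.0])
form_b = np.array([85.0, 103.0, 70.0, 121.0, 80.0, 95.0])

# Linear equating: rescale form B to match form A's mean and SD.
slope = form_a.std(ddof=1) / form_b.std(ddof=1)
intercept = form_a.mean() - slope * form_b.mean()
form_b_equated = intercept + slope * form_b

print(form_b_equated.round(1))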
Peer reviewed
Direct link
Liu, Ivy; Suesse, Thomas; Harvey, Samuel; Gu, Peter Yongqi; Fernández, Daniel; Randal, John – Educational and Psychological Measurement, 2023
The Mantel-Haenszel estimator is one of the most popular techniques for measuring differential item functioning (DIF). A generalization of this estimator is applied to the DIF context to compare items while taking into account the covariance of odds ratio estimators between dependent items. Unlike item response theory, the method does not rely…
Descriptors: Test Bias, Computation, Statistical Analysis, Achievement Tests
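The classical Mantel-Haenszel common odds ratio for a single studied item, stratified by total score, is simple to compute; a minimal sketch with invented counts (the article's generalization additionally models covariances between dependent items, which this plain version ignores).

import numpy as np

# Stratified 2x2 tables: for each total-score stratum, counts of
# [ref_correct, ref_incorrect, focal_correct, focal_incorrect].
tables = np.array([
    [30, 20, 25, 25],   # stratum 1
    [45, 15, 38, 22],   # stratum 2
    [60, 10, 50, 20],   # stratum 3
])

A, B, C, D = tables.T          # unpack the four cell counts per stratum
N = tables.sum(axis=1)         # stratum sizes

# MH common odds ratio and the usual ETS delta-scale DIF statistic.
alpha_mh = (A * D / N).sum() / (B * C / N).sum()
delta_mh = -2.35 * np.log(alpha_mh)

print(f"alpha_MH = {alpha_mh:.3f}, MH D-DIF = {delta_mh:.3f}")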
Peer reviewed
Direct link
Jana Welling; Timo Gnambs; Claus H. Carstensen – Educational and Psychological Measurement, 2024
Disengaged responding poses a severe threat to the validity of educational large-scale assessments, because item responses from unmotivated test-takers do not reflect their actual ability. Existing identification approaches rely primarily on item response times, which bears the risk of misclassifying fast engaged or slow disengaged responses.…
Descriptors: Foreign Countries, College Students, Guessing (Tests), Multiple Choice Tests
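The response-time baseline the authors build on flags responses faster than an item-level threshold as rapid guesses; a minimal sketch of one common heuristic (threshold = 10% of the item's median response time), with invented data.

import numpy as np

# Hypothetical response times (seconds): rows = test-takers, cols = items.
rt = np.array([
    [12.1, 30.5,  2.0, 25.0],
    [ 1.5,  2.2,  1.8,  2.5],   # a consistently rapid responder
    [18.0, 22.3, 15.1, 28.4],
])

# Normative-threshold heuristic: anything under 10% of an item's
# median response time is flagged as a likely rapid guess.
threshold = 0.10 * np.median(rt, axis=0)
rapid = rt < threshold

print(rapid)
print("flagged per person:", rapid.sum(axis=1))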
Peer reviewed
Direct link
Wyse, Adam E. – Educational and Psychological Measurement, 2021
An essential question when computing test-retest and alternate forms reliability coefficients is how many days there should be between tests. This article uses data from reading and math computerized adaptive tests to explore how the number of days between tests impacts alternate forms reliability coefficients. Results suggest that the highest…
Descriptors: Computer Assisted Testing, Adaptive Testing, Test Reliability, Reading Tests
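The underlying computation is an alternate-forms correlation within each retest gap; a minimal sketch that groups invented score pairs by days between administrations.

import numpy as np
from collections import defaultdict

# (days_between, score_form1, score_form2) for hypothetical examinees.
records = [
    (7, 210, 214), (7, 195, 190), (7, 230, 228),
    (30, 210, 205), (30, 195, 201), (30, 230, 221),
]

by_gap = defaultdict(list)
for days, s1, s2 in records:
    by_gap[days].append((s1, s2))

for days, pairs in sorted(by_gap.items()):
    x, y = np.array(pairs).T
    r = np.corrcoef(x, y)[0, 1]
    print(f"{days:>3} days between forms: r = {r:.3f}")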
Peer reviewed
Direct link
van Dijk, Wilhelmina; Schatschneider, Christopher; Al Otaiba, Stephanie; Hart, Sara A. – Educational and Psychological Measurement, 2022
Complex research questions often need large samples to obtain accurate estimates of parameters and adequate power. Combining extant data sets into a large, pooled data set is one way this can be accomplished without expending resources. Measurement invariance (MI) modeling is an established approach to ensure participant scores are on the same…
Descriptors: Sample Size, Data Analysis, Goodness of Fit, Measurement
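Full MI testing fits multigroup CFA models under increasingly strict equality constraints; as a rough, assumption-laden stand-in, one can at least compare one-factor loadings (here, first principal components) across data sets before pooling. Simulated data; this is not the MI modeling the article describes.

import numpy as np

rng = np.random.default_rng(0)

def pc_loadings(data):
    # First-principal-component loadings of the correlation matrix:
    # a crude one-factor stand-in, not a fitted CFA.
    corr = np.corrcoef(data, rowvar=False)
    vals, vecs = np.linalg.eigh(corr)
    load = vecs[:, -1] * np.sqrt(vals[-1])
    return load * np.sign(load.sum())   # fix the arbitrary sign

# Two hypothetical extant data sets measuring the same four items.
true_load = np.array([[0.8, 0.7, 0.6, 0.5]])
set_a = rng.normal(size=(300, 1)) @ true_load + rng.normal(scale=0.5, size=(300, 4))
set_b = rng.normal(size=(300, 1)) @ true_load + rng.normal(scale=0.5, size=(300, 4))

print("set A loadings:", pc_loadings(set_a).round(2))
print("set B loadings:", pc_loadings(set_b).round(2))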
Peer reviewed
Direct link
Andrich, David; Marais, Ida; Humphry, Stephen Mark – Educational and Psychological Measurement, 2016
Recent research has shown how the statistical bias in Rasch model difficulty estimates induced by guessing in multiple-choice items can be eliminated. Using vertical scaling of a high-profile national reading test, it is shown that the dominant effect of removing such bias is a nonlinear change in the unit of scale across the continuum. The…
Descriptors: Guessing (Tests), Statistical Bias, Item Response Theory, Multiple Choice Tests
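The bias mechanism is easy to see in miniature: guessing inflates observed success probabilities, and a guessing-free Rasch model converts that inflation into difficulty estimates that shift by different amounts at different ability levels. A toy numerical illustration, not the authors' estimation method.

import numpy as np

b = 1.0                 # true item difficulty
c = 0.25                # guessing floor (4-option multiple choice)
thetas = np.array([-2.0, -1.0, 0.0, 1.0, 2.0])

p_rasch = 1 / (1 + np.exp(-(thetas - b)))   # guessing-free probability
p_obs = c + (1 - c) * p_rasch               # 3PL-style observed probability

# Difficulty implied by a Rasch model that ignores guessing:
# invert p_obs = 1 / (1 + exp(-(theta - b_hat))) for b_hat.
b_hat = thetas - np.log(p_obs / (1 - p_obs))

for t, bh in zip(thetas, b_hat):
    print(f"theta = {t:+.1f}: implied difficulty = {bh:+.2f} (true {b:+.2f})")

The implied difficulty differs by ability level, which is the nonlinear distortion of the unit of scale that the abstract refers to.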
Peer reviewed
Direct link
Sideridis, Georgios D. – Educational and Psychological Measurement, 2016
The purpose of the present studies was to test the hypothesis that the psychometric characteristics of ability scales may be significantly distorted if one accounts for emotional factors during test taking. Specifically, the present studies evaluate the effects of anxiety and motivation on the item difficulties of the Rasch model. In Study 1, the…
Descriptors: Learning Disabilities, Test Validity, Measures (Individuals), Hierarchical Linear Modeling
Peer reviewed
Direct link
Lin, Pei-Ying; Lin, Yu-Cheng – Educational and Psychological Measurement, 2014
This exploratory study investigated potential sources of setting accommodation resulting in differential item functioning (DIF) on math and reading assessments for examinees with varied learning characteristics. The examinees were those who participated in large-scale assessments and were tested in either standardized or accommodated testing…
Descriptors: Test Bias, Multivariate Analysis, Testing Accommodations, Mathematics Tests
Peer reviewed
Direct link
Wiley, Edward W.; Shavelson, Richard J.; Kurpius, Amy A. – Educational and Psychological Measurement, 2014
The name "SAT" has become synonymous with college admissions testing; it has been dubbed "the gold standard." Numerous studies on its reliability and predictive validity show that the SAT predicts college performance beyond high school grade point average. Surprisingly, studies of the factorial structure of the current version…
Descriptors: College Readiness, College Admission, College Entrance Examinations, Factor Analysis
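Examining the factorial structure of section scores can be sketched with an ordinary exploratory factor analysis; a minimal example on simulated two-factor data (illustrative only, not the study's data or model).

import numpy as np
from sklearn.decomposition import FactorAnalysis

rng = np.random.default_rng(1)

# Simulated scores on four hypothetical sections loading on two factors
# (verbal-like and math-like); purely illustrative.
n = 500
verbal = rng.normal(size=(n, 1))
math = rng.normal(size=(n, 1))
scores = np.hstack([
    verbal * 0.8, verbal * 0.7,   # two verbal-leaning sections
    math * 0.8, math * 0.7,       # two math-leaning sections
]) + rng.normal(scale=0.5, size=(n, 4))

fa = FactorAnalysis(n_components=2, random_state=0).fit(scores)
print(fa.components_.round(2))   # factor loadings (factors x sections)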
Peer reviewed
Direct link
Pohl, Steffi; Gräfe, Linda; Rose, Norman – Educational and Psychological Measurement, 2014
Data from competence tests usually show a number of missing responses on test items due to both omitted and not-reached items. Different approaches for dealing with missing responses exist, and there are no clear guidelines on which of those to use. While classical approaches rely on an ignorable missing data mechanism, the most recently developed…
Descriptors: Test Items, Achievement Tests, Item Response Theory, Models
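The distinction the authors model is positional: missing responses after the last answered item are "not reached," while interior missings are "omitted." A minimal sketch of that classification, with NaN marking a missing response (invented data).

import numpy as np

# Response matrix: 1/0 scored items, NaN = no response.
resp = np.array([
    [1, 0, np.nan, 1, np.nan, np.nan],   # one omitted, two not reached
    [1, 1, 0, 1, 0, 1],                  # complete
    [np.nan, 1, 1, np.nan, np.nan, np.nan],
])

for row in resp:
    answered = np.flatnonzero(~np.isnan(row))
    last = answered.max() if answered.size else -1
    missing = np.isnan(row)
    omitted = missing & (np.arange(row.size) < last)
    not_reached = missing & (np.arange(row.size) > last)
    print(f"omitted: {omitted.sum()}, not reached: {not_reached.sum()}")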
Peer reviewed
Direct link
Hartig, Johannes; Frey, Andreas; Nold, Gunter; Klieme, Eckhard – Educational and Psychological Measurement, 2012
The article compares three different methods to estimate effects of task characteristics and to use these estimates for model-based proficiency scaling: prediction of item difficulties from the Rasch model, the linear logistic test model (LLTM), and an LLTM including random item effects (LLTM+e). The methods are applied to empirical data from a…
Descriptors: Item Response Theory, Models, Methods, Computation
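The LLTM decomposes Rasch item difficulties into weighted task characteristics, b_i = sum_k q_ik * eta_k; the simplest of the three compared strategies amounts to regressing estimated difficulties on the Q matrix. A toy sketch with invented values.

import numpy as np

# Q matrix: rows = items, cols = task characteristics (e.g., text
# length, inference demand). Values are hypothetical.
Q = np.array([
    [1, 0],
    [1, 1],
    [0, 1],
    [2, 1],
    [2, 0],
], dtype=float)

# Rasch difficulty estimates for the five items (invented).
b = np.array([-0.5, 0.4, 0.8, 1.3, 0.1])

# Least-squares estimate of the characteristic weights (eta).
eta, *_ = np.linalg.lstsq(Q, b, rcond=None)
print("eta:", eta.round(2))
print("predicted difficulties:", (Q @ eta).round(2))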
Peer reviewed
Direct link
Skaggs, Gary; Hein, Serge F. – Educational and Psychological Measurement, 2011
Judgmental standard setting methods have been criticized for the cognitive complexity of the judgment task that panelists are asked to complete. This study compared two methods designed to reduce this complexity: the yes/no method and the single-passage bookmark method. Two mock standard setting panel meetings were convened, one for each method,…
Descriptors: Standard Setting (Scoring), Methods, Cutting Scores, Experienced Teachers
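The yes/no method reduces each judgment to whether a borderline examinee would answer the item correctly; the raw cut score is then the panel's average count of "yes" items. A minimal sketch with invented judgments.

import numpy as np

# Yes/No method: each panelist marks 1 if a borderline examinee would
# answer the item correctly, else 0. Rows = panelists, cols = items.
judgments = np.array([
    [1, 1, 0, 1, 0, 1, 1, 0],
    [1, 0, 0, 1, 1, 1, 1, 0],
    [1, 1, 0, 1, 0, 1, 0, 0],
])

# Each panelist's recommended raw cut = number of "yes" items;
# the panel cut score = mean across panelists.
cut = judgments.sum(axis=1).mean()
print(f"recommended raw cut score: {cut:.1f} of {judgments.shape[1]}")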
Peer reviewed
Direct link
Humphry, Stephen M. – Educational and Psychological Measurement, 2010
Discrimination has traditionally been parameterized for items but not other empirical factors. Consequently, if person factors affect discrimination they cause misfit. However, by explicitly formulating the relationship between discrimination and the unit of a metric, it is possible to parameterize discrimination for person groups. This article…
Descriptors: Discriminant Analysis, Models, Simulation, Reading Tests
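The core idea can be sketched as a Rasch-style model whose discrimination parameter attaches to the person's group rather than to the item; the formulation below is a generic illustration, not the article's exact parameterization.

import numpy as np

def p_correct(theta, b, a_group):
    # Rasch-style response probability with discrimination indexed by
    # the person's group rather than by the item.
    return 1 / (1 + np.exp(-a_group * (theta - b)))

b = 0.0
thetas = np.array([-1.0, 0.0, 1.0])
for group, a in (("group 1", 1.0), ("group 2", 1.4)):
    print(group, p_correct(thetas, b, a).round(3))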
Peer reviewed
Direct link
Wyse, Adam E. – Educational and Psychological Measurement, 2011
Standard setting is a method used to set cut scores on large-scale assessments. One of the most popular standard setting methods is the Bookmark method, in which panelists are asked to envision a response probability (RP) criterion and move through a booklet of ordered items based on that criterion. This study investigates whether…
Descriptors: Testing Programs, Standard Setting (Scoring), Cutting Scores, Probability
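Under a Rasch model, the ability at which a borderline examinee answers an item with probability RP is theta = b + ln(RP / (1 - RP)), so the chosen RP criterion shifts every item's ordered location. A small worked sketch with invented difficulties.

import numpy as np

b = np.array([-1.2, -0.3, 0.5, 1.1, 1.8])   # Rasch item difficulties

def rp_location(b, rp):
    # Ability at which P(correct) = rp under the Rasch model.
    return b + np.log(rp / (1 - rp))

for rp in (0.50, 0.67, 0.80):
    print(f"RP{int(rp * 100)} locations:", rp_location(b, rp).round(2))

With the common RP67 criterion, every location sits ln(2) ≈ 0.71 logits above the item's difficulty; RP50 places it exactly at b.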
Peer reviewed
Direct link
Ferdous, Abdullah A.; Plake, Barbara S. – Educational and Psychological Measurement, 2007
In an Angoff standard setting procedure, judges estimate the probability that a hypothetical, randomly selected, minimally competent candidate will answer each item in the test correctly. In many cases, these item performance estimates are made twice, with information shared with the panelists between estimates. Especially for long tests, this…
Descriptors: Test Items, Probability, Item Analysis, Standard Setting (Scoring)
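The Angoff cut score is the sum over items of the panel's mean probability estimates; a minimal sketch with invented ratings for two estimation rounds, mirroring the shared-information design the abstract mentions.

import numpy as np

# Angoff ratings: probability that a minimally competent candidate
# answers each item correctly. Rows = panelists, cols = items.
round1 = np.array([
    [0.60, 0.80, 0.40, 0.70],
    [0.50, 0.90, 0.30, 0.75],
    [0.65, 0.85, 0.35, 0.60],
])
# Hypothetical round-2 ratings after information is shared.
round2 = round1 + np.array([
    [0.00, -0.05, 0.05, 0.00],
    [0.05, -0.10, 0.10, -0.05],
    [0.00,  0.00, 0.05, 0.05],
])

for name, r in (("round 1", round1), ("round 2", round2)):
    cut = r.mean(axis=0).sum()   # expected raw score of the borderline candidate
    print(f"{name}: cut = {cut:.2f} of {r.shape[1]}")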