ERIC - Search Results

Publication Date

In 2025	0
Since 2024	1
Since 2021 (last 5 years)	2
Since 2016 (last 10 years)	4
Since 2006 (last 20 years)	10

Descriptor

Test Length	10
Test Items	6
Elementary Secondary Education	5
Evaluation Methods	4
Foreign Countries	4
Mathematics Tests	4
Comparative Analysis	3
Item Response Theory	3
Mathematics Achievement	3
Reading Tests	3
Scores	3
Computer Assisted Testing	2
Correlation	2
Grade 8	2
International Assessment	2
Predictor Variables	2
Questionnaires	2
Science Tests	2
Simulation	2
Student Evaluation	2
Test Bias	2
Test Construction	2
Test Content	2
Test Reliability	2
Test Validity	2
More ▼

Source

Applied Measurement in…	1
ETS Research Report Series	1
Education Week	1
Educational Research and…	1
International Journal of…	1
Measurement:…	1
OECD Publishing (NJ1)	1
Online Submission	1
Pearson	1
Pennsylvania Department of…	1

Author

Camilli, Gregory	1
Chien, Yuehmei	1
Dikici, Ayhan	1
Gewertz, Catherine	1
Johan Braeken	1
Lee, HyeSun	1
Lu, Ying	1
McBride, James R.	1
Saskia van Laar	1
Shin, Chingwei David	1
Soh, Kaycheng	1
Way, Walter Denny	1
Wu, Margaret	1
Wyse, Adam E.	1
More ▼

Publication Type

Journal Articles	7
Reports - Research	6
Reports - Descriptive	2
Reports - Evaluative	2
Guides - Non-Classroom	1
Numerical/Quantitative Data	1
Speeches/Meeting Papers	1

Education Level

Elementary Secondary Education	10
Elementary Education	2
Secondary Education	2
Grade 8	1
Junior High Schools	1
Middle Schools	1

Audience

Researchers

Location

Asia	1
Iran	1
Pennsylvania	1
Singapore	1
Turkey	1

Laws, Policies, & Programs

Race to the Top

Assessments and Surveys

Trends in International…	3
Program for International…	1

What Works Clearinghouse Rating

Showing all 10 results Save | Export

Handling Extreme Scores in Vertically Scaled Fixed-Length Computerized Adaptive Tests

Peer reviewed

Direct link

Wyse, Adam E.; McBride, James R. – Measurement: Interdisciplinary Research and Perspectives, 2022

A common practical challenge is how to assign ability estimates to all incorrect and all correct response patterns when using item response theory (IRT) models and maximum likelihood estimation (MLE) since ability estimates for these types of responses equal -8 or +8. This article uses a simulation study and data from an operational K-12…

Descriptors: Scores, Adaptive Testing, Computer Assisted Testing, Test Length

Prevalence of Random Responders as a Function of Scale Position and Questionnaire Length in the TIMSS 2015 Eighth-Grade Student Questionnaire

Peer reviewed

Direct link

Saskia van Laar; Johan Braeken – International Journal of Testing, 2024

This study examined the impact of two questionnaire characteristics, scale position and questionnaire length, on the prevalence of random responders in the TIMSS 2015 eighth-grade student questionnaire. While there was no support for an absolute effect of questionnaire length, we did find a positive effect for scale position, with an increase of…

Descriptors: Middle School Students, Grade 8, Questionnaires, Test Length

Item Parameter Drift in a Time-Varying Predictor

Peer reviewed

Direct link

Lee, HyeSun – Applied Measurement in Education, 2018

The current simulation study examined the effects of Item Parameter Drift (IPD) occurring in a short scale on parameter estimates in multilevel models where scores from a scale were employed as a time-varying predictor to account for outcome scores. Five factors, including three decisions about IPD, were considered for simulation conditions. It…

Descriptors: Test Items, Hierarchical Linear Modeling, Predictor Variables, Scores

Variability in Percentage above Cut Scores Due to Discreteness in Score Scale. Research Report. ETS RR-17-32

Peer reviewed
PDF on ERIC

Download full text

Lu, Ying – ETS Research Report Series, 2017

For standard- or criterion-based assessments, the use of cut scores to indicate mastery, nonmastery, or different levels of skill mastery is very common. As part of performance summary, it is of interest to examine the percentage of examinees at or above the cut scores (PAC) and how PAC evolves across administrations. This paper shows that…

Descriptors: Cutting Scores, Evaluation Methods, Mastery Learning, Performance Based Assessment

Test Group Rethinks Questions

Direct link

Gewertz, Catherine – Education Week, 2012

A group that is developing tests for half the states in the nation has dramatically reduced the length of its assessment in a bid to balance the desire for a more meaningful and useful exam with concerns about the amount of time spent on testing. The decision by the Smarter Balanced Assessment Consortium reflects months of conversation among its…

Descriptors: State Standards, Test Length, Questioning Techniques, Test Construction

Indexing Creativity Fostering Teacher Behaviour: Replication and Modification

Download full text

Dikici, Ayhan; Soh, Kaycheng – Online Submission, 2015

Many measurement tools on creativity are available in the literature. One of these scales is Creativity Fostering Teacher Behaviour Index (CFTIndex) developed for Singaporean teacher originally. It was then translated into Turkish and trialled on teachers in Nigde province with acceptable reliability and factorial validity. The main purpose of…

Descriptors: Creativity, Teacher Behavior, Comparative Analysis, Turkish

A Comparison of Three Content Balancing Methods for Fixed and Variable Length Computerized Adaptive Tests

Direct link

Shin, Chingwei David; Chien, Yuehmei; Way, Walter Denny – Pearson, 2012

Content balancing is one of the most important components in the computerized adaptive testing (CAT) especially in the K to 12 large scale tests that complex constraint structure is required to cover a broad spectrum of content. The purpose of this study is to compare the weighted penalty model (WPM) and the weighted deviation method (WDM) under…

Descriptors: Computer Assisted Testing, Elementary Secondary Education, Test Content, Models

Ongoing Issues in Test Fairness

Peer reviewed

Direct link

Camilli, Gregory – Educational Research and Evaluation, 2013

In the attempt to identify or prevent unfair tests, both quantitative analyses and logical evaluation are often used. For the most part, fairness evaluation is a pragmatic attempt at determining whether procedural or substantive due process has been accorded to either a group of test takers or an individual. In both the individual and comparative…

Descriptors: Alternative Assessment, Test Bias, Test Content, Test Format

The 2008-2009 Pennsylvania System of School Assessment Handbook for Assessment Coordinators: Writing, Reading and Mathematics, Science

Download full text

Pennsylvania Department of Education, 2010

This handbook describes the responsibilities of district and school assessment coordinators in the administration of the Pennsylvania System of School Assessment (PSSA). This updated guidebook contains the following sections: (1) General Assessment Guidelines for All Assessments; (2) Writing Specific Guidelines; (3) Reading and Mathematics…

Descriptors: Guidelines, Guides, Educational Assessment, Writing Tests

Comparing the Similarities and Differences of PISA 2003 and TIMSS. OECD Education Working Papers, No. 32

Direct link

Wu, Margaret – OECD Publishing (NJ1), 2010

This paper makes an in-depth comparison of the PISA (OECD) and TIMSS (IEA) mathematics assessments conducted in 2003. First, a comparison of survey methodologies is presented, followed by an examination of the mathematics frameworks in the two studies. The methodologies and the frameworks in the two studies form the basis for providing…

Descriptors: Mathematics Achievement, Foreign Countries, Gender Differences, Comparative Analysis