ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	3
Since 2016 (last 10 years)	19
Since 2006 (last 20 years)	34

Descriptor

Scoring	42
Test Bias	42
Test Items	42
Test Reliability	17
Test Construction	15
Test Validity	13
Scores	12
Psychometrics	11
Correlation	10
Comparative Analysis	9
Item Response Theory	9
Testing	9
Simulation	7
Testing Accommodations	7
Multiple Choice Tests	6
Statistical Analysis	6
College Entrance Examinations	5
Computer Assisted Testing	5
Factor Analysis	5
Foreign Countries	5
Gender Differences	5
Item Analysis	5
Mathematics Tests	5
Test Content	5
Difficulty Level	4
More ▼

Publication Type

Journal Articles	23
Reports - Research	20
Reports - Evaluative	12
Guides - General	4
Guides - Non-Classroom	3
Numerical/Quantitative Data	3
Reports - Descriptive	3
Tests/Questionnaires	3
Speeches/Meeting Papers	2
Books	1
Collected Works - General	1
Dissertations/Theses -…	1
Information Analyses	1
More ▼

Education Level

Secondary Education	8
Higher Education	6
High Schools	5
Middle Schools	5
Early Childhood Education	4
Elementary Education	4
Grade 7	4
Junior High Schools	4
Postsecondary Education	4
Elementary Secondary Education	3
Grade 3	3
Primary Education	3
Grade 4	2
Grade 5	2
Grade 6	2
Grade 8	2
Grade 9	2
Intermediate Grades	2
More ▼

Audience

Location

California	2
Alabama	1
Denmark	1
France	1
Idaho	1
Nebraska	1
New Mexico	1
New York	1
North Dakota	1
Ohio	1
Slovakia	1
Taiwan	1
Texas	1
Turkey	1
United States	1
Washington	1
More ▼

Laws, Policies, & Programs

Assessments and Surveys

ACT Assessment	3
SAT (College Admission Test)	3
Graduate Record Examinations	2
National Assessment of…	2
ACT Interest Inventory	1
Advanced Placement…	1
Computer Attitude Scale	1
Preliminary Scholastic…	1
Program for International…	1
Teaching and Learning…	1
Test of English as a Foreign…	1
More ▼

What Works Clearinghouse Rating

Meets WWC Standards without Reservations	1
Meets WWC Standards with or without Reservations	1

Showing 1 to 15 of 42 results Save | Export

Testing for Differential Item Functioning under the "D"-Scoring Method

Peer reviewed

Direct link

Dimitrov, Dimiter M.; Atanasov, Dimitar V. – Educational and Psychological Measurement, 2022

This study offers an approach to testing for differential item functioning (DIF) in a recently developed measurement framework, referred to as "D"-scoring method (DSM). Under the proposed approach, called "P-Z" method of testing for DIF, the item response functions of two groups (reference and focal) are compared by…

Descriptors: Test Bias, Methods, Test Items, Scoring

Differential Item Functioning Analysis of the Fundamental Concepts for Organic Reaction Mechanisms Inventory

Peer reviewed

Direct link

Sachin Nedungadi; Corina E. Brown; Sue Hyeon Paek – Journal of Chemical Education, 2022

The Fundamental Concepts for Organic Reaction Mechanisms Inventory (FC-ORMI) is a concept inventory with most items in a two-tier design in which an answer tier is followed by a reasoning tier. Statistical results provided strong evidence for the validity and reliability of the data obtained using the FC-ORMI. In this study, differential item…

Descriptors: Test Bias, Test Validity, Test Reliability, Gender Differences

Aggregating Polytomous DIF Results over Multiple Test Administrations

Peer reviewed

Direct link

Zwick, Rebecca; Ye, Lei; Isham, Steven – Journal of Educational Measurement, 2018

In typical differential item functioning (DIF) assessments, an item's DIF status is not influenced by its status in previous test administrations. An item that has shown DIF at multiple administrations may be treated the same way as an item that has shown DIF in only the most recent administration. Therefore, much useful information about the…

Descriptors: Test Bias, Testing, Test Items, Bayesian Statistics

Routing Strategies and Optimizing Design for Multistage Testing in International Large-Scale Assessments

Peer reviewed

Direct link

Svetina, Dubravka; Liaw, Yuan-Ling; Rutkowski, Leslie; Rutkowski, David – Journal of Educational Measurement, 2019

This study investigates the effect of several design and administration choices on item exposure and person/item parameter recovery under a multistage test (MST) design. In a simulation study, we examine whether number-correct (NC) or item response theory (IRT) methods are differentially effective at routing students to the correct next stage(s)…

Descriptors: Measurement, Item Analysis, Test Construction, Item Response Theory

New Meridian Comparability Review Guidelines. Version 6.17.2020

Download full text

New Meridian Corporation, 2020

New Meridian Corporation has developed the "Quality Testing Standards and Criteria for Comparability Claims" (QTS). The goal of the QTS is to provide guidance to states that are interested in including content from the New Meridian item bank and intend to make comparability claims with "other assessments" that include New…

Descriptors: Testing, Standards, Comparative Analysis, Guidelines

Quality Testing Standards and Criteria for Comparability Claims. Version 6.17.2020

Download full text

New Meridian Corporation, 2020

Descriptors: Testing, Standards, Comparative Analysis, Guidelines

A Validation Trajectory for the Washington Assessment of Risks and Needs of Students

Peer reviewed

Direct link

Gotch, Chad M.; French, Brian F. – Educational Assessment, 2020

The State of Washington requires school districts to file court petitions on students with excessive unexcused absences. The "Washington Assessment of Risks and Needs of Students" (WARNS), a self-report screening instrument developed for use by high school and juvenile court personnel in such situations, purports to measure six facets of…

Descriptors: Risk Assessment, Needs Assessment, Truancy, Measurement Techniques

Computerized Testing in Reading Comprehension Skill: Investigating Score Interchangeability, Item Review, Age and Gender Stereotypes, ICT Literacy and Computer Attitudes

Peer reviewed

Direct link

Toroujeni, Seyyed Morteza Hashemi – Education and Information Technologies, 2022

Score interchangeability of Computerized Fixed-Length Linear Testing (henceforth CFLT) and Paper-and-Pencil-Based Testing (henceforth PPBT) has become a controversial issue over the last decade when technology has meaningfully restructured methods of the educational assessment. Given this controversy, various testing guidelines published on…

Descriptors: Computer Assisted Testing, Reading Tests, Reading Comprehension, Scoring

"Quality Testing Standards" -- A Starter Kit for States. Version 6.17.2020

Download full text

New Meridian Corporation, 2020

New Meridian Corporation has developed the "Quality Testing Standards and Criteria for Comparability Claims" (QTS) to provide guidance to states that are interested in including New Meridian content and would like to either keep reporting scores on the New Meridian Scale or use the New Meridian performance levels; that is, the state…

Descriptors: Testing, Standards, Comparative Analysis, Test Content

Net and Global Differential Item Functioning in PISA Polytomously Scored Science Items: Application of the Differential Step Functioning Framework

Peer reviewed

Direct link

Akour, Mutasem; Sabah, Saed; Hammouri, Hind – Journal of Psychoeducational Assessment, 2015

The purpose of this study was to apply two types of Differential Item Functioning (DIF), net and global DIF, as well as the framework of Differential Step Functioning (DSF) to real testing data to investigate measurement invariance related to test language. Data from the Program for International Student Assessment (PISA)-2006 polytomously scored…

Descriptors: Test Bias, Science Tests, Test Items, Scoring

As a Potential Source of Error, Measuring the Tendency of University Students to Copy the Answers: A Scale Development Study

Peer reviewed
PDF on ERIC

Download full text

Demir, Ergul – Eurasian Journal of Educational Research, 2018

Purpose: The answer-copying tendency has the potential to detect suspicious answer patterns for prior distributions of statistical detection techniques. The aim of this study is to develop a valid and reliable measurement tool as a scale in order to observe the tendency of university students' copying of answers. Also, it is aimed to provide…

Descriptors: College Students, Cheating, Test Construction, Student Behavior

Measuring Process Quality in Early Childhood Education and Care through Situational Judgement Questions: Findings from TALIS Starting Strong 2018 Field Trial. OECD Education Working Papers, No. 217

Direct link

Nilsen, Trude; Slot, Pauline; Cigler, Hynek; Chen, Minge – OECD Publishing, 2020

Situational Judgement Questions (SJQs) measuring process quality were included in the OECD Starting Strong Teaching and Learning International Survey 2018 (TALIS Starting Strong 2018) to address concerns of self-report bias in large-scale international surveys. These SJQs provide the staff in early childhood education and care with situations…

Descriptors: Educational Quality, Situational Tests, Administrator Surveys, Teacher Surveys

Operational Study 4: Accessibility of New Items/Functionality. Component 3 Report

Download full text

Steedle, Jeffrey; LaSalle, Amy – Partnership for Assessment of Readiness for College and Careers, 2016

Partnership for Assessment of Readiness for College and Careers (PARCC) Operational Study 4 Component 3 was designed to compare performance on PARCC mathematics field-test items for grade 3 taken with and without a drawing tool. For the 2016 testing window, five field-test items were selected to have the directions edited to allow students to…

Descriptors: Grade 3, Mathematics Tests, Test Items, Freehand Drawing

Test Review: Reynolds, C. R., Voress, J. V., Kamphaus, R. W. (2015), "Mathematics Fluency and Calculation Tests (MFaCTs) review." PRO-ED

Peer reviewed

Direct link

Marbach, Joshua – Journal of Psychoeducational Assessment, 2017

The Mathematics Fluency and Calculation Tests (MFaCTs) are a series of measures designed to assess for arithmetic calculation skills and calculation fluency in children ages 6 through 18. There are five main purposes of the MFaCTs: (1) identifying students who are behind in basic math fact automaticity; (2) evaluating possible delays in arithmetic…

Descriptors: Mathematics Tests, Computation, Mathematics Skills, Arithmetic

Development and Validation of the Written Communication Assessment of the "HEIghten"® Outcomes Assessment Suite. Research Report. ETS RR-17-53

Peer reviewed
PDF on ERIC

Download full text

Rios, Joseph A.; Sparks, Jesse R.; Zhang, Mo; Liu, Ou Lydia – ETS Research Report Series, 2017

Proficiency with written communication (WC) is critical for success in college and careers. As a result, institutions face a growing challenge to accurately evaluate their students' writing skills to obtain data that can support demands of accreditation, accountability, or curricular improvement. Many current standardized measures, however, lack…

Descriptors: Test Construction, Test Validity, Writing Tests, College Outcomes Assessment

Previous Page | Next Page »

Pages: 1 | 2 | 3

New Meridian Corporation	5
ETS Research Report Series	4
Educational and Psychological…	4
Journal of Educational…	3
International Journal of…	2
Journal of Psychoeducational…	2
ACT, Inc.	1
Applied Measurement in…	1
Applied Psychological…	1
College Board	1
Education and Information…	1
Educational Assessment	1
Educational Measurement:…	1
Eurasian Journal of…	1
IGI Global	1
Journal of Chemical Education	1
National Center for Education…	1
National Center for Education…	1
OECD Publishing	1
Partnership for Assessment of…	1
ProQuest LLC	1
Research in Developmental…	1
Society for Research on…	1
More ▼

Dorans, Neil J.	2
Akour, Mutasem	1
Ali, Usama S.	1
Atanasov, Dimitar V.	1
Benson, Jeri	1
Chang, Hua-Hua	1
Chen, Minge	1
Cigler, Hynek	1
Corina E. Brown	1
Demir, Ergul	1
Dimitrov, Dimiter M.	1
Ferrara, Steve, Ed.	1
French, Brian F.	1
Fu, Jianbin	1
Gallagher, Carole	1
Garavaglia, Diane R.	1
Gotch, Chad M.	1
Hammouri, Hind	1
Huang, Chiungjung	1
Huang, Chun-Wei	1
Isham, Steven	1
Kieftenbeld, Vincent	1
Kim, Sooyeon	1
Kim, Yongnam	1
More ▼