ERIC - Search Results

Publication Date

In 2025	0
Since 2024	1
Since 2021 (last 5 years)	2
Since 2016 (last 10 years)	8
Since 2006 (last 20 years)	86

Descriptor

Evaluation Methods	160
Test Items	160
Test Construction	49
Item Response Theory	42
Student Evaluation	35
Psychometrics	28
Computer Assisted Testing	27
Simulation	27
Foreign Countries	25
Item Analysis	25
Test Validity	24
Educational Assessment	21
Measurement Techniques	21
Comparative Analysis	19
Test Bias	19
Measures (Individuals)	18
Scores	18
Models	17
Data Analysis	16
Mathematics Tests	16
Research Methodology	16
Evaluation Research	15
Internet	14
Classification	13
Item Banks	13
More ▼

Publication Type

Reports - Evaluative	160
Journal Articles	106
Speeches/Meeting Papers	28
Numerical/Quantitative Data	11
Tests/Questionnaires	4
Information Analyses	3
Dissertations/Theses -…	1
Opinion Papers	1
Reports - Descriptive	1
Reports - Research	1

Education Level

Elementary Secondary Education	22
Higher Education	15
Elementary Education	14
Grade 8	10
Grade 4	6
Postsecondary Education	5
Secondary Education	5
Grade 6	4
Grade 5	3
Middle Schools	3
Grade 12	2
High Schools	2
Early Childhood Education	1
Grade 10	1
Grade 2	1
Grade 3	1
Grade 7	1
Grade 9	1
Intermediate Grades	1
Junior High Schools	1
Kindergarten	1
More ▼

Audience

Practitioners	3
Researchers	1
Teachers	1

Location

Oregon	8
Taiwan	3
United States	3
Asia	2
Canada	2
Japan	2
Massachusetts	2
Australia	1
California	1
China	1
Dominica	1
Grenada	1
Mississippi	1
Netherlands	1
Ohio	1
Pennsylvania	1
Portugal	1
Puerto Rico	1
Saint Lucia	1
Saint Vincent and the…	1
South Korea	1
Turkey	1
Washington	1
More ▼

Laws, Policies, & Programs

Individuals with Disabilities…	8
Education for All Handicapped…	1
Elementary and Secondary…	1
No Child Left Behind Act 2001	1

Assessments and Surveys

National Assessment of…	5
Program for International…	4
SAT (College Admission Test)	3
Trends in International…	2
Armed Services Vocational…	1
Bayley Scales of Infant…	1
Flesch Kincaid Grade Level…	1
Hidden Figures Test	1
Maslach Burnout Inventory	1
Massachusetts Comprehensive…	1
Wechsler Adult Intelligence…	1
Wechsler Intelligence Scale…	1
More ▼

What Works Clearinghouse Rating

Showing 1 to 15 of 160 results Save | Export

Deep Learning Imputation for Asymmetric and Incomplete Likert-Type Items

Peer reviewed

Direct link

Zachary K. Collier; Minji Kong; Olushola Soyoye; Kamal Chawla; Ann M. Aviles; Yasser Payne – Journal of Educational and Behavioral Statistics, 2024

Asymmetric Likert-type items in research studies can present several challenges in data analysis, particularly concerning missing data. These items are often characterized by a skewed scaling, where either there is no neutral response option or an unequal number of possible positive and negative responses. The use of conventional techniques, such…

Descriptors: Likert Scales, Test Items, Item Analysis, Evaluation Methods

Detecting Differential Item Functioning Using Posterior Predictive Model Checking: A Comparison of Discrepancy Statistics

Peer reviewed

Direct link

Joo, Seang-Hwane; Lee, Philseok – Journal of Educational Measurement, 2022

Abstract This study proposes a new Bayesian differential item functioning (DIF) detection method using posterior predictive model checking (PPMC). Item fit measures including infit, outfit, observed score distribution (OSD), and Q1 were considered as discrepancy statistics for the PPMC DIF methods. The performance of the PPMC DIF method was…

Descriptors: Test Items, Bayesian Statistics, Monte Carlo Methods, Prediction

Two IRT Fixed Parameter Calibration Methods for the Bifactor Model

Peer reviewed

Direct link

Kim, Kyung Yong – Journal of Educational Measurement, 2020

New items are often evaluated prior to their operational use to obtain item response theory (IRT) item parameter estimates for quality control purposes. Fixed parameter calibration is one linking method that is widely used to estimate parameters for new items and place them on the desired scale. This article provides detailed descriptions of two…

Descriptors: Item Response Theory, Evaluation Methods, Test Items, Simulation

A Framework to Evaluate Cognitive Complexity in Mathematics Assessments

Download full text

Achieve, Inc., 2018

In 2013, the Council of Chief State School Officers (CCSSO), working collaboratively with state education agencies, released a set of criteria for states to use to evaluate and procure high-quality assessments. The mathematics section of the document included five content-specific criteria to evaluate alignment of assessments to college- and…

Descriptors: Mathematics Tests, Difficulty Level, Evaluation Criteria, Cognitive Processes

Detection of Differential Item Functioning with Nonlinear Regression: A Non-IRT Approach Accounting for Guessing

Peer reviewed

Direct link

Drabinová, Adéla; Martinková, Patrícia – Journal of Educational Measurement, 2017

In this article we present a general approach not relying on item response theory models (non-IRT) to detect differential item functioning (DIF) in dichotomous items with presence of guessing. The proposed nonlinear regression (NLR) procedure for DIF detection is an extension of method based on logistic regression. As a non-IRT approach, NLR can…

Descriptors: Test Items, Regression (Statistics), Guessing (Tests), Identification

A Framework to Evaluate Cognitive Complexity in Reading Assessments

Download full text

Achieve, Inc., 2019

In 2013, the Council of Chief State School Officers (CCSSO), working collaboratively with state education agencies, released a set of criteria for states to use to evaluate and procure high-quality assessments. The English Language Arts (ELA)/Literacy section of the document included nine content-specific criteria to evaluate the alignment of…

Descriptors: Reading Skills, Student Evaluation, Evaluation Methods, Reading Tests

How Can We Help Our Students Be More Critical? Examining the Details in Questionnaire Studies

Peer reviewed
PDF on ERIC

Download full text

Direct link

Hartley, James – Psychology Teaching Review, 2017

In this article, Hartley notes the difficulties of using questionnaires to assess the efficiency of new instructional methods and highlights nine issues that researchers must consider. Hartley continues the discussion about the use of questionnaires and suggests that psychology teachers can help improve the teaching of psychology by drawing…

Descriptors: Questionnaires, Instructional Innovation, Instructional Effectiveness, Teaching Methods

Beyond Instrumentation: Redesigning Measures and Methods for Evaluating the Graduate College Experience

Peer reviewed

Direct link

Hardré, Patricia L.; Hackett, Shannon – Educational Assessment, Evaluation and Accountability, 2015

This manuscript chronicles the process and products of a redesign for evaluation of the graduate college experience (GCE) which was initiated by a university graduate college, based on its observed need to reconsider and update its measures and methods for assessing graduate students' experiences. We examined the existing instrumentation and…

Descriptors: Test Construction, Graduate Students, Student Experience, Evaluation Methods

Why Massachusetts Should Abandon the PARCC Tests and the 2011 Coleman et al English Language Arts Standards on Which the MCAS Tests Are Based. Testimony

Download full text

Stotsky, Sandra – Pioneer Institute for Public Policy Research, 2015

In this testimony, the author first describes her qualifications, as well as the lack of relevant qualifications in Common Core's standards writers and in most of the members of Common Core's Validation Committee, on which she served in 2009-2010. The author then details some of the many problems in the 2011 Massachusetts ELA standards, written by…

Descriptors: Common Core State Standards, Standardized Tests, Language Arts, English Instruction

Assessing the Discriminating Power of Item and Test Scores in the Linear Factor-Analysis Model

Peer reviewed
PDF on ERIC

Download full text

Ferrando, Pere J. – Psicologica: International Journal of Methodology and Experimental Psychology, 2012

Model-based attempts to rigorously study the broad and imprecise concept of "discriminating power" are scarce, and generally limited to nonlinear models for binary responses. This paper proposes a comprehensive framework for assessing the discriminating power of item and test scores which are analyzed or obtained using Spearman's…

Descriptors: Student Evaluation, Psychometrics, Test Items, Scores

An Algorithm for Testing Unidimensionality and Clustering Items in Rasch Measurement

Peer reviewed

Direct link

Debelak, Rudolf; Arendasy, Martin – Educational and Psychological Measurement, 2012

A new approach to identify item clusters fitting the Rasch model is described and evaluated using simulated and real data. The proposed method is based on hierarchical cluster analysis and constructs clusters of items that show a good fit to the Rasch model. It thus gives an estimate of the number of independent scales satisfying the postulates of…

Descriptors: Test Items, Factor Analysis, Evaluation Methods, Simulation

Challenges and Strategies for Assessing Specialised Knowledge for Teaching

Peer reviewed
PDF on ERIC

Download full text

Orrill, Chandra Hawley; Kim, Ok-Kyeong; Peters, Susan A.; Lischka, Alyson E.; Jong, Cindy; Sanchez, Wendy B.; Eli, Jennifer A. – Mathematics Teacher Education and Development, 2015

Developing and writing assessment items that measure teachers' knowledge is an intricate and complex undertaking. In this paper, we begin with an overview of what is known about measuring teacher knowledge. We then highlight the challenges inherent in creating assessment items that focus specifically on measuring teachers' specialised knowledge…

Descriptors: Specialization, Knowledge Base for Teaching, Educational Strategies, Testing Problems

Peer reviewed

Direct link

Demir, Yusuf; Ertas, Abdullah – Reading Matrix: An International Online Journal, 2014

Coursebook evaluation helps practitioners decide on the most appropriate coursebook to be exploited. Moreover, evaluation process enables to predict the potential strengths and weaknesses of a given coursebook. Checklist method is probably the most widely adopted way of judging coursebooks and there are plenty of ELT coursebook evaluation…

Descriptors: Check Lists, Course Evaluation, Instructional Material Evaluation, Media Selection

Comparison between Dichotomous and Polytomous Scoring of Innovative Items in a Large-Scale Computerized Adaptive Test

Peer reviewed

Direct link

Jiao, Hong; Liu, Junhui; Haynie, Kathleen; Woo, Ada; Gorham, Jerry – Educational and Psychological Measurement, 2012

This study explored the impact of partial credit scoring of one type of innovative items (multiple-response items) in a computerized adaptive version of a large-scale licensure pretest and operational test settings. The impacts of partial credit scoring on the estimation of the ability parameters and classification decisions in operational test…

Descriptors: Test Items, Computer Assisted Testing, Measures (Individuals), Scoring

On the Adequacy of Current Empirical Evaluations of Formal Models of Categorization

Peer reviewed

Direct link

Wills, Andy J.; Pothos, Emmanuel M. – Psychological Bulletin, 2012

Categorization is one of the fundamental building blocks of cognition, and the study of categorization is notable for the extent to which formal modeling has been a central and influential component of research. However, the field has seen a proliferation of noncomplementary models with little consensus on the relative adequacy of these accounts.…

Descriptors: Classification, Computation, Test Items, Generalizability Theory

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11

Journal of Educational…	12
Educational and Psychological…	11
Behavioral Research and…	8
Applied Measurement in…	7
Applied Psychological…	6
Online Submission	5
Journal of Educational and…	4
Measurement:…	4
Studies in Educational…	3
Achieve, Inc.	2
Assessment for Effective…	2
CBE - Life Sciences Education	2
Computers & Education	2
International Journal of…	2
Journal of Research in…	2
Language Testing	2
Multivariate Behavioral…	2
Psychological Assessment	2
Structural Equation Modeling:…	2
American Institutes for…	1
Assessment	1
Assessment & Evaluation in…	1
Assessment and Evaluation in…	1
British Journal of…	1
College and University	1
More ▼

Alonzo, Julie	8
Tindal, Gerald	8
Lai, Cheng Fei	7
Hambleton, Ronald K.	5
Nandakumar, Ratna	4
Hill, Heather C.	3
Rogers, H. Jane	3
Wang, Wen-Chung	3
van der Linden, Wim J.	3
Blunk, Merrie	2
Dorans, Neil J.	2
Gierl, Mark J.	2
Goffney, Imani Masters	2
Jiao, Hong	2
Johanson, George A.	2
Sireci, Stephen G.	2
Su, Ya-Hui	2
Wu, Margaret	2
Zwick, Rebecca	2
Abedi, Jamal	1
Ackerman, Terry A.	1
Aggen, Steven H.	1
Ainley, John	1
Akers, Katherine G.	1
More ▼