Zachary K. Collier; Minji Kong; Olushola Soyoye; Kamal Chawla; Ann M. Aviles; Yasser Payne – Journal of Educational and Behavioral Statistics, 2024
Asymmetric Likert-type items in research studies can present several challenges in data analysis, particularly concerning missing data. These items are often characterized by a skewed scaling, where either there is no neutral response option or an unequal number of possible positive and negative responses. The use of conventional techniques, such…
Descriptors: Likert Scales, Test Items, Item Analysis, Evaluation Methods
Joo, Seang-Hwane; Lee, Philseok – Journal of Educational Measurement, 2022
This study proposes a new Bayesian differential item functioning (DIF) detection method using posterior predictive model checking (PPMC). Item fit measures including infit, outfit, observed score distribution (OSD), and Q1 were considered as discrepancy statistics for the PPMC DIF methods. The performance of the PPMC DIF method was…
Descriptors: Test Items, Bayesian Statistics, Monte Carlo Methods, Prediction
Kim, Kyung Yong – Journal of Educational Measurement, 2020
New items are often evaluated prior to their operational use to obtain item response theory (IRT) item parameter estimates for quality control purposes. Fixed parameter calibration is one linking method that is widely used to estimate parameters for new items and place them on the desired scale. This article provides detailed descriptions of two…
Descriptors: Item Response Theory, Evaluation Methods, Test Items, Simulation
Achieve, Inc., 2018
In 2013, the Council of Chief State School Officers (CCSSO), working collaboratively with state education agencies, released a set of criteria for states to use to evaluate and procure high-quality assessments. The mathematics section of the document included five content-specific criteria to evaluate alignment of assessments to college- and…
Descriptors: Mathematics Tests, Difficulty Level, Evaluation Criteria, Cognitive Processes
Drabinová, Adéla; Martinková, Patrícia – Journal of Educational Measurement, 2017
In this article we present a general approach not relying on item response theory models (non-IRT) to detect differential item functioning (DIF) in dichotomous items in the presence of guessing. The proposed nonlinear regression (NLR) procedure for DIF detection is an extension of the method based on logistic regression. As a non-IRT approach, NLR can…
Descriptors: Test Items, Regression (Statistics), Guessing (Tests), Identification
Achieve, Inc., 2019
In 2013, the Council of Chief State School Officers (CCSSO), working collaboratively with state education agencies, released a set of criteria for states to use to evaluate and procure high-quality assessments. The English Language Arts (ELA)/Literacy section of the document included nine content-specific criteria to evaluate the alignment of…
Descriptors: Reading Skills, Student Evaluation, Evaluation Methods, Reading Tests
Hartley, James – Psychology Teaching Review, 2017
In this article, Hartley notes the difficulties of using questionnaires to assess the efficiency of new instructional methods and highlights nine issues that researchers must consider. Hartley continues the discussion about the use of questionnaires and suggests that psychology teachers can help improve the teaching of psychology by drawing…
Descriptors: Questionnaires, Instructional Innovation, Instructional Effectiveness, Teaching Methods
Hardré, Patricia L.; Hackett, Shannon – Educational Assessment, Evaluation and Accountability, 2015
This manuscript chronicles the process and products of a redesign for evaluation of the graduate college experience (GCE) which was initiated by a university graduate college, based on its observed need to reconsider and update its measures and methods for assessing graduate students' experiences. We examined the existing instrumentation and…
Descriptors: Test Construction, Graduate Students, Student Experience, Evaluation Methods
Stotsky, Sandra – Pioneer Institute for Public Policy Research, 2015
In this testimony, the author first describes her qualifications, as well as the lack of relevant qualifications in Common Core's standards writers and in most of the members of Common Core's Validation Committee, on which she served in 2009-2010. The author then details some of the many problems in the 2011 Massachusetts ELA standards, written by…
Descriptors: Common Core State Standards, Standardized Tests, Language Arts, English Instruction
Ferrando, Pere J. – Psicologica: International Journal of Methodology and Experimental Psychology, 2012
Model-based attempts to rigorously study the broad and imprecise concept of "discriminating power" are scarce, and generally limited to nonlinear models for binary responses. This paper proposes a comprehensive framework for assessing the discriminating power of item and test scores which are analyzed or obtained using Spearman's…
Descriptors: Student Evaluation, Psychometrics, Test Items, Scores
Debelak, Rudolf; Arendasy, Martin – Educational and Psychological Measurement, 2012
A new approach to identify item clusters fitting the Rasch model is described and evaluated using simulated and real data. The proposed method is based on hierarchical cluster analysis and constructs clusters of items that show a good fit to the Rasch model. It thus gives an estimate of the number of independent scales satisfying the postulates of…
Descriptors: Test Items, Factor Analysis, Evaluation Methods, Simulation
Orrill, Chandra Hawley; Kim, Ok-Kyeong; Peters, Susan A.; Lischka, Alyson E.; Jong, Cindy; Sanchez, Wendy B.; Eli, Jennifer A. – Mathematics Teacher Education and Development, 2015
Developing and writing assessment items that measure teachers' knowledge is an intricate and complex undertaking. In this paper, we begin with an overview of what is known about measuring teacher knowledge. We then highlight the challenges inherent in creating assessment items that focus specifically on measuring teachers' specialised knowledge…
Descriptors: Specialization, Knowledge Base for Teaching, Educational Strategies, Testing Problems
Demir, Yusuf; Ertas, Abdullah – Reading Matrix: An International Online Journal, 2014
Coursebook evaluation helps practitioners decide on the most appropriate coursebook to use. Moreover, the evaluation process makes it possible to predict the potential strengths and weaknesses of a given coursebook. The checklist method is probably the most widely adopted way of judging coursebooks and there are plenty of ELT coursebook evaluation…
Descriptors: Check Lists, Course Evaluation, Instructional Material Evaluation, Media Selection
Jiao, Hong; Liu, Junhui; Haynie, Kathleen; Woo, Ada; Gorham, Jerry – Educational and Psychological Measurement, 2012
This study explored the impact of partial credit scoring of one type of innovative items (multiple-response items) in a computerized adaptive version of a large-scale licensure pretest and operational test settings. The impacts of partial credit scoring on the estimation of the ability parameters and classification decisions in operational test…
Descriptors: Test Items, Computer Assisted Testing, Measures (Individuals), Scoring
Wills, Andy J.; Pothos, Emmanuel M. – Psychological Bulletin, 2012
Categorization is one of the fundamental building blocks of cognition, and the study of categorization is notable for the extent to which formal modeling has been a central and influential component of research. However, the field has seen a proliferation of noncomplementary models with little consensus on the relative adequacy of these accounts.…
Descriptors: Classification, Computation, Test Items, Generalizability Theory