Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 2 |
Since 2016 (last 10 years) | 10 |
Since 2006 (last 20 years) | 27 |
Descriptor
Source
Author
De Ayala, R. J. | 3 |
Hambleton, Ronald K. | 3 |
Miller, M. David | 3 |
Oshima, T. C. | 3 |
Donoghue, John R. | 2 |
Drasgow, Fritz | 2 |
Hoijtink, Herbert | 2 |
Liou, Michelle | 2 |
Nandakumar, Ratna | 2 |
Stout, William | 2 |
Swaminathan, Hariharan | 2 |
More ▼ |
Publication Type
Journal Articles | 82 |
Reports - Evaluative | 43 |
Reports - Research | 28 |
Reports - Descriptive | 9 |
Speeches/Meeting Papers | 4 |
Information Analyses | 1 |
Opinion Papers | 1 |
Tests/Questionnaires | 1 |
Education Level
Higher Education | 5 |
Elementary Education | 3 |
Postsecondary Education | 3 |
Secondary Education | 3 |
Junior High Schools | 2 |
Middle Schools | 2 |
Early Childhood Education | 1 |
Grade 3 | 1 |
Grade 4 | 1 |
Grade 5 | 1 |
Grade 6 | 1 |
More ▼ |
Audience
Researchers | 1 |
Laws, Policies, & Programs
Assessments and Surveys
Armed Services Vocational… | 1 |
General Social Survey | 1 |
Program for International… | 1 |
United States Medical… | 1 |
What Works Clearinghouse Rating
Student, Sanford R. – Educational Researcher, 2022
Empirical growth benchmarks, as introduced by Hill, Bloom, Black, and Lipsey (2008), are a well-known way to contextualize effect sizes in education research. Past work on these benchmarks, both positive and negative, has largely avoided confronting the role of vertical scales, yet technical issues with vertical scales trouble the use of such…
Descriptors: Computer Simulation, Benchmarking, Effect Size, Intervention
Suzumura, Nana – Language Assessment Quarterly, 2022
The present study is part of a larger mixed methods project that investigated the speaking section of the Advanced Placement (AP) Japanese Language and Culture Exam. It investigated assumptions for the evaluation inference through a content analysis of test taker responses. Results of the content analysis were integrated with those of a many-facet…
Descriptors: Content Analysis, Test Wiseness, Advanced Placement, Computer Assisted Testing
Yoo, Hanwook; Hambleton, Ronald K. – Educational Measurement: Issues and Practice, 2019
Item analysis is an integral part of operational test development and is typically conducted within two popular statistical frameworks: classical test theory (CTT) and item response theory (IRT). In this digital ITEMS module, Hanwook Yoo and Ronald K. Hambleton provide an accessible overview of operational item analysis approaches within these…
Descriptors: Item Analysis, Item Response Theory, Guidelines, Test Construction
Kalinowski, Steven T. – Educational and Psychological Measurement, 2019
Item response theory (IRT) is a statistical paradigm for developing educational tests and assessing students. IRT, however, currently lacks an established graphical method for examining model fit for the three-parameter logistic model, the most flexible and popular IRT model in educational testing. A method is presented here to do this. The graph,…
Descriptors: Item Response Theory, Educational Assessment, Goodness of Fit, Probability
Scoular, Claire; Eleftheriadou, Sofia; Ramalingam, Dara; Cloney, Dan – Australian Journal of Education, 2020
Collaboration is a complex skill, comprised of multiple subskills, that is of growing interest to policy makers, educators and researchers. Several definitions and frameworks have been described in the literature to support assessment of collaboration; however, the inherent structure of the construct still needs better definition. In 2015, the…
Descriptors: Cooperative Learning, Problem Solving, Computer Assisted Testing, Comparative Analysis
Hameed, Paiker Fatima Mazhar – TESOL International Journal, 2020
Three-dimensional virtual (3D) environments provide EFL students with a rich and dynamic multimodal understanding of vocabulary. This study aims to explore implementing a 3D vocabulary learning strategy for young students on EFL vocabulary. The relationship between two facets of learning -- autonomy and collaboration -- has been studied in…
Descriptors: Student Centered Learning, Vocabulary Development, Second Language Learning, Second Language Instruction
Feinberg, Richard A.; Rubright, Jonathan D. – Educational Measurement: Issues and Practice, 2016
Simulation studies are fundamental to psychometric discourse and play a crucial role in operational and academic research. Yet, resources for psychometricians interested in conducting simulations are scarce. This Instructional Topics in Educational Measurement Series (ITEMS) module is meant to address this deficiency by providing a comprehensive…
Descriptors: Simulation, Psychometrics, Vocabulary, Research Design
Aybek, Eren Can; Demirtasli, R. Nukhet – International Journal of Research in Education and Science, 2017
This article aims to provide a theoretical framework for computerized adaptive tests (CAT) and item response theory models for polytomous items. Besides that, it aims to introduce the simulation and live CAT software to the related researchers. Computerized adaptive test algorithm, assumptions of item response theory models, nominal response…
Descriptors: Computer Assisted Testing, Adaptive Testing, Item Response Theory, Test Items
Makransky, Guido; Mayer, Richard; Nøremølle, Anne; Cordoba, Ainara Lopez; Wandall, Jakob; Bonde, Mads – Educational Technology Research and Development, 2020
There is great potential in making assessment and learning complementary. In this study, we investigated the feasibility of developing a desktop virtual reality (VR) laboratory simulation on the topic of genetics, with integrated assessment using multiple choice questions based on item response theory (IRT) and feedback based on the cognitive…
Descriptors: Student Evaluation, Feedback (Response), Computer Simulation, Computer Uses in Education
Wilson, Mark; Gochyyev, Perman; Scalise, Kathleen – Journal of Educational Measurement, 2017
This article summarizes assessment of cognitive skills through collaborative tasks, using field test results from the Assessment and Teaching of 21st Century Skills (ATC21S) project. This project, sponsored by Cisco, Intel, and Microsoft, aims to help educators around the world enable students with the skills to succeed in future career and…
Descriptors: Cognitive Ability, Thinking Skills, Evaluation Methods, Educational Assessment
Choi, Seung W.; Podrabsky, Tracy; McKinney, Natalie – Applied Psychological Measurement, 2012
Computerized adaptive testing (CAT) enables efficient and flexible measurement of latent constructs. The majority of educational and cognitive measurement constructs are based on dichotomous item response theory (IRT) models. An integral part of developing various components of a CAT system is conducting simulations using both known and empirical…
Descriptors: Computer Assisted Testing, Adaptive Testing, Computer Software, Item Response Theory
Kahraman, Nilüfer – Eurasian Journal of Educational Research, 2014
Problem: Practitioners working with multiple-choice tests have long utilized Item Response Theory (IRT) models to evaluate the performance of test items for quality assurance. The use of similar applications for performance tests, however, is often encumbered due to the challenges encountered in working with complicated data sets in which local…
Descriptors: Item Response Theory, Licensing Examinations (Professions), Performance Based Assessment, Computer Simulation
Jin, Kuan-Yu; Wang, Wen-Chung – Educational and Psychological Measurement, 2014
Extreme response style (ERS) is a systematic tendency for a person to endorse extreme options (e.g., strongly disagree, strongly agree) on Likert-type or rating-scale items. In this study, we develop a new class of item response theory (IRT) models to account for ERS so that the target latent trait is free from the response style and the tendency…
Descriptors: Item Response Theory, Research Methodology, Bayesian Statistics, Response Style (Tests)
Wyse, Adam E.; Albano, Anthony D. – Applied Measurement in Education, 2015
This article used several data sets from a large-scale state testing program to examine the feasibility of combining general and modified assessment items in computerized adaptive testing (CAT) for different groups of students. Results suggested that several of the assumptions made when employing this type of mixed-item CAT may not be met for…
Descriptors: Adaptive Testing, Computer Assisted Testing, Test Items, Testing Programs
MacCoun, Robert J. – Psychological Review, 2012
[Correction Notice: An erratum for this article was reported in Vol 119(2) of Psychological Review (see record 2012-06153-001). In the article, incorrect versions of figures 3 and 6 were included. Also, Table 8 should have included the following information in the table footnote "P(A V) = probability of acquittal given unanimous verdict." All…
Descriptors: Social Influences, Probability, Item Response Theory, Psychological Studies