Publication Date
| In 2026 | 0 |
| Since 2025 | 197 |
| Since 2022 (last 5 years) | 1067 |
| Since 2017 (last 10 years) | 2577 |
| Since 2007 (last 20 years) | 4938 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
| Practitioners | 653 |
| Teachers | 563 |
| Researchers | 250 |
| Students | 201 |
| Administrators | 81 |
| Policymakers | 22 |
| Parents | 17 |
| Counselors | 8 |
| Community | 7 |
| Support Staff | 3 |
| Media Staff | 1 |
| More ▼ | |
Location
| Turkey | 225 |
| Canada | 223 |
| Australia | 155 |
| Germany | 116 |
| United States | 99 |
| China | 90 |
| Florida | 86 |
| Indonesia | 82 |
| Taiwan | 78 |
| United Kingdom | 73 |
| California | 65 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 4 |
| Meets WWC Standards with or without Reservations | 4 |
| Does not meet standards | 1 |
Schoen, Robert C.; Yang, Xiaotong; Tazaz, Amanda M.; Bray, Wendy S.; Farina, Kristy – Grantee Submission, 2019
The "2016 Knowledge for Teaching Early Elementary Mathematics" (2016 K-TEEM) test measures teachers' mathematical knowledge for teaching early elementary mathematics. The 2016 K-TEEM is the third version of the K-TEEM (Schoen, Bray, Wolfe, Tazaz, & Nielsen, 2017). In this report, we present results of the first large-scale field test…
Descriptors: Test Construction, Elementary School Mathematics, Elementary School Teachers, Knowledge Base for Teaching
Dimitrov, Dimiter M. – Educational and Psychological Measurement, 2016
This article describes an approach to test scoring, referred to as "delta scoring" (D-scoring), for tests with dichotomously scored items. The D-scoring uses information from item response theory (IRT) calibration to facilitate computations and interpretations in the context of large-scale assessments. The D-score is computed from the…
Descriptors: Scoring, Equated Scores, Test Items, Measurement
Ibbett, Nicole L.; Wheldon, Brett J. – e-Journal of Business Education and Scholarship of Teaching, 2016
In 2014 Central Queensland University (CQU) in Australia banned the use of multiple choice questions (MCQs) as an assessment tool. One of the reasons given for this decision was that MCQs provide an opportunity for students to "pass" by merely guessing their answers. The mathematical likelihood of a student passing by guessing alone can…
Descriptors: Foreign Countries, Multiple Choice Tests, Item Banks, Guessing (Tests)
Ilhan, Mustafa – Educational Sciences: Theory and Practice, 2016
The aim of this study was to compare the results of many-facet Rasch analyses based on crossed and judge pair designs. The study was conducted with 168 eighth grade students and five judges. The study data were collected using an achievement test with open-ended questions and a holistic rubric that was used to rate the responses. In the data…
Descriptors: Item Response Theory, Achievement Tests, Scoring Rubrics, Judges
Lee, HyeSun; Geisinger, Kurt F. – Educational and Psychological Measurement, 2016
The current study investigated the impact of matching criterion purification on the accuracy of differential item functioning (DIF) detection in large-scale assessments. The three matching approaches for DIF analyses (block-level matching, pooled booklet matching, and equated pooled booklet matching) were employed with the Mantel-Haenszel…
Descriptors: Test Bias, Measurement, Accuracy, Statistical Analysis
Zhang, Xinxin; Gierl, Mark – Journal of Educational Issues, 2016
The purpose of this study is to describe a methodology to recover the item model used to generate multiple-choice test items with a novel graph theory approach. Beginning with the generated test items and working backward to recover the original item model provides a model-based method for validating the content used to automatically generate test…
Descriptors: Test Items, Automation, Content Validity, Test Validity
Nichols, Bryan E. – Update: Applications of Research in Music Education, 2016
The purpose of this review of literature was to identify research findings for designing assessments in singing accuracy. The aim was to specify the test construction variables that directly affect test performance to guide future design in singing accuracy assessment for research and classroom uses. Three pitch-matching tasks--single pitch,…
Descriptors: Singing, Accuracy, Music, Music Education
Russell, Michael – Journal of Applied Testing Technology, 2016
Interest in and use of technology-enhanced items has increased over the past decade. Given the additional time required to administer many technology-enhanced items and the increased expense required to develop them, it is important for testing programs to consider the utility of technology-enhanced items. The Technology-Enhanced Item Utility…
Descriptors: Test Items, Computer Assisted Testing, Models, Fidelity
Tassé, Marc J.; Schalock, Robert L.; Thissen, David; Balboni, Giulia; Bersani, Henry, Jr.; Borthwick-Duffy, Sharon A.; Spreat, Scott; Widaman, Keith F.; Zhang, Dalun; Navas, Patricia – American Journal on Intellectual and Developmental Disabilities, 2016
The Diagnostic Adaptive Behavior Scale (DABS) was developed using item response theory (IRT) methods and was constructed to provide the most precise and valid adaptive behavior information at or near the cutoff point of making a decision regarding a diagnosis of intellectual disability. The DABS initial item pool consisted of 260 items. Using IRT…
Descriptors: Item Response Theory, Test Items, Test Construction, Intellectual Disability
Harlacher, Jason – Regional Educational Laboratory Central, 2016
Educators have many decisions to make and it's important that they have the right data to inform those decisions and access to questionnaires that can gather that data. This guide, developed by REL Central and based on work done through separate projects with the Wyoming Office of Public Instruction and the Nebraska Department of Education,…
Descriptors: Questionnaires, Test Construction, Student Surveys, Teacher Surveys
Mao, Xiuzhen; Ozdemir, Burhanettin; Wang, Yating; Xiu, Tao – Online Submission, 2016
Four item selection indexes with and without exposure control are evaluated and compared in multidimensional computerized adaptive testing (CAT). The four item selection indices are D-optimality, Posterior expectation Kullback-Leibler information (KLP), the minimized error variance of the linear combination score with equal weight (V1), and the…
Descriptors: Comparative Analysis, Adaptive Testing, Computer Assisted Testing, Test Items
Wahyuni, Tutik; Suwandi, Sarwiji; Slamet, St. Y.; Andayani – International Journal of Instruction, 2018
The objective of this present research is to develop an Indonesian Syntax textbook. At the exploratory stage, a descriptive-qualitative approach was adopted. The data were collected using a documentary study, observations, and questionnaires and analyzed through a contextual model. The model was experimentally tested. At this stage, some main…
Descriptors: Foreign Countries, Syntax, Pretests Posttests, Textbooks
Venticinque, Danilo; Whitworth, Andrew – Journal of Media Literacy Education, 2018
This article discusses the outcomes of research into the media literacy aspects of ENEM ("Exame Nacional do Ensino Médio"), Brazil's unified university entrance exam, which contains a significant number of exam questions based on excerpts from newspaper articles, online news and other media sources. Through content analysis, these…
Descriptors: Foreign Countries, College Entrance Examinations, Media Literacy, Test Content
Balboni, Giulia; Perrucci, Vittore; Cacciamani, Stefano; Zumbo, Bruno D. – Distance Education, 2018
Creating a sense of community in online classes contributes to student retention and to their overall satisfaction with the course itself. This study aimed to develop a scale of sense of community of students attending online university courses. A series of ordinal exploratory factor analyses were conducted on data obtained from 839 students…
Descriptors: Likert Scales, Test Construction, Sense of Community, Online Courses
Chu, Hye-Eun; Chandrasegaran, A. L.; Treagust, David F. – School Science Review, 2018
The purpose of this research was to investigate an efficient method to assess year 8 (age 13-14) students' conceptual understanding of heat and temperature concepts. Two different types of instruments were used in this study: Type 1, consisting of multiple-choice items with open-ended justifications; and Type 2, consisting of two-tier…
Descriptors: Comparative Analysis, Item Analysis, Test Items, Science Tests

Direct link
Peer reviewed
