Choe, Edison M.; Han, Kyung T. – Journal of Educational Measurement, 2022
In operational testing, item response theory (IRT) models for dichotomous responses are popular for measuring a single latent construct [theta], such as cognitive ability in a content domain. Estimates of [theta], also called IRT scores or [theta hat], can be computed using estimators based on the likelihood function, such as maximum likelihood…
Descriptors: Scores, Item Response Theory, Test Items, Test Format
Wheeler, Jordan M.; Engelhard, George; Wang, Jue – Measurement: Interdisciplinary Research and Perspectives, 2022
Objectively scoring constructed-response items on educational assessments has long been a challenge due to the use of human raters. Even well-trained raters using a rubric can inaccurately assess essays. Unfolding models measure raters' scoring accuracy by capturing the discrepancy between criterion and operational ratings by placing essays on an…
Descriptors: Accuracy, Scoring, Statistical Analysis, Models
Rios, Joseph – Applied Measurement in Education, 2022
To mitigate the deleterious effects of rapid guessing (RG) on ability estimates, several rescoring procedures have been proposed. Underlying many of these procedures is the assumption that RG is accurately identified. At present, there have been minimal investigations examining the utility of rescoring approaches when RG is misclassified, and…
Descriptors: Accuracy, Guessing (Tests), Scoring, Classification
Pan, Yiqin; Wollack, James A. – Educational Measurement: Issues and Practice, 2023
Pan and Wollack (PW) proposed a machine learning method to detect compromised items. We extend the work of PW to an approach detecting compromised items and examinees with item preknowledge simultaneously and draw on ideas in ensemble learning to relax several limitations in the work of PW. The suggested approach also provides a confidence score,…
Descriptors: Artificial Intelligence, Prior Learning, Item Analysis, Test Content
Pierce, Corey D.; Epstein, Michael H.; Wood, Matthew D. – Journal of Emotional and Behavioral Disorders, 2023
Strength-based assessment has achieved acceptance from educational, mental health, and social service professionals as a means of measuring emotional and behavioral strengths of children. Several standardized, norm-referenced tests have been developed to assess these strengths; however, the primary mode of assessment is via informal interviews of…
Descriptors: Behavior Rating Scales, Content Validity, Psychometrics, Mental Health
Ko, Yeonjoo; Shim, Sungok Serena; Lee, Hyunju – International Journal of Science and Mathematics Education, 2023
The discussion of social responsibility has expanded from ethical conduct of research to include a broader range of topics including participating in the policy-making process, engaging in public discourse related to science, and promoting projects serving societal common goods. Although such an expanded framework on social responsibility is…
Descriptors: Test Construction, Test Validity, Attitude Measures, Social Responsibility
He, Yinhong – Journal of Educational Measurement, 2023
Back random responding (BRR) behavior is one of the commonly observed careless response behaviors. Accurately detecting BRR behavior can improve test validities. Yu and Cheng (2019) showed that the change point analysis (CPA) procedure based on weighted residual (CPA-WR) performed well in detecting BRR. Compared with the CPA procedure, the…
Descriptors: Test Validity, Item Response Theory, Measurement, Monte Carlo Methods
van der Linden, Wim J.; Belov, Dmitry I. – Journal of Educational Measurement, 2023
A test of item compromise is presented which combines the test takers' responses and response times (RTs) into a statistic defined as the number of correct responses on the item for test takers with RTs flagged as suspicious. The test has null and alternative distributions belonging to the well-known family of compound binomial distributions, is…
Descriptors: Item Response Theory, Reaction Time, Test Items, Item Analysis
Serhan Sarioglu; Bulut Demir; Ümmühan Ormanci; Salih Çepni – Journal of Teacher Education and Educators, 2023
This study aims to obtain and compare the opinions of exam question writers and teachers on skill-based exam questions. 24 science teachers and 11 context-based exam question writers participated in the study, which was carried out according to the convergent parallel design. Opinions of both parties on the skill-based exam questions were…
Descriptors: Tests, Authors, Science Tests, Foreign Countries
Abdolvahab Khademi; Craig S. Wells; Maria Elena Oliveri; Ester Villalonga-Olives – SAGE Open, 2023
The most common effect size when using a multiple-group confirmatory factor analysis approach to measurement invariance is [delta]CFI and [delta]TLI with a cutoff value of 0.01. However, this recommended cutoff value may not be ubiquitously appropriate and may be of limited application for some tests (e.g., measures using dichotomous items or…
Descriptors: Factor Analysis, Factor Structure, Error of Measurement, Test Items
Lewis, Samala B. – ProQuest LLC, 2023
This dissertation measures the multicultural teaching competency (MTC) and the frequency at which science educators use culturally relevant educational practices (CREPs). This study's mixed-method, convergent design is grounded in critical theory, and the MTC and culturally relevant education (CRE) framework. Findings suggest that the CREPs-F…
Descriptors: Cultural Pluralism, Culturally Relevant Education, Teacher Competencies, Science Teachers
Jacquelyn Thompson – ProQuest LLC, 2023
Quality and accountability remain meaningful discussions for university-based traditional educator preparation programs (EPPs) that prepare most new teachers. A fundamental premise of this study is to create rigorous and transparent measures that are intentionally designed and aligned with pre-service teacher (PST) readiness standards, which is…
Descriptors: Preservice Teachers, Career Readiness, Standards, Teacher Effectiveness
Parsons, Seth A.; Ives, Samantha T.; Fields, R. Stacy; Barksdale, Bonnie; Marine, Jonathan; Rogers, Paul – Reading Teacher, 2023
Students who are engaged writers are likely to produce better writing and to enjoy writing more than students who are disengaged writers. Yet, we are unaware of any existing tool that validly and reliably measures writing engagement. In this article, we describe what writing engagement is and why it is important. Then, we present the Writing…
Descriptors: Learner Engagement, Writing (Composition), Writing Attitudes, Measures (Individuals)
Gonzales, Fredrick – ProQuest LLC, 2023
This study examines the relationship between two secondary End of Course (EOC) exams, the Biology EOC and the English I EOC exams, and their impact on Emergent Bilingual (EB) students in a small district in South Texas. This study is a mixed methods study which uses both quantitative and qualitative data to answer three research questions: (a)…
Descriptors: Secondary School Students, Bilingual Students, Biology, Exit Examinations
Kearney, Grainne P.; Corman, Michael K.; Johnston, Jennifer L.; Hart, Nigel D.; Gormley, Gerard J. – Advances in Health Sciences Education, 2023
New public management ideals and standards have become increasingly adhered to in health professions education; this is particularly apparent in high-stakes assessment, as a gateway to practice. Using an Institutional Ethnographic approach, we looked at the work involved in running high-stakes Objective Structured Clinical Exams (OSCEs) throughout…
Descriptors: High Stakes Tests, Allied Health Occupations Education, Medical Education, Ethnography

