Publication Date
In 2025 | 3 |
Since 2024 | 5 |
Since 2021 (last 5 years) | 7 |
Since 2016 (last 10 years) | 16 |
Since 2006 (last 20 years) | 41 |
Descriptor
Test Construction | 41 |
Test Selection | 18 |
Evaluation Methods | 10 |
Foreign Countries | 10 |
Test Items | 9 |
Test Validity | 9 |
Testing | 9 |
Psychometrics | 8 |
Scores | 8 |
Selection Criteria | 7 |
Computer Assisted Testing | 6 |
More ▼ |
Source
Author
Publication Type
Education Level
Audience
Teachers | 2 |
Location
Australia | 2 |
Canada | 2 |
Netherlands | 2 |
Arizona | 1 |
Denmark | 1 |
European Union | 1 |
Indonesia | 1 |
Ireland | 1 |
New Zealand | 1 |
Norway | 1 |
Poland | 1 |
More ▼ |
Laws, Policies, & Programs
No Child Left Behind Act 2001 | 3 |
Every Student Succeeds Act… | 1 |
Individuals with Disabilities… | 1 |
Assessments and Surveys
National Assessment of… | 2 |
Measures of Academic Progress | 1 |
Program for International… | 1 |
Test of Adult Basic Education | 1 |
Wide Range Achievement Test | 1 |
Woodcock Johnson Tests of… | 1 |
What Works Clearinghouse Rating
Davis, Mark A.; Philip, Jestine; Walker, Laura – Management Teaching Review, 2022
This article outlines an active learning project that gives students hands-on experience in developing an undergraduate situational judgment test. The five-part activity models the process for constructing a situational judgment test--a tool commonly used for employee selection in organizations. The project is designed to help students assimilate…
Descriptors: Undergraduate Students, Situational Tests, Active Learning, Selection Tools
Anders Holm; Anders Hjorth-Trolle; Robert Andersen – Sociological Methods & Research, 2025
Lagged dependent variables (LDVs) are often used as predictors in ordinary least squares (OLS) models in the social sciences. Although several estimators are commonly employed, little is known about their relative merits in the presence of classical measurement error and different longitudinal processes. We assess the performance of four commonly…
Descriptors: Elementary Education, Scores, Error of Measurement, Predictor Variables
Andrew P. Jaciw – American Journal of Evaluation, 2025
By design, randomized experiments (XPs) rule out bias from confounded selection of participants into conditions. Quasi-experiments (QEs) are often considered second-best because they do not share this benefit. However, when results from XPs are used to generalize causal impacts, the benefit from unconfounded selection into conditions may be offset…
Descriptors: Elementary School Students, Elementary School Teachers, Generalization, Test Bias
Shun-Fu Hu; Amery D. Wu; Jake Stone – Journal of Educational Measurement, 2025
Scoring high-dimensional assessments (e.g., > 15 traits) can be a challenging task. This paper introduces the multilabel neural network (MNN) as a scoring method for high-dimensional assessments. Additionally, it demonstrates how MNN can score the same test responses to maximize different performance metrics, such as accuracy, recall, or…
Descriptors: Tests, Testing, Scores, Test Construction
Stephen G. Sireci; Javier Suárez-Álvarez; April L. Zenisky; Maria Elena Oliveri – Grantee Submission, 2024
The goal in personalized assessment is to best fit the needs of each individual test taker, given the assessment purposes. Design-In-Real-Time (DIRTy) assessment reflects the progressive evolution in testing from a single test, to an adaptive test, to an adaptive assessment "system." In this paper, we lay the foundation for DIRTy…
Descriptors: Educational Assessment, Student Needs, Test Format, Test Construction
Stephen G. Sireci; Javier Suárez-Álvarez; April L. Zenisky; Maria Elena Oliveri – Educational Measurement: Issues and Practice, 2024
The goal in personalized assessment is to best fit the needs of each individual test taker, given the assessment purposes. Design-in-Real-Time (DIRTy) assessment reflects the progressive evolution in testing from a single test, to an adaptive test, to an adaptive assessment "system." In this article, we lay the foundation for DIRTy…
Descriptors: Educational Assessment, Student Needs, Test Format, Test Construction
Ravand, Hamdollah; Baghaei, Purya – International Journal of Testing, 2020
More than three decades after their introduction, diagnostic classification models (DCM) do not seem to have been implemented in educational systems for the purposes they were devised. Most DCM research is either methodological for model development and refinement or retrofitting to existing nondiagnostic tests and, in the latter case, basically…
Descriptors: Classification, Models, Diagnostic Tests, Test Construction
Fink, Arlene – SAGE Publications Ltd (CA), 2016
Packed with new topics that reflect today's challenges, the Sixth Edition of the bestselling "How to Conduct Surveys" guides readers through the process of developing their own rigorous surveys and evaluating the credibility and transparency of surveys created by others. Offering practical, step-by-step advice and written in the same…
Descriptors: Surveys, Guides, Research Methodology, Test Construction
Claudia Gentile; Eric Brown; Lauren Conte; Bill Drewett; Will Fisher; Abrea Greene; Susan Pachikara – NORC at the University of Chicago, 2022
Educators and researchers highlight the important role that math plays as a consequential gateway to upward mobility in the United States (and globally). The purpose of this study was to develop a survey for teens in middle school and high school, to capture information about their math mindsets, identities, experiences studying math, and…
Descriptors: Mathematics Education, Adolescents, Middle School Students, High School Students
Clement, Laurence; Dorman, Jennie B.; McGee, Richard – CBE - Life Sciences Education, 2020
We describe here the development and validation of the Academic Career Readiness Assessment (ACRA) rubric, an instrument that was designed to provide more equity in mentoring, transparency in hiring, and accountability in training of aspiring faculty in the biomedical life sciences. We report here the results of interviews with faculty at 20 U.S.…
Descriptors: Measures (Individuals), Test Construction, Medical School Faculty, Biomedicine
Larson, Anne L.; An, Zhe Gigi; Wood, Carla; Uchikoshi, Yuuko; Cycyk, Lauren M.; Scheffner Hammer, Carol; Escobar, Kelly; Roberts, Kate – Topics in Early Childhood Special Education, 2020
The social validity of intervention research has been emphasized in special education and related fields for decades. There is relatively little focus on social validity that considers culturally and linguistically diverse populations. Eleven articles met the inclusionary criteria for this systematic review and were evaluated to describe social…
Descriptors: Intervention, Validity, Cultural Differences, Bilingualism
Tangen, Jodi L.; Borders, DiAnne – Counselor Education and Supervision, 2016
To date, a comprehensive review of supervisory relationship measures has yet to be published. In this article, the authors explore conceptualizations of the supervisory relationship, describe and critique 11 measures, provide recommendations for researchers and practitioners when selecting measures, and offer suggestions regarding future measure…
Descriptors: Supervisor Supervisee Relationship, Psychometrics, Concept Formation, Outcome Measures
Wright, Christian D.; Huang, Austin L.; Cooper, Katelyn M.; Brownell, Sara E. – International Journal for the Scholarship of Teaching and Learning, 2018
College instructors in the United States usually make their own decisions about how to design course exams. Even though summative course exams are well known to be important to student success, we know little about the decision making of instructors when designing course exams. To probe how instructors design exams for introductory biology, we…
Descriptors: College Faculty, Science Teachers, Science Tests, Teacher Made Tests
Zulaiha, Siti; Mulyono, Herri – Cogent Education, 2020
The training of teachers is one of the most critical factors in improving the quality of teaching and assessment in the classroom. EFL teachers need to be literate in language assessment; this can be achieved through training. A total of 147 Junior High School EFL teachers was surveyed to identify their training needs in assessmen. Semi-structured…
Descriptors: Junior High School Teachers, Teacher Attitudes, Language Teachers, English (Second Language)
Demir, Yusuf; Ertas, Abdullah – Reading Matrix: An International Online Journal, 2014
Coursebook evaluation helps practitioners decide on the most appropriate coursebook to be exploited. Moreover, evaluation process enables to predict the potential strengths and weaknesses of a given coursebook. Checklist method is probably the most widely adopted way of judging coursebooks and there are plenty of ELT coursebook evaluation…
Descriptors: Check Lists, Course Evaluation, Instructional Material Evaluation, Media Selection