Lewis, Jennifer; Lim, Hwanggyu; Padellaro, Frank; Sireci, Stephen G.; Zenisky, April L. – Educational Measurement: Issues and Practice, 2022
Setting cut scores on multistage tests (MSTs) is difficult, particularly when the test spans several grade levels, and the selection of items from MST panels must reflect the operational test specifications. In this study, we describe, illustrate, and evaluate three methods for mapping panelists' Angoff ratings into cut scores on the scale underlying an MST. The…
Descriptors: Cutting Scores, Adaptive Testing, Test Items, Item Analysis
Thayaamol Upapong; Apantee Poonputta – Educational Process: International Journal, 2025
Background/purpose: The purposes of this research are to develop a reliable and valid assessment tool for measuring systems thinking skills in upper primary students in Thailand and to establish a normative criterion for evaluating their systems thinking abilities based on educational standards. Materials/methods: The study followed a three-phase…
Descriptors: Thinking Skills, Elementary School Students, Measures (Individuals), Foreign Countries
Shasha Chen; Shaohui Chi; Zuhao Wang – Journal of Baltic Science Education, 2025
Interdisciplinary thinking is critical for equipping students to apply scientific knowledge and tackle societal challenges across various disciplines, which has been recognized as a key objective of twenty-first century science education. However, research on effective interdisciplinary assessment in secondary school science education is still…
Descriptors: Thinking Skills, Interdisciplinary Approach, Science Instruction, Grade 7
Koyuncu, Ilhan; Kilic, Abdullah Faruk – International Journal of Assessment Tools in Education, 2021
In exploratory factor analysis, although the researchers decide which items belong to which factors by considering statistical results, the decisions taken sometimes can be subjective in case of having items with similar factor loadings and complex factor structures. The aim of this study was to examine the validity of classifying items into…
Descriptors: Classification, Graphs, Factor Analysis, Decision Making
Tu, Thuy Thi Minh – ProQuest LLC, 2023
The study aimed to elicit information from Vietnamese EFL university instructors about their knowledge and skills regarding the principles, theory, and practices of language assessment by means of revision and validation of the Language Assessment Literacy--Revised Vietnam (LAL-RV), which was previously developed by Kremmel and Harding (2020). A…
Descriptors: English (Second Language), Second Language Learning, Second Language Instruction, College Faculty
Clairmont, Albert Anthony; Katz, Daniel; Wilton, Mike – AERA Online Paper Repository, 2021
This study demonstrates the importance of Rasch Measurement Theory (RMT) in program evaluation when outcome measures need to be constructed from scratch. The paper introduces typical measure validation methods presented in program evaluation texts and discusses room for improvement. The study then illustrates how the seamless transitions from…
Descriptors: Program Evaluation, Measurement Techniques, Validity, Ethnography
Ronen Kasperski; Merav E. Hemi – Assessment in Education: Principles, Policy & Practice, 2024
Educators' Social-Emotional Learning (SEL) is crucial for fostering positive, supportive, and effective learning environments. This study seeks to improve SEL assessment among educators by addressing limitations of the previous EduSEL questionnaire. Study 1 established convergent validity by comparing EduSEL with a validated SEL questionnaire.…
Descriptors: Social Emotional Learning, Factor Structure, Factor Analysis, Teacher Attitudes
Smith, Mark; Breakstone, Joel; Wineburg, Sam – Cognition and Instruction, 2019
This article reports a validity study of History Assessments of Thinking (HATs), which are short, constructed-response assessments of historical thinking. In particular, this study focuses on aspects of cognitive validity, which is an examination of whether assessments tap the intended constructs. Think-aloud interviews with 26 high school…
Descriptors: History, History Instruction, Thinking Skills, Multiple Choice Tests
Schat, Esther; van der Knaap, Ewout; de Graaff, Rick – Intercultural Communication Education, 2021
Intercultural competence is a crucial element of foreign language education, yet the multifaceted nature of this construct makes it inherently difficult to assess. Although several tools for evaluating intercultural competence currently exist, research on their use in secondary school settings is scarce. This study reports on the development and…
Descriptors: Intercultural Communication, Communicative Competence (Languages), Second Language Learning, Second Language Instruction
Beauchamp, David; Constantinou, Filio – Research Matters, 2020
Assessment is a useful process as it provides various stakeholders (e.g., teachers, parents, government, employers) with information about students' competence in a particular subject area. However, for the information generated by assessment to be useful, it needs to support valid inferences. One factor that can undermine the validity of…
Descriptors: Computational Linguistics, Inferences, Validity, Language Usage
Lopata, Christopher; Donnelly, James P.; Rodgers, Jonathan D.; Thomeer, Marcus L.; Booth, Adam J. – Autism: The International Journal of Research and Practice, 2020
This study assessed the reliability and criterion-related validity of teacher ratings on the Adapted Skillstreaming Checklist for a sample of 133 children, aged 6-11 years, with autism spectrum disorder (without intellectual disability). Internal consistency for the total sample was 0.93. For a subsample, test-retest reliability was very good (r =…
Descriptors: Check Lists, Validity, Reliability, Teacher Attitudes
Jaikaran-Doe, Seeta; Doe, Peter Edward – Australian Educational Computing, 2015
A number of validated survey instruments for assessing technological pedagogical content knowledge (TPACK) do not accurately discriminate between the seven elements of the TPACK framework, particularly technological content knowledge (TCK) and technological pedagogical knowledge (TPK). By posing simple questions that assess technological,…
Descriptors: Technological Literacy, Pedagogical Content Knowledge, Surveys, Evaluation Methods
Hitchcock, John H.; Johanson, George A. – Research in the Schools, 2015
Understanding the reason(s) for Differential Item Functioning (DIF) in the context of measurement is difficult. Although identifying potential DIF items is typically a statistical endeavor, understanding the reasons for DIF (and item repair or replacement) might require investigations that can be informed by qualitative work. Such work is…
Descriptors: Mixed Methods Research, Test Items, Item Analysis, Measurement
Kim, Do-Hong; Lambert, Richard G.; Durham, Sean; Burts, Diane C. – Early Education and Development, 2018
Research Findings: This study builds on prior work related to the assessment of young dual language learners (DLLs). The purposes of the study were to (a) determine whether latent subgroups of preschool DLLs would replicate those found previously and (b) examine the validity of GOLD® by Teaching Strategies with empirically derived subgroups.…
Descriptors: Preschool Education, Teaching Methods, Bilingualism, Bilingual Education
Choi, Jae-Sung; Lee, Minhong – Research on Social Work Practice, 2014
Objective: This study examined the validity and reliability of a person-directed care (PDC) measure for nursing homes in Korea. Method: Managerial personnel from 223 nursing homes in 2010 and 239 in 2012 were surveyed. Results: Item analysis and exploratory factor analysis for the first sample generated a 33-item PDC measure with eight factors.…
Descriptors: Foreign Countries, Psychometrics, Nursing Homes, Item Analysis