Publication Date
In 2025 | 0 |
Since 2024 | 1 |
Since 2021 (last 5 years) | 3 |
Since 2016 (last 10 years) | 11 |
Since 2006 (last 20 years) | 40 |
Descriptor
Evaluation Methods | 75 |
Item Analysis | 75 |
Test Validity | 75 |
Test Reliability | 39 |
Test Construction | 31 |
Test Items | 16 |
Psychometrics | 15 |
Statistical Analysis | 15 |
Factor Analysis | 14 |
Measurement Techniques | 12 |
Student Evaluation | 11 |
More ▼ |
Source
Author
Hambleton, Ronald K. | 2 |
Klein, Stephen P. | 2 |
Akarsu, Bayram | 1 |
Alghazali, Tawfeeq | 1 |
Anderson, Ronald E. | 1 |
Baker, Eva L. | 1 |
Baril, G. L. | 1 |
Barniol, Pablo | 1 |
Bart, William M. | 1 |
Bursac, Zoran | 1 |
Burstein, Leigh | 1 |
More ▼ |
Publication Type
Education Level
Audience
Researchers | 1 |
Teachers | 1 |
Location
Greece | 2 |
Singapore | 2 |
Arkansas | 1 |
Australia | 1 |
California | 1 |
China (Beijing) | 1 |
Mexico | 1 |
Michigan | 1 |
Pennsylvania | 1 |
Texas | 1 |
Turkey | 1 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Yu-Sheng Su; Xiao Wang; Li Zhao – IEEE Transactions on Education, 2024
Research Purpose and Contribution: The study aimed to construct an evaluation framework for assessing pupils' computational thinking (CT) during classroom learning problem solving. As a self-report evaluation scale for pupils, this evaluation framework further enriched the CT assessment instruments for pupils and provided a specialized instrument…
Descriptors: Computation, Thinking Skills, Student Evaluation, Evaluation Methods
Parry, James R. – Online Submission, 2020
This paper presents research and provides a method to ensure that parallel assessments, that are generated from a large test-item database, maintain equitable difficulty and content coverage each time the assessment is presented. To maintain fairness and validity it is important that all instances of an assessment, that is intended to test the…
Descriptors: Culture Fair Tests, Difficulty Level, Test Items, Test Validity
Thapelo Ncube Whitfield – ProQuest LLC, 2021
Student Experience surveys are used to measure student attitudes towards their campus as well as to initiate conversations for institutional change. Validity evidence to support the interpretations of these surveys' results, however, is lacking. The first purpose of this study was to compare three Differential Item Functioning (DIF) methods on…
Descriptors: College Students, Student Surveys, Student Experience, Student Attitudes
Mohammed, Aisha; Dawood, Abdul Kareem Shareef; Alghazali, Tawfeeq; Kadhim, Qasim Khlaif; Sabti, Ahmed Abdulateef; Sabit, Shaker Holh – International Journal of Language Testing, 2023
Cognitive diagnostic models (CDMs) have received much interest within the field of language testing over the last decade due to their great potential to provide diagnostic feedback to all stakeholders and ultimately improve language teaching and learning. A large number of studies have demonstrated the application of CDMs on advanced large-scale…
Descriptors: Reading Comprehension, Reading Tests, Language Tests, English (Second Language)
Jorgensen, Maribeth F.; Schweinle, William E. – Professional Counselor, 2018
The 68-item Research Identity Scale (RIS) was informed through qualitative exploration of research identity development in master's-level counseling students and practitioners. Classical psychometric analyses revealed the items had strong validity and reliability and a single factor. A one-parameter Rasch analysis and item review was used to…
Descriptors: Qualitative Research, Counseling Services, Counselor Training, Psychometrics
Feranchak, Bret; Deiger, Megan – AERA Online Paper Repository, 2017
Increasingly content area projects and programs at the K-12 level, such as in mathematics, involve a programmatic component or project emphasis on developing "teacher leadership". However, there is no consistent definition or framework for this construct and even fewer validated tools for measuring it. This paper describes our efforts in…
Descriptors: Teacher Leadership, Mathematics Instruction, Guidelines, Elementary Secondary Education
Hartley, James – Psychology Teaching Review, 2017
In this article, Hartley notes the difficulties of using questionnaires to assess the efficiency of new instructional methods and highlights nine issues that researchers must consider. Hartley continues the discussion about the use of questionnaires and suggests that psychology teachers can help improve the teaching of psychology by drawing…
Descriptors: Questionnaires, Instructional Innovation, Instructional Effectiveness, Teaching Methods
Rigney, Alexander M. – Journal of Psychoeducational Assessment, 2019
This report reviews the "Social Skills Improvement System Social-Emotional Learning Edition" (SSIS SEL; Gresham & Elliott, 2017), a multicomponent rating scale that includes a criterion and norm-referenced measure of social-emotional and academic functioning--based on a reformulation of the "Social Skills Improvement…
Descriptors: Rating Scales, Interpersonal Competence, Social Development, Emotional Development
Cable, John – Mathematics Education Research Journal, 2015
This article provides a critical evaluation of a technique of analysis, the "Social Activity Method," recently offered by Dowling (2013) as a "gift" to mathematics education. The method is found to be inadequate, firstly, because it employs a dichotomy (between "expression" and "content") instead of a finer…
Descriptors: Mathematics, Mathematics Education, Evaluation Methods, Criticism
Todd, Amber; Romine, William L.; Cook Whitt, Katahdin – Science Education, 2017
We describe the development, validation, and use of the "Learning Progression-Based Assessment of Modern Genetics" (LPA-MG) in a high school biology context. Items were constructed based on a current learning progression framework for genetics (Shea & Duncan, 2013; Todd & Kenyon, 2015). The 34-item instrument, which was tied to…
Descriptors: Genetics, Science Instruction, High School Students, Evaluation Methods
Hampton, David D.; Lembke, Erica S. – Reading & Writing Quarterly, 2016
The purpose of this study was to examine 4 early writing measures used to monitor the early writing progress of 1st-grade students. We administered the measures to 23 1st-grade students biweekly for a total of 16 weeks. We obtained 3-min samples and conducted analyses for each 1-min increment. We scored samples using 2 different methods: correct…
Descriptors: Progress Monitoring, Curriculum Based Assessment, Writing Tests, Outcome Measures
Lee, Jeong-Sook; Kim, Sung-Wan – Journal of Educational Computing Research, 2015
The purpose of this study is to develop and validate an evaluation tool of educational apps for smart education. Based on literature reviews, a potential model for evaluating educational apps was suggested. An evaluation tool consisting of 57 survey items was delivered to 156 students in middle and high schools. An exploratory factor analysis was…
Descriptors: Educational Technology, Courseware, Computer Software Evaluation, Test Construction
Öztürk-Gübes, Nese; Kelecioglu, Hülya – Educational Sciences: Theory and Practice, 2016
The purpose of this study was to examine the impact of dimensionality, common-item set format, and different scale linking methods on preserving equity property with mixed-format test equating. Item response theory (IRT) true-score equating (TSE) and IRT observed-score equating (OSE) methods were used under common-item nonequivalent groups design.…
Descriptors: Test Format, Item Response Theory, True Scores, Equated Scores
Witzig, Stephen B.; Rebello, Carina M.; Siegel, Marcelle A.; Freyermuth, Sharyn K.; Izci, Kemal; McClure, Bruce – Research in Science Education, 2014
Identifying students' conceptual scientific understanding is difficult if the appropriate tools are not available for educators. Concept inventories have become a popular tool to assess student understanding; however, traditionally, they are multiple choice tests. International science education standard documents advocate that assessments…
Descriptors: Test Construction, Scientific Concepts, Concept Formation, Knowledge Level
Preliminary Psychometric Properties of the CFTIndex in Greece: The Perspective of Physical Education
Konstantinidou, Elisavet; Zisi, Vasiliki; Michalopoulou, Maria – Early Child Development and Care, 2015
The promotion of creativity has been proved a necessity in the education process. The research on creativity in physical education though is very limited worldwide and there is a lack of multilingual evaluation instruments. In the current study, the Creativity Fostering Teacher Index (CFTIndex) was translated and culturally adapted in Greek and…
Descriptors: Psychometrics, Physical Education, Physical Education Teachers, Creativity