Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 1 |
Since 2016 (last 10 years) | 13 |
Since 2006 (last 20 years) | 31 |
Descriptor
Evaluation Methods | 75 |
Statistical Analysis | 75 |
Test Construction | 75 |
Test Validity | 21 |
Foreign Countries | 19 |
Test Reliability | 17 |
Student Evaluation | 16 |
Test Items | 15 |
Measurement Techniques | 13 |
Models | 13 |
Evaluation Criteria | 9 |
More ▼ |
Source
Author
Hambleton, Ronald K. | 3 |
Rogers, H. Jane | 2 |
Abayeva, Nella F. | 1 |
Ahmad, Farhan | 1 |
AlFallay, Ibrahim S. | 1 |
Ames, Russell | 1 |
Andrew, Barbara J. | 1 |
Barnett, Jerrold E. | 1 |
Bishop, M. J. | 1 |
Bissember, Alex C. | 1 |
Braun, Henry | 1 |
More ▼ |
Publication Type
Education Level
Audience
Researchers | 3 |
Practitioners | 2 |
Students | 2 |
Teachers | 2 |
Location
California | 3 |
European Union | 2 |
Turkey | 2 |
United Kingdom | 2 |
Australia | 1 |
Canada | 1 |
Finland | 1 |
Indonesia | 1 |
Israel | 1 |
Italy | 1 |
Pennsylvania | 1 |
More ▼ |
Laws, Policies, & Programs
Elementary and Secondary… | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Widén, Gunilla; Ahmad, Farhan; Nikou, Shahrokh; Ryan, Bruce; Cruickshank, Peter – Journal of Information Literacy, 2021
This paper focuses information literacy (IL) from a methodological perspective, addressing quantitative IL measures, suitable for evaluating the role of IL in supporting work activities. So far, IL in workplace contexts has mostly been studied using qualitative methods, designed for studying situational and context-dependent practices. Therefore…
Descriptors: Information Literacy, Workplace Learning, Test Construction, Evaluation Methods
Pullen, Reyne; Thickett, Stuart C.; Bissember, Alex C. – Chemistry Education Research and Practice, 2018
In chemistry curricula, both the role of the laboratory program and the method of assessment used are subject to scrutiny and debate. The ability to identify clearly defined competencies for the chemistry laboratory program is crucial, given the numerous other disciplines that rely on foundation-level chemistry knowledge and practical skills. In…
Descriptors: Undergraduate Study, College Science, Chemistry, Science Laboratories
Varela, Otmar; Mead, Esther – Journal of Education for Business, 2018
Popular teamwork assessments have been strongly criticized on the grounds of poor psychometric properties and their disconnect with conceptual models of teamwork. These issues raise concerns with respect to our ability to evaluate efforts devoted to advancing teamwork in academia. We report the development of a teamwork assessment that builds on…
Descriptors: Teamwork, Evaluation Methods, Test Validity, Psychometrics
Ward, Phillip; Dervent, Fatih; Lee, Yun Soo; Ko, Bomna; Kim, Insook; Tao, Wang – Journal of Teaching in Physical Education, 2017
Purpose: This study reports on our efforts toward extending the conceptual understanding of content development in physical education by validating content maps as a measurement tool, examining new categories of instructional tasks to describe content development and validating formulae that can be used to evaluate depth of content development.…
Descriptors: Physical Education, Program Validation, Curriculum Development, Course Evaluation
Cecile C. Dietrich; Eric J. Lichtenberger – Sage Research Methods Cases, 2016
We present a case study of the process through which a methodology was developed and applied to a quasi-experimental research study that employed propensity score matching. Methodological decisions are discussed and summarized, including an explanation of the approaches selected for each step in the study as well as rationales for these…
Descriptors: Test Construction, Quasiexperimental Design, Community Colleges, Fees
Ganzfried, Sam; Yusuf, Farzana – Education Sciences, 2018
A problem faced by many instructors is that of designing exams that accurately assess the abilities of the students. Typically, these exams are prepared several days in advance, and generic question scores are used based on rough approximation of the question difficulty and length. For example, for a recent class taught by the author, there were…
Descriptors: Weighted Scores, Test Construction, Student Evaluation, Multiple Choice Tests
Undersander, Molly A.; Lund, Travis J.; Langdon, Laurie S.; Stains, Marilyne – Chemistry Education Research and Practice, 2017
The design of assessment tools is critical to accurately evaluate students' understanding of chemistry. Although extensive research has been conducted on various aspects of assessment tool design, few studies in chemistry have focused on the impact of the order in which questions are presented to students on the measurement of students'…
Descriptors: Test Construction, Scientific Concepts, Concept Formation, Science Education
Wesolowski, Brian C. – International Journal of Music Education, 2017
The purpose of this study was to develop a valid and reliable rating scale to assess jazz rhythm sections in the context of jazz big band performance. The research questions that guided this study included: (a) what central factors contribute to the assessment of a jazz rhythm section? (b) what items should be used to describe and assess a jazz…
Descriptors: Test Construction, Rating Scales, Music, Evaluation Methods
Fuller, Matthew B.; Skidmore, Susan T.; Bustamante, Rebecca M.; Holzweiss, Peggy C. – Review of Higher Education, 2016
Although touted as beneficial to student learning, cultures of assessment have not been examined adequately using validated instruments. Using data collected from a stratified, random sample (N = 370) of U.S. institutional research and assessment directors, the models tested in this study provide empirical support for the value of using the…
Descriptors: Higher Education, Administrators, Evaluation Methods, Attitude Measures
Kalkan, Ömür Kaya; Kelecioglu, Hülya – Educational Sciences: Theory and Practice, 2016
Linear factor analysis models used to examine constructs underlying the responses are not very suitable for dichotomous or polytomous response formats. The associated problems cannot be eliminated by polychoric or tetrachoric correlations in place of the Pearson correlation. Therefore, we considered parameters obtained from the NOHARM and FACTOR…
Descriptors: Sample Size, Nonparametric Statistics, Factor Analysis, Correlation
Golovachyova, Viktoriya N.; Menlibekova, Gulbakhyt Zh.; Abayeva, Nella F.; Ten, Tatyana L.; Kogaya, Galina D. – International Journal of Environmental and Science Education, 2016
Using computer-based monitoring systems that rely on tests could be the most effective way of knowledge evaluation. The problem of objective knowledge assessment by means of testing takes on a new dimension in the context of new paradigms in education. The analysis of the existing test methods enabled us to conclude that tests with selected…
Descriptors: Expertise, Computer Assisted Testing, Student Evaluation, Knowledge Level
Singh-Ackbarali, Dimple; Maharaj, Rohanie – Journal of Curriculum and Teaching, 2014
This paper discusses the comprehensive and practical training that was delivered to students in a university classroom on how sensory evaluation can be used to determine acceptability of food products. The report presents how students used their training on sensory evaluation methods and analysis and applied it to improving and predicting…
Descriptors: Foreign Countries, Sensory Experience, Evaluation Methods, Test Construction
Tidén, Anna; Lundqvist, Carolina; Nyberg, Marie – Measurement in Physical Education and Exercise Science, 2015
This study presents the development process and initial validation of the NyTid test, a process-oriented movement assessment tool for compulsory school pupils. A sample of 1,260 (627 girls and 633 boys; mean age of 14.39) Swedish school children participated in the study. In the first step, exploratory factor analyses (EFAs) were performed in…
Descriptors: Test Construction, Test Validity, Psychomotor Skills, Student Evaluation
Lin, Pei-Ying; Lin, Yu-Cheng – Educational and Psychological Measurement, 2014
This exploratory study investigated potential sources of setting accommodation resulting in differential item functioning (DIF) on math and reading assessments for examinees with varied learning characteristics. The examinees were those who participated in large-scale assessments and were tested in either standardized or accommodated testing…
Descriptors: Test Bias, Multivariate Analysis, Testing Accommodations, Mathematics Tests
AlFallay, Ibrahim S. – International Journal of Instruction, 2018
This study investigates to what extend do teachers of English as a school subject (ESS) in Saudi schools follow recommendations and guidelines suggested by language testing specialists in developing tables of specifications and preparing blueprints to their formative and summative language tests. To answer the study questions, a thirteen-statement…
Descriptors: Foreign Countries, English Teachers, Second Language Instruction, English (Second Language)