Publication Date
In 2025 | 1 |
Since 2024 | 3 |
Since 2021 (last 5 years) | 7 |
Since 2016 (last 10 years) | 10 |
Since 2006 (last 20 years) | 26 |
Descriptor
Foreign Countries | 32 |
Test Reliability | 32 |
Test Theory | 32 |
Test Validity | 16 |
Item Response Theory | 14 |
Test Construction | 10 |
Psychometrics | 9 |
Test Items | 9 |
Scores | 7 |
Error of Measurement | 5 |
Evaluation Methods | 5 |
More ▼ |
Source
Author
He, Qingping | 2 |
Abdullah Faruk Kilic | 1 |
Abela, John | 1 |
Acevedo, Daniela | 1 |
Aktas, Mehtap | 1 |
Alessandri, Guido | 1 |
Anita Padmanabhanunni | 1 |
Arias, Benito | 1 |
Asiret, Semih | 1 |
Baird, Jo-Anne | 1 |
Bernholt, S. | 1 |
More ▼ |
Publication Type
Journal Articles | 29 |
Reports - Research | 26 |
Reports - Evaluative | 5 |
Reports - Descriptive | 1 |
Tests/Questionnaires | 1 |
Education Level
Audience
Practitioners | 1 |
Teachers | 1 |
Location
United Kingdom (England) | 5 |
Canada | 4 |
Spain | 3 |
Singapore | 2 |
Turkey | 2 |
Turkey (Ankara) | 2 |
United States | 2 |
Australia | 1 |
Chile | 1 |
Egypt | 1 |
Finland (Helsinki) | 1 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
Center for Epidemiologic… | 1 |
Rosenberg Self Esteem Scale | 1 |
What Works Clearinghouse Rating
Osman Tat; Abdullah Faruk Kilic – Turkish Online Journal of Distance Education, 2024
The widespread availability of internet access in daily life has resulted in a greater acceptance of online assessment methods. E-assessment platforms offer various features such as randomizing questions and answers, utilizing extensive question banks, setting time limits, and managing access during online exams. Electronic assessment enables…
Descriptors: Test Construction, Test Validity, Test Reliability, Anxiety
Nicolas Rochat; Laurent Lima; Pascal Bressoux – Journal of Psychoeducational Assessment, 2025
Inference is considered an important factor in comprehension models and has been described as a causal factor in predicting comprehension. To date, specific tests for inference are rare and often rely on specific thematic texts. This reliance on thematic inference may raise some concerns as inference is related to prior text-specific knowledge.…
Descriptors: Inferences, Reading Comprehension, Reading Tests, Test Reliability
Ser Ming Mark Lee; Wei Cheng Liu – Asia Pacific Journal of Education, 2024
Programme evaluation has developed tremendously over the past 50 years, with a proliferation of evaluation research, an increase in the institutionalization of evaluation, and growth in the professionalization of evaluation. However, existing research and developments are still largely in North America, Europe, Australia, and New Zealand, with…
Descriptors: Foreign Countries, Evaluation Research, Evaluation Methods, Evaluation Criteria
Tyrone B. Pretorius; P. Paul Heppner; Anita Padmanabhanunni; Serena Ann Isaacs – SAGE Open, 2023
In previous studies, problem solving appraisal has been identified as playing a key role in promoting positive psychological well-being. The Problem Solving Inventory is the most widely used measure of problem solving appraisal and consists of 32 items. The length of the instrument, however, may limit its applicability to large-scale surveys…
Descriptors: Problem Solving, Measures (Individuals), Test Construction, Item Response Theory
Huebner, Alan; Skar, Gustaf B. – Practical Assessment, Research & Evaluation, 2021
Writing assessments often consist of students responding to multiple prompts, which are judged by more than one rater. To establish the reliability of these assessments, there exist different methods to disentangle variation due to prompts and raters, including classical test theory, Many Facet Rasch Measurement (MFRM), and Generalizability Theory…
Descriptors: Error of Measurement, Test Theory, Generalizability Theory, Item Response Theory
Ibrahim Kasujja; Hugo Melgar-Quinonez; Joweria Nambooze – SAGE Open, 2023
Background: School feeding programs' evaluation requires the measurement of food insecurity, a more objective indicator, within school in low-income countries. The Global Child Nutrition Foundation (GCNF) uses subjective indicators to report school feeding coverage rates across many countries that participate in the global survey of school meal…
Descriptors: Hunger, Food, Program Effectiveness, Psychometrics
Yasar, Metin – International Journal of Assessment Tools in Education, 2019
The main purpose of this study is to develop a perceived stress scale based on Classical Test Theory (CTT) and Graded Response Model (GRM); to compare the parameters of the items in the scale that are tried to be developed according to both models, and to determine under which theory the measurement tool produces more reliable and valid results…
Descriptors: Affective Measures, Anxiety, Test Theory, Test Construction
Chin, Huan; Chew, Cheng Meng; Lim, Hooi Lian; Thien, Lei Mee – International Journal of Science and Mathematics Education, 2022
Cognitive Diagnostic Assessment (CDA) is an alternative assessment which can give a clear picture of pupils' learning process and cognitive structures to education stakeholders so that appropriate instructional strategies can be designed to tailored pupils' needs. Coincide with this function, the Ordered Multiple-Choice (OMC) items were…
Descriptors: Mathematics Instruction, Mathematics Tests, Multiple Choice Tests, Diagnostic Tests
Uzun, N. Bilge; Aktas, Mehtap; Asiret, Semih; Yormaz, Seha – Asian Journal of Education and Training, 2018
The goal of this study is to determine the reliability of the performance points of dentistry students regarding communication skills and to examine the scoring reliability by generalizability theory in balanced random and fixed facet (mixed design) data, considering also the interactions of student, rater and duty. The study group of the research…
Descriptors: Foreign Countries, Generalizability Theory, Scores, Test Reliability
Ikah, December S. K.; Finn, Gabrielle M.; Swamy, Meenakshi; White, Pamela M.; McLachlan, John C. – Anatomical Sciences Education, 2015
Although medical curricula now adopt an integrated teaching approach, this is not adequately reflected in assessment of anatomy knowledge and skills. In this study, we aimed to explore the impact of the addition of clinical vignette to item stems on students' performance in anatomy practical examinations. In this study, 129 undergraduate medical…
Descriptors: Vignettes, Anatomy, Medical Education, Medical Students
Retnawati, Heri – Turkish Online Journal of Educational Technology - TOJET, 2015
This study aimed to compare the accuracy of the test scores as results of Test of English Proficiency (TOEP) based on paper and pencil test (PPT) versus computer-based test (CBT). Using the participants' responses to the PPT documented from 2008-2010 and data of CBT TOEP documented in 2013-2014 on the sets of 1A, 2A, and 3A for the Listening and…
Descriptors: Scores, Accuracy, Computer Assisted Testing, English (Second Language)
Meneses, Alejandra; Uccelli, Paola; Santelices, María Verónica; Ruiz, Marcela; Acevedo, Daniela; Figueroa, Javiera – Reading Research Quarterly, 2018
Although literacy achievement has improved in Chile, adolescents' underperformance in reading comprehension is still a serious concern. In English, core academic-language skills (CALS) have been found to significantly predict reading comprehension, even controlling for academic vocabulary knowledge. CALS are high-utility language skills that…
Descriptors: Reading Achievement, Foreign Countries, Academic Discourse, Reading Comprehension
Taskin, V.; Bernholt, S.; Parchmann, I. – Chemistry Education Research and Practice, 2015
Chemical representations play an important role in helping learners to understand chemical contents. Thus, dealing with chemical representations is a necessity for learning chemistry, but at the same time, it presents a great challenge to learners. Due to this great challenge, it is not surprising that numerous national and international studies…
Descriptors: Student Teachers, Knowledge Level, Science Instruction, Chemistry
Bramley, Tom; Dhawan, Vikas – Research Papers in Education, 2013
This paper discusses the issues involved in calculating indices of composite reliability for "modular" or "unitised" assessments of the kind used in GCSEs, AS and A level examinations in England. The increasingly widespread use of on-screen marking has meant that the item-level data required for calculating indices of…
Descriptors: Foreign Countries, Exit Examinations, Secondary Education, Test Reliability
Grigg, Kaine; Manderson, Lenore – Australian Educational and Developmental Psychologist, 2015
Existing Australian measures of racist attitudes focus on single groups or have not been validated across the lifespan. To redress this, the present research aimed to develop and validate a measure of racial, ethnic, cultural and religious acceptance--the Australian Racism, Acceptance, and Cultural-Ethnocentrism Scale (RACES)--for use with…
Descriptors: Racial Bias, Racial Attitudes, Foreign Countries, Ethnocentrism