ERIC - Search Results

Showing 1 to 15 of 16 results

Electronic Assessment Anxiety Scale: Development, Validity and Reliability

Peer reviewed
PDF on ERIC

Osman Tat; Abdullah Faruk Kilic – Turkish Online Journal of Distance Education, 2024

The widespread availability of internet access in daily life has resulted in a greater acceptance of online assessment methods. E-assessment platforms offer various features such as randomizing questions and answers, utilizing extensive question banks, setting time limits, and managing access during online exams. Electronic assessment enables…

Descriptors: Test Construction, Test Validity, Test Reliability, Anxiety

Comparison of the Results of the Generalizability Theory with the Inter-Rater Agreement Coefficients

Peer reviewed
PDF on ERIC

Eser, Mehmet Taha; Aksu, Gökhan – International Journal of Curriculum and Instruction, 2022

The agreement between raters is examined within the scope of the concept of "inter-rater reliability". Although there are clear definitions of the concepts of agreement between raters and reliability between raters, there is no clear information about the conditions under which agreement and reliability level methods are appropriate to…

Descriptors: Generalizability Theory, Interrater Reliability, Evaluation Methods, Test Theory

Assessment of Item and Test Parameters: Cosine Similarity Approach

Peer reviewed
PDF on ERIC

Chakrabartty, Satyendra Nath – International Journal of Psychology and Educational Studies, 2021

The paper proposes new measures of the difficulty and discriminating values of binary items, and of tests consisting of such items, and finds their relationships, including estimation of test error variance and thereby test reliability, as per definition, using cosine similarities. The measures use the entire data. Difficulty value of test and item is defined…

Descriptors: Test Items, Difficulty Level, Scores, Test Reliability

Accuracy and Sensitivity of Coefficient Alpha and Its Alternatives with Unidimensional and Contaminated Scales

Peer reviewed

Xiao, Leifeng; Hau, Kit-Tai – Applied Measurement in Education, 2023

We compared coefficient alpha with five alternatives (omega total, omega RT, omega h, GLB, and coefficient H) in two simulation studies. Results showed for unidimensional scales, (a) all indices except omega h performed similarly well for most conditions; (b) alpha is still good; (c) GLB and coefficient H overestimated reliability with small…

Descriptors: Test Theory, Test Reliability, Factor Analysis, Test Length

Evidence for Validity and Reliability of a Research-Based Assessment Instrument on Measurement Uncertainty

Peer reviewed

Gayle Geschwind; Michael Vignal; Marcos D. Caballero; H. J. Lewandowski – Physical Review Physics Education Research, 2024

The Survey of Physics Reasoning on Uncertainty Concepts in Experiments (SPRUCE) was designed to measure students' proficiency with measurement uncertainty concepts and practices across ten different assessment objectives to help facilitate the improvement of laboratory instruction focused on this important topic. To ensure the reliability and…

Descriptors: Measurement, Ambiguity (Context), Scientific Concepts, Physics

The Riddle Knowledge Inference Test (R-Kit)

Peer reviewed

Nicolas Rochat; Laurent Lima; Pascal Bressoux – Journal of Psychoeducational Assessment, 2025

Inference is considered an important factor in comprehension models and has been described as a causal factor in predicting comprehension. To date, specific tests for inference are rare and often rely on specific thematic texts. This reliance on thematic inference may raise some concerns as inference is related to prior text-specific knowledge.…

Descriptors: Inferences, Reading Comprehension, Reading Tests, Test Reliability

Programme Evaluation in Action: Theory to Practice from an Asian Educational Context

Peer reviewed

Ser Ming Mark Lee; Wei Cheng Liu – Asia Pacific Journal of Education, 2024

Programme evaluation has developed tremendously over the past 50 years, with a proliferation of evaluation research, an increase in the institutionalization of evaluation, and growth in the professionalization of evaluation. However, existing research and developments are still largely in North America, Europe, Australia, and New Zealand, with…

Descriptors: Foreign Countries, Evaluation Research, Evaluation Methods, Evaluation Criteria

A Simple Model to Determine the Efficient Duration of Exams

Peer reviewed

Ellis, Jules L. – Educational and Psychological Measurement, 2021

This study develops a theoretical model for the costs of an exam as a function of its duration. Two kinds of costs are distinguished: (1) the costs of measurement errors and (2) the costs of the measurement. Both costs are expressed in time of the student. Based on a classical test theory model, enriched with assumptions on the context, the costs…

Descriptors: Test Length, Models, Error of Measurement, Measurement

The PSI-20: Development of a Viable Short Form Alternative of the Problem Solving Inventory Using Item Response Theory

Peer reviewed

Tyrone B. Pretorius; P. Paul Heppner; Anita Padmanabhanunni; Serena Ann Isaacs – SAGE Open, 2023

In previous studies, problem solving appraisal has been identified as playing a key role in promoting positive psychological well-being. The Problem Solving Inventory is the most widely used measure of problem solving appraisal and consists of 32 items. The length of the instrument, however, may limit its applicability to large-scale surveys…

Descriptors: Problem Solving, Measures (Individuals), Test Construction, Item Response Theory

Conditional Standard Error of Measurement: Classical Test Theory, Generalizability Theory and Many-Facet Rasch Measurement with Applications to Writing Assessment

Peer reviewed
PDF on ERIC

Huebner, Alan; Skar, Gustaf B. – Practical Assessment, Research & Evaluation, 2021

Writing assessments often consist of students responding to multiple prompts, which are judged by more than one rater. To establish the reliability of these assessments, there exist different methods to disentangle variation due to prompts and raters, including classical test theory, Many Facet Rasch Measurement (MFRM), and Generalizability Theory…

Descriptors: Error of Measurement, Test Theory, Generalizability Theory, Item Response Theory

Establishing a Physics Concept Inventory Using Computer Marked Free-Response Questions

Peer reviewed
PDF on ERIC

Parker, Mark A. J.; Hedgeland, Holly; Jordan, Sally E.; Braithwaite, Nicholas St. J. – European Journal of Science and Mathematics Education, 2023

The study covers the development and testing of the alternative mechanics survey (AMS), a modified force concept inventory (FCI), which used automatically marked free-response questions. Data were collected over a period of three academic years from 611 participants who were taking physics classes at high school and university level. A total of…

Descriptors: Test Construction, Scientific Concepts, Physics, Test Reliability

A General Method for Adjusting Test Score Distributions to Account for Rescoring and Retesting

Peer reviewed

Sophie Litschwartz – Society for Research on Educational Effectiveness, 2021

Background/Context: Pass/fail standardized exams frequently selectively rescore failing exams and retest failing examinees. This practice distorts the test score distribution and can confuse those who do analysis on these distributions. In 2011, the Wall Street Journal showed large discontinuities in the New York City Regent test score…

Descriptors: Standardized Tests, Pass Fail Grading, Scoring Rubrics, Scoring Formulas

Development of a Circular Motion Concept Question Item Inventory for Use in Ugandan Science Education

Peer reviewed
PDF on ERIC

Kirya, Kent Robert; Mashood, Kalarattu Kandiyi; Yadav, Lakhan Lal – Journal of Turkish Science Education, 2022

In this study, we administered and evaluated circular motion concept question items with a view to developing an inventory suitable for the Ugandan context. Before administering the circular concept items, six physics experts and ten undergraduate physics students carried out the face and content validation. One hundred eighteen undergraduate…

Descriptors: Motion, Scientific Concepts, Test Construction, Test Items

Day Scholars Food Insecurity Experience Scale-Survey Module (DSFIES-SM): Psychometric Analysis

Peer reviewed

Ibrahim Kasujja; Hugo Melgar-Quinonez; Joweria Nambooze – SAGE Open, 2023

Background: School feeding programs' evaluation requires the measurement of food insecurity, a more objective indicator, within school in low-income countries. The Global Child Nutrition Foundation (GCNF) uses subjective indicators to report school feeding coverage rates across many countries that participate in the global survey of school meal…

Descriptors: Hunger, Food, Program Effectiveness, Psychometrics

Development and Validation of a Cognitive Diagnostic Assessment with Ordered Multiple-Choice Items for Addition of Time

Peer reviewed

Chin, Huan; Chew, Cheng Meng; Lim, Hooi Lian; Thien, Lei Mee – International Journal of Science and Mathematics Education, 2022

Cognitive Diagnostic Assessment (CDA) is an alternative assessment which can give education stakeholders a clear picture of pupils' learning process and cognitive structures so that appropriate instructional strategies can be designed and tailored to pupils' needs. In line with this function, the Ordered Multiple-Choice (OMC) items were…

Descriptors: Mathematics Instruction, Mathematics Tests, Multiple Choice Tests, Diagnostic Tests
