ERIC - Search Results

Showing all 14 results

Conditional Standard Error of Measurement: Classical Test Theory, Generalizability Theory and Many-Facet Rasch Measurement with Applications to Writing Assessment

Peer reviewed

Huebner, Alan; Skar, Gustaf B. – Practical Assessment, Research & Evaluation, 2021

Writing assessments often consist of students responding to multiple prompts, which are judged by more than one rater. To establish the reliability of these assessments, there exist different methods to disentangle variation due to prompts and raters, including classical test theory, Many Facet Rasch Measurement (MFRM), and Generalizability Theory…

Descriptors: Error of Measurement, Test Theory, Generalizability Theory, Item Response Theory

Psychometric Properties of MATE: A Study Focused on Testing the Generalizability of the Measure of Acceptance of the Theory of Evolution

Peer reviewed

Sya'bandari, Yustika; Rachmatullah, Arif; Ha, Minsu – International Journal of Science Education, 2021

The Measure of Acceptance of the Theory of Evolution (MATE) has been extensively used in science education research for more than two decades. This study examines the fairness of MATE items based on religious convictions and academic majors. The multidimensional item response theory and differential item functioning analyses were run on data…

Descriptors: Attitude Measures, Scientific Attitudes, Evolution, Adoption (Ideas)

Sources of Variance in Special Educator Observation Rubrics

Peer reviewed

Crawford, Angela R.; Johnson, Evelyn S.; Moylan, Laura A.; Zheng, Yuzhu – Grantee Submission, 2018

This study describes the development and initial psychometric evaluation of a Recognizing Effective Special Education Teachers (RESET) teacher observation instrument. Specifically, the study uses generalizability theory to compare two versions of a rubric, one with general descriptors of performance levels and one with item-specific descriptors of…

Descriptors: Special Education Teachers, Direct Instruction, Observation, Teaching Methods

Item Response Theory for Peer Assessment

Peer reviewed

Uto, Masaki; Ueno, Maomi – IEEE Transactions on Learning Technologies, 2016

As an assessment method based on a constructivist approach, peer assessment has become popular in recent years. However, in peer assessment, a problem remains that reliability depends on the rater characteristics. For this reason, some item response models that incorporate rater parameters have been proposed. Those models are expected to improve…

Descriptors: Item Response Theory, Peer Evaluation, Bayesian Statistics, Simulation

The Effects of Testlets on Reliability and Differential Item Functioning

Peer reviewed

Teker, Gulsen Tasdelen; Dogan, Nuri – Educational Sciences: Theory and Practice, 2015

Reliability and differential item functioning (DIF) analyses were conducted on testlets displaying local item dependence in this study. The data set employed in the research was obtained from the answers given by 1,500 students to the 20 items included in six testlets given in English Proficiency Exam by the School of Foreign Languages of a state…

Descriptors: Foreign Countries, Test Items, Test Bias, Item Response Theory

Conceptualizing Essay Tests' Reliability and Validity: From Research to Theory

Badjadi, Nour El Imane – Online Submission, 2013

The current paper on writing assessment surveys the literature on the reliability and validity of essay tests. The paper aims to examine the two concepts in relationship with essay testing as well as to provide a snapshot of the current understandings of the reliability and validity of essay tests as drawn in recent research studies. Bearing in…

Descriptors: Essay Tests, Writing Evaluation, Test Validity, Test Reliability

Psychometric Analysis of the Thermochemistry Concept Inventory

Peer reviewed

Wren, David; Barbera, Jack – Chemistry Education Research and Practice, 2014

Assessing conceptual understanding of foundational topics before instruction on higher-order concepts can provide chemical educators with information to aid instructional design. This study provides an instrument that can be used to identify students' alternative conceptions regarding thermochemistry concepts. The Thermochemistry Concept Inventory…

Descriptors: Psychometrics, Thermodynamics, Chemistry, Item Response Theory

Generalizability Theory and Classical Test Theory

Peer reviewed

Brennan, Robert L. – Applied Measurement in Education, 2011

Broadly conceived, reliability involves quantifying the consistencies and inconsistencies in observed scores. Generalizability theory, or G theory, is particularly well suited to addressing such matters in that it enables an investigator to quantify and distinguish the sources of inconsistencies in observed scores that arise, or could arise, over…

Descriptors: Generalizability Theory, Test Theory, Test Reliability, Item Response Theory

Study of the Reliability of CCSS-Aligned Math Measures (2012 Research Version): Grades 6-8. Technical Report #1312

Anderson, Daniel; Alonzo, Julie; Tindal, Gerald – Behavioral Research and Teaching, 2012

In this technical report, we describe the results of a study of mathematics items written to align with the Common Core State Standards (CCSS) in grades 6-8. In each grade, CCSS items were organized into forms, and the reliability of these forms was evaluated along with an experimental form including items aligned with the National Council of…

Descriptors: Curriculum Based Assessment, Mathematics Tests, Academic Standards, State Standards

Measurement Theory in Language Testing: Past Traditions and Current Trends

Peer reviewed

Salmani-Nodoushan, Mohammad Ali – Journal on Educational Psychology, 2009

A good test is one that has at least three qualities: reliability, or the precision with which a test measures what it is supposed to measure; validity, i.e., if the test really measures what it is supposed to measure; and practicality, or if the test, no matter how sound theoretically, is practicable in reality. These are the sine qua non for any…

Descriptors: Generalizability Theory, Testing, Language Tests, Item Response Theory

Measurement Theory in Language Testing: Past Traditions and Current Trends

Salmani-Nodoushan, Mohammad Ali – Online Submission, 2009

A good test is one that has at least three qualities: reliability, or the precision with which a test measures what it is supposed to measure; validity, i.e., if the test really measures what it is supposed to measure; and practicality, or if the test, no matter how sound theoretically, is practicable in reality. These are the sine qua non for…

Descriptors: Generalizability Theory, Testing, Language Tests, Item Response Theory

Select Psychometric Properties and Predictive Validity of Scores on the SAT Writing Section

Proctor, Thomas P.; Kim, YoungKoung Rachel – College Board, 2009

Presented at the national conference for the American Educational Research Association (AERA) in April 2009. This study examined the utility of scores on the SAT writing test, specifically examining the reliability of scores using generalizability and item response theories. The study also provides an overview of current predictive validity…

Descriptors: College Entrance Examinations, Writing Tests, Psychometrics, Predictive Validity

Generalizability Theory and Many-Facet Rasch Measurement.

Linacre, John M. – 1993

Generalizability theory (G-theory) and many-facet Rasch measurement (Rasch) manage the variability inherent when raters rate examinees on test items. The purpose of G-theory is to estimate test reliability in a raw score metric. Unadjusted examinee raw scores are reported as measures. A variance component is estimated for the examinee…

Descriptors: Comparative Analysis, Equations (Mathematics), Estimation (Mathematics), Evaluators

Handbook on Measurement, Assessment, and Evaluation in Higher Education

Secolsky, Charles, Ed.; Denison, D. Brian, Ed. – Routledge, Taylor & Francis Group, 2011

Increased demands for colleges and universities to engage in outcomes assessment for accountability purposes have accelerated the need to bridge the gap between higher education practice and the fields of measurement, assessment, and evaluation. The "Handbook on Measurement, Assessment, and Evaluation in Higher Education" provides higher…

Descriptors: Generalizability Theory, Higher Education, Institutional Advancement, Teacher Effectiveness
