ERIC - Search Results

Publication Date

In 2025	1
Since 2024	3
Since 2021 (last 5 years)	3
Since 2016 (last 10 years)	6
Since 2006 (last 20 years)	13

Descriptor

Error of Measurement	13
Test Reliability	13
High School Students	7
Scores	7
Psychometrics	6
Test Validity	6
Foreign Countries	4
Factor Analysis	3
Factor Structure	3
Grade 10	3
Grade 9	3
Grade Point Average	3
Measurement Techniques	3
Questionnaires	3
Secondary School Science	3
Academic Achievement	2
Academically Gifted	2
Adaptive Testing	2
Advanced Placement Programs	2
Attitude Measures	2
College Entrance Examinations	2
Computer Assisted Testing	2
Grade 11	2
Grade 12	2
High Schools	2
More ▼

Source

International Journal of…	2
ACT Education Corp.	1
Applied Measurement in…	1
ETS Research Report Series	1
EURASIA Journal of…	1
Evaluation & Research in…	1
GED Testing Service	1
Gifted Child Quarterly	1
Grantee Submission	1
Journal of Chemical Education	1
Measurement:…	1
Remedial and Special Education	1
More ▼

Publication Type

Reports - Research	11
Journal Articles	10
Reports - Descriptive	1
Reports - Evaluative	1

Education Level

High Schools	13
Secondary Education	11
Junior High Schools	5
Middle Schools	5
Grade 10	3
Grade 9	3
Elementary Education	2
Grade 11	2
Grade 12	2
Higher Education	2
Postsecondary Education	2
Elementary Secondary Education	1
Grade 5	1
Grade 8	1
Intermediate Grades	1
More ▼

Audience

Location

Indonesia	3
United Kingdom (England)	1

Laws, Policies, & Programs

Assessments and Surveys

ACT Assessment	1
General Educational…	1
National Merit Scholarship…	1
Preliminary Scholastic…	1

What Works Clearinghouse Rating

Showing all 13 results Save | Export

Comparing Measurement Reliability Estimation Techniques: Correlation Coefficient vs. Bland-Altman Plot

Peer reviewed

Direct link

Tülin Otbiçer Acar – Measurement: Interdisciplinary Research and Perspectives, 2024

The aim of this study is to compare the results of correlation coefficient estimation of reliability with those obtained through the Bland-Altman plot technique. The scale was first divided into two halves using three different approaches. A linear and high-level relationship was found between the scale scores obtained from the halved forms.…

Descriptors: High School Students, Measurement Techniques, Psychometrics, Comparative Testing

Initial Evidence Supporting Interpretations of Scores from the Enhanced ACT Test. ACT Research. Research Report. R2425

Download full text

Jeff Allen; Ty Cruce – ACT Education Corp., 2025

This report summarizes some of the evidence supporting interpretations of scores from the enhanced ACT, focusing on reliability, concurrent validity, predictive validity, and score comparability. The authors argue that the evidence presented in this report supports the interpretation of scores from the enhanced ACT as measures of high school…

Descriptors: College Entrance Examinations, Testing, Change, Scores

Examination of the Factor Structure and Measurement Invariance of the SRSS-IE

Peer reviewed

Direct link

Kathleen Lynne Lane; Wendy Peia Oakes; Mark Matthew Buckman; Nathan Allen Lane; Katie Scarlett Lane; Kandace Fleming; Rebecca E. Swinburne Romine; Rebecca L. Sherod; Chi-Ning Chang; Jamie Jones; Emily Dawn Cantwell; Meredith Crittenden – Remedial and Special Education, 2024

Given the need for a swift, systematic way to identify students with internalizing and externalizing behavior patterns to connect these students with appropriate supports, we present new findings of the Student Risk Screening Scale--Internalizing and Externalizing (SRSS-IE). In this article, we examined (a) factor structure of the SRSS-IE and (b)…

Descriptors: Screening Tests, At Risk Students, Psychometrics, Factor Structure

The Quality of Test Instruments Constructed by Teachers in Bima Regency, Indonesia: Document Analysis

Peer reviewed
PDF on ERIC

Download full text

Ramadhan, Syahrul; Sumiharsono, Rudy; Mardapi, Djemari; Prasetyo, Zuhdan Kun – International Journal of Instruction, 2020

The analysis of the Test Instruments' quality is a crucial thing needs to be conducted. The test instruments made by teachers must fulfil the requirements (validity, reliability, and standard error of measurement) until the measurement result obtained can describe the students' actual abilities. This research aims to analyse the content validity…

Descriptors: Foreign Countries, Teacher Made Tests, Content Validity, Test Reliability

Developing IRT-Based Physics Critical Thinking Skill Test: A CAT to Answer 21st Century Challenge

Peer reviewed
PDF on ERIC

Download full text

Istiyono, Edi; Dwandaru, Wipsar Sunu Brams; Lede, Yulita Adelfin; Rahayu, Farida; Nadapdap, Amipa – International Journal of Instruction, 2019

The objective of this study was to develop Physics critical thinking skill test using computerized adaptive test (CAT) based on item response theory (IRT). This research was a development research using 4-D (define, design, develop, and disseminate). The content validity of the items was proven using Aiken's V. The test trial involved 252 students…

Descriptors: Critical Thinking, Thinking Skills, Cognitive Tests, Physics

Test-Retest Reliability of the Adaptive Chemistry Assessment Survey for Teachers: Measurement Error and Alternatives to Correlation

Peer reviewed

Direct link

Harshman, Jordan; Yezierski, Ellen – Journal of Chemical Education, 2016

Determining the error of measurement is a necessity for researchers engaged in bench chemistry, chemistry education research (CER), and a multitude of other fields. Discussions regarding what constructs measurement error entails and how to best measure them have occurred, but the critiques about traditional measures have yielded few alternatives.…

Descriptors: Science Instruction, Chemistry, Error of Measurement, Psychometrics

Multidimensional Computerized Adaptive Testing for Indonesia Junior High School Biology

Peer reviewed

Direct link

Kuo, Bor-Chen; Daud, Muslem; Yang, Chih-Wei – EURASIA Journal of Mathematics, Science & Technology Education, 2015

This paper describes a curriculum-based multidimensional computerized adaptive test that was developed for Indonesia junior high school Biology. In adherence to the Indonesian curriculum of different Biology dimensions, 300 items was constructed, and then tested to 2238 students. A multidimensional random coefficients multinomial logit model was…

Descriptors: Secondary School Science, Science Education, Science Tests, Computer Assisted Testing

An Application of Generalizability Theory to Evaluate the Technical Quality of an Alternate Assessment

Peer reviewed

Direct link

Taylor, Melinda Ann; Pastor, Dena A. – Applied Measurement in Education, 2013

Although federal regulations require testing students with severe cognitive disabilities, there is little guidance regarding how technical quality should be established. It is known that challenges exist with documentation of the reliability of scores for alternate assessments. Typical measures of reliability do little in modeling multiple sources…

Descriptors: Generalizability Theory, Alternative Assessment, Test Reliability, Scores

Psychometric Properties of the School Attitude Assessment Survey-Revised with International Baccalaureate High School Students

Direct link

Dedrick, Robert F.; Shaunessy-Dedrick, Elizabeth; Suldo, Shannon M.; Ferron, John M. – Gifted Child Quarterly, 2015

In two studies (ns = 312 and 1,149) with 9- to 12-grade students in pre-International Baccalaureate (IB) and IB Diploma programs, we evaluated the reliability, factor structure, measurement invariance, and criterion-related validity of the scores from the School Attitude Assessment Survey-Revised (SAAS-R). Reliabilities of the five SAAS-R subscale…

Descriptors: Psychometrics, High School Students, Advanced Placement Programs, Attitude Measures

Psychometric Properties of the School Attitude Assessment Survey-Revised with International Baccalaureate (IB) High School Students

Peer reviewed
PDF on ERIC

Download full text

Direct link

Dedrick, Robert F.; Shaunessy-Dedrick, Elizabeth; Suldo, Shannon M.; Ferron, John – Grantee Submission, 2015

In two studies (ns = 312 and 1149) with 9-12 grade students in pre-International Baccalaureate (IB) and IB Diploma programs, we evaluated the reliability, factor structure, measurement invariance, and criterion-related validity of the scores from the School Attitude Assessment Survey-Revised (SAAS-R; McCoach & Siegle, 2003a). Reliabilities of…

Descriptors: Psychometrics, High School Students, Advanced Placement Programs, Attitude Measures

Public Perceptions of Reliability in Examination Results in England

Peer reviewed

Direct link

He, Qingping; Boyle, Andrew; Opposs, Dennis – Evaluation & Research in Education, 2011

Building on findings from existing qualitative research into public perceptions of reliability in examination results in England, a questionnaire was developed and administered to samples of teachers, students and employers to study their awareness of and opinions about various aspects of reliability quantitatively. Main findings from the study…

Descriptors: Qualitative Research, Student Evaluation, Tests, Program Effectiveness

Improved Reliability Estimates for Small Samples Using Empirical Bayes Techniques. Research Report. ETS RR-09-46

Peer reviewed
PDF on ERIC

Download full text

Oh, Hyeonjoo J.; Guo, Hongwen; Walker, Michael E. – ETS Research Report Series, 2009

Issues of equity and fairness across subgroups of the population (e.g., gender or ethnicity) must be seriously considered in any standardized testing program. For this reason, many testing programs require some means for assessing test characteristics, such as reliability, for subgroups of the population. However, often only small sample sizes are…

Descriptors: Standardized Tests, Test Reliability, Sample Size, Bayesian Statistics

Reliability and Validity Evidence for the GED[R] English as a Second Language Test. GED Testing Service[R] Research Studies, 2009-4

Download full text

Setzer, J. Carl – GED Testing Service, 2009

The GED[R] English as a Second Language (GED ESL) Test was designed to serve as an adjunct to the GED test battery when an examinee takes either the Spanish- or French-language version of the tests. The GED ESL Test is a criterion-referenced, multiple-choice instrument that assesses the functional, English reading skills of adults whose first…

Descriptors: Language Tests, High School Equivalency Programs, Psychometrics, Reading Skills

Dedrick, Robert F.	2
Shaunessy-Dedrick, Elizabeth	2
Suldo, Shannon M.	2
Boyle, Andrew	1
Chi-Ning Chang	1
Daud, Muslem	1
Dwandaru, Wipsar Sunu Brams	1
Emily Dawn Cantwell	1
Ferron, John	1
Ferron, John M.	1
Guo, Hongwen	1
Harshman, Jordan	1
He, Qingping	1
Istiyono, Edi	1
Jamie Jones	1
Jeff Allen	1
Kandace Fleming	1
Kathleen Lynne Lane	1
Katie Scarlett Lane	1
Kuo, Bor-Chen	1
Lede, Yulita Adelfin	1
Mardapi, Djemari	1
Mark Matthew Buckman	1
Meredith Crittenden	1
Nadapdap, Amipa	1
More ▼