Tülin Otbiçer Acar – Measurement: Interdisciplinary Research and Perspectives, 2024
The aim of this study is to compare reliability estimates based on correlation coefficients with those obtained through the Bland-Altman plot technique. The scale was first divided into two halves using three different approaches. A strong linear relationship was found between the scale scores obtained from the halved forms.…
Descriptors: High School Students, Measurement Techniques, Psychometrics, Comparative Testing
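For readers unfamiliar with the two approaches named in the abstract above, the following is a minimal sketch, not the study's own procedure. It assumes two lists of half-test scores for the same examinees, computes the Pearson correlation between halves (with the Spearman-Brown step-up, which the abstract does not mention and is added here as an assumption), and computes the Bland-Altman bias and 95% limits of agreement. The function names and the scores are hypothetical.

```python
import math
import statistics


def pearson_r(x, y):
    """Pearson correlation between two equal-length score lists."""
    mx, my = statistics.mean(x), statistics.mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def split_half_summary(half_a, half_b):
    """Correlation-based and Bland-Altman summaries for two test halves."""
    r = pearson_r(half_a, half_b)
    diffs = [a - b for a, b in zip(half_a, half_b)]
    bias = statistics.mean(diffs)   # mean difference between halves
    sd = statistics.stdev(diffs)    # sample SD of the differences
    return {
        "r": r,
        # Spearman-Brown step-up to full test length (assumed, see lead-in).
        "spearman_brown": 2 * r / (1 + r),
        "bias": bias,
        # Conventional 95% limits of agreement: bias +/- 1.96 * SD.
        "limits": (bias - 1.96 * sd, bias + 1.96 * sd),
    }


# Made-up half-test scores for five examinees (illustrative only).
half_a = [10, 12, 9, 15, 11]
half_b = [11, 11, 10, 14, 12]
summary = split_half_summary(half_a, half_b)
print(summary)
```

The correlation describes how consistently the halves rank examinees, while the limits of agreement describe how far individual half scores can differ, so the two summaries can disagree about how "reliable" a scale looks.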
Jeff Allen; Ty Cruce – ACT Education Corp., 2025
This report summarizes some of the evidence supporting interpretations of scores from the enhanced ACT, focusing on reliability, concurrent validity, predictive validity, and score comparability. The authors argue that the evidence presented in this report supports the interpretation of scores from the enhanced ACT as measures of high school…
Descriptors: College Entrance Examinations, Testing, Change, Scores
Kathleen Lynne Lane; Wendy Peia Oakes; Mark Matthew Buckman; Nathan Allen Lane; Katie Scarlett Lane; Kandace Fleming; Rebecca E. Swinburne Romine; Rebecca L. Sherod; Chi-Ning Chang; Jamie Jones; Emily Dawn Cantwell; Meredith Crittenden – Remedial and Special Education, 2024
Given the need for a swift, systematic way to identify students with internalizing and externalizing behavior patterns to connect these students with appropriate supports, we present new findings of the Student Risk Screening Scale--Internalizing and Externalizing (SRSS-IE). In this article, we examined (a) factor structure of the SRSS-IE and (b)…
Descriptors: Screening Tests, At Risk Students, Psychometrics, Factor Structure
Ramadhan, Syahrul; Sumiharsono, Rudy; Mardapi, Djemari; Prasetyo, Zuhdan Kun – International Journal of Instruction, 2020
Analysing the quality of test instruments is crucial. Test instruments made by teachers must fulfil the requirements (validity, reliability, and standard error of measurement) so that the measurement results obtained describe the students' actual abilities. This research aims to analyse the content validity…
Descriptors: Foreign Countries, Teacher Made Tests, Content Validity, Test Reliability
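As background for the standard error of measurement mentioned in the abstract above, here is a minimal sketch of the classical test theory formula SEM = SD × sqrt(1 − reliability). It is not drawn from the study; the function names, the standard deviation, and the reliability value are hypothetical.

```python
import math


def standard_error_of_measurement(score_sd, reliability):
    """Classical test theory SEM: SD * sqrt(1 - reliability)."""
    if not 0.0 <= reliability <= 1.0:
        raise ValueError("reliability must lie between 0 and 1")
    return score_sd * math.sqrt(1.0 - reliability)


def score_band(observed_score, sem, z=1.96):
    """Approximate band of observed score +/- z * SEM (95% when z = 1.96)."""
    return (observed_score - z * sem, observed_score + z * sem)


# Hypothetical values: score SD of 10 and a reliability estimate of 0.91.
sem = standard_error_of_measurement(10.0, 0.91)
band = score_band(70.0, sem)
print(round(sem, 3), band)
```

With these assumed inputs the SEM is about 3 score points, so an observed score of 70 carries a band of roughly 64 to 76.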
Istiyono, Edi; Dwandaru, Wipsar Sunu Brams; Lede, Yulita Adelfin; Rahayu, Farida; Nadapdap, Amipa – International Journal of Instruction, 2019
The objective of this study was to develop a physics critical thinking skill test using a computerized adaptive test (CAT) based on item response theory (IRT). This was development research using 4-D (define, design, develop, and disseminate). The content validity of the items was established using Aiken's V. The test trial involved 252 students…
Descriptors: Critical Thinking, Thinking Skills, Cognitive Tests, Physics
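The abstract above cites Aiken's V as its content validity index. As a minimal sketch, assuming expert ratings on a 1-to-5 scale, V is the sum of each rating minus the lowest category, divided by n × (c − 1), where n is the number of raters and c the number of categories. The function name and the ratings are hypothetical and do not come from the study.

```python
def aikens_v(ratings, lowest=1, highest=5):
    """Aiken's V for one item: sum(r - lowest) / (n * (c - 1))."""
    if not ratings:
        raise ValueError("at least one rating is required")
    if any(r < lowest or r > highest for r in ratings):
        raise ValueError("rating outside the scale range")
    n = len(ratings)
    # c - 1 equals the distance between the highest and lowest category.
    return sum(r - lowest for r in ratings) / (n * (highest - lowest))


# Hypothetical ratings from five expert judges for a single item.
item_ratings = [4, 5, 4, 5, 3]
v = aikens_v(item_ratings)
print(v)
```

V ranges from 0 (all judges chose the lowest category) to 1 (all chose the highest); the cut-off for an acceptable item depends on the number of raters and categories.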
Harshman, Jordan; Yezierski, Ellen – Journal of Chemical Education, 2016
Determining the error of measurement is a necessity for researchers engaged in bench chemistry, chemistry education research (CER), and a multitude of other fields. Discussions regarding what constructs measurement error entails and how to best measure them have occurred, but the critiques about traditional measures have yielded few alternatives.…
Descriptors: Science Instruction, Chemistry, Error of Measurement, Psychometrics
Kuo, Bor-Chen; Daud, Muslem; Yang, Chih-Wei – EURASIA Journal of Mathematics, Science & Technology Education, 2015
This paper describes a curriculum-based multidimensional computerized adaptive test that was developed for Indonesian junior high school biology. In adherence to the different biology dimensions of the Indonesian curriculum, 300 items were constructed and then administered to 2,238 students. A multidimensional random coefficients multinomial logit model was…
Descriptors: Secondary School Science, Science Education, Science Tests, Computer Assisted Testing
Taylor, Melinda Ann; Pastor, Dena A. – Applied Measurement in Education, 2013
Although federal regulations require testing students with severe cognitive disabilities, there is little guidance regarding how technical quality should be established. It is known that challenges exist with documentation of the reliability of scores for alternate assessments. Typical measures of reliability do little in modeling multiple sources…
Descriptors: Generalizability Theory, Alternative Assessment, Test Reliability, Scores
Dedrick, Robert F.; Shaunessy-Dedrick, Elizabeth; Suldo, Shannon M.; Ferron, John M. – Gifted Child Quarterly, 2015
In two studies (ns = 312 and 1,149) with 9th- to 12th-grade students in pre-International Baccalaureate (IB) and IB Diploma programs, we evaluated the reliability, factor structure, measurement invariance, and criterion-related validity of the scores from the School Attitude Assessment Survey-Revised (SAAS-R). Reliabilities of the five SAAS-R subscale…
Descriptors: Psychometrics, High School Students, Advanced Placement Programs, Attitude Measures
Dedrick, Robert F.; Shaunessy-Dedrick, Elizabeth; Suldo, Shannon M.; Ferron, John – Grantee Submission, 2015
In two studies (ns = 312 and 1,149) with 9th- to 12th-grade students in pre-International Baccalaureate (IB) and IB Diploma programs, we evaluated the reliability, factor structure, measurement invariance, and criterion-related validity of the scores from the School Attitude Assessment Survey-Revised (SAAS-R; McCoach & Siegle, 2003a). Reliabilities of…
Descriptors: Psychometrics, High School Students, Advanced Placement Programs, Attitude Measures
He, Qingping; Boyle, Andrew; Opposs, Dennis – Evaluation & Research in Education, 2011
Building on findings from existing qualitative research into public perceptions of reliability in examination results in England, a questionnaire was developed and administered to samples of teachers, students and employers to study their awareness of and opinions about various aspects of reliability quantitatively. Main findings from the study…
Descriptors: Qualitative Research, Student Evaluation, Tests, Program Effectiveness
Oh, Hyeonjoo J.; Guo, Hongwen; Walker, Michael E. – ETS Research Report Series, 2009
Issues of equity and fairness across subgroups of the population (e.g., gender or ethnicity) must be seriously considered in any standardized testing program. For this reason, many testing programs require some means for assessing test characteristics, such as reliability, for subgroups of the population. However, often only small sample sizes are…
Descriptors: Standardized Tests, Test Reliability, Sample Size, Bayesian Statistics
Setzer, J. Carl – GED Testing Service, 2009
The GED® English as a Second Language (GED ESL) Test was designed to serve as an adjunct to the GED test battery when an examinee takes either the Spanish- or French-language version of the tests. The GED ESL Test is a criterion-referenced, multiple-choice instrument that assesses the functional English reading skills of adults whose first…
Descriptors: Language Tests, High School Equivalency Programs, Psychometrics, Reading Skills