ERIC - Search Results

Publication Date

In 2025	3
Since 2024	4
Since 2021 (last 5 years)	15
Since 2016 (last 10 years)	63
Since 2006 (last 20 years)	123

Descriptor

Scores	253
Test Reliability	253
Test Validity	132
Testing	68
Computer Assisted Testing	66
Testing Problems	60
Test Construction	51
Standardized Tests	43
Correlation	42
Test Interpretation	42
Achievement Tests	38
Elementary Secondary Education	37
Psychometrics	37
Foreign Countries	36
Scoring	36
Comparative Analysis	35
Statistical Analysis	34
Language Tests	30
Test Bias	29
Test Items	28
Student Evaluation	27
Evaluation Methods	26
Academic Achievement	24
Item Analysis	24
Item Response Theory	24

Education Level

Elementary Education	26
Higher Education	26
Postsecondary Education	20
Middle Schools	16
Elementary Secondary Education	15
Secondary Education	14
Early Childhood Education	13
Intermediate Grades	13
Primary Education	11
Grade 3	10
High Schools	10
Junior High Schools	10
Grade 4	9
Grade 5	9
Grade 6	8
Grade 7	8
Grade 8	7
Grade 9	5
Grade 2	4
Grade 10	2
Grade 11	2
Grade 12	2
High School Equivalency…	2
Preschool Education	2
Adult Basic Education	1

Audience

Researchers	12
Practitioners	8
Administrators	2
Teachers	2
Community	1
Parents	1
Policymakers	1

Location

Australia	4
China	4
United Kingdom	4
Vermont	4
California	3
Canada	3
Germany	3
Israel	3
Turkey	3
United Kingdom (England)	3
United States	3
Connecticut	2
Florida	2
Illinois	2
Indonesia	2
Kenya	2
Netherlands	2
Sweden	2
Alaska	1
Arizona	1
Asia	1
Brazil	1
Denmark	1
Egypt	1
Estonia	1

Laws, Policies, & Programs

Elementary and Secondary…	4
Elementary and Secondary…	1
Every Student Succeeds Act…	1
Individuals with Disabilities…	1
No Child Left Behind Act 2001	1
Race to the Top	1

What Works Clearinghouse Rating

Meets WWC Standards without Reservations	1
Meets WWC Standards with or without Reservations	1

Showing 1 to 15 of 253 results

Digital-First Assessments: A Security Framework

Peer reviewed

LaFlair, Geoffrey T.; Langenfeld, Thomas; Baig, Basim; Horie, André Kenji; Attali, Yigal; von Davier, Alina A. – Journal of Computer Assisted Learning, 2022

Background: Digital-first assessments leverage the affordances of technology in all elements of the assessment process--from design and development to score reporting and evaluation to create test taker-centric assessments. Objectives: The goal of this paper is to describe the engineering, machine learning, and psychometric processes and…

Descriptors: Computer Assisted Testing, Affordances, Scoring, Engineering

Measurement Invariance and Structure Validity of Scores on the Center for Epidemiologic Studies Depression - Revised (CESD-R) Scale with a Large University Sample

Peer reviewed

Julie Sriken; Bradley T. Erford; Martin F. Sherman; Kristen Watson; Heather L. Smith – Measurement and Evaluation in Counseling and Development, 2024

Psychometric characteristics of CESD-R scores were explored on a sample of 966 undergraduate students. Internal consistency (α = 0.92), external convergent and discriminant validity, and response bias were adequate to excellent. Strong measurement invariance was evident for gender and race comparisons, and the unidimensional model fit the…

Descriptors: Symptoms (Individual Disorders), Depression (Psychology), Measures (Individuals), Undergraduate Students

Reliability of Computer-Based CBMs versus Paper/Pencil Administration for Fact and Complex Operations in Mathematics

Peer reviewed

VanDerHeyden, Amanda M.; Codding, Robin; Solomon, Benjamin G. – Remedial and Special Education, 2023

Computer-based curriculum-based measurement (CBM) is a relatively common practice, but surprisingly few studies have examined the reliability of computer-based CBM. This study sought to examine the reliability of CBM administered via paper/pencil versus the computer. Twenty-one of 25 students in two third-grade classes (N = 21) participated in two…

Descriptors: Curriculum Based Assessment, Computer Assisted Testing, Test Format, Grade 3

Measurement Invariance of Scores on the Teacher Stress Scale: International Sample of PreK-12 Teachers

Peer reviewed

Jiayi Wang; Michael T. Kalkbrenner; Riley Schaner – Psychology in the Schools, 2025

Teaching is a stressful profession with a high turnover rate. Schools and related institutions need to take more action to support teachers and keep teacher stress at a manageable level. The continued research and practical effort require measures to examine teachers' stress in a briefer and accurate manner. The Teacher Stress Scale is a recently…

Descriptors: Elementary School Teachers, Secondary School Teachers, Preschool Teachers, Stress Variables

Using Multilabel Neural Network to Score High-Dimensional Assessments for Different Use Foci: An Example with College Major Preference Assessment

Peer reviewed

Shun-Fu Hu; Amery D. Wu; Jake Stone – Journal of Educational Measurement, 2025

Scoring high-dimensional assessments (e.g., > 15 traits) can be a challenging task. This paper introduces the multilabel neural network (MNN) as a scoring method for high-dimensional assessments. Additionally, it demonstrates how MNN can score the same test responses to maximize different performance metrics, such as accuracy, recall, or…

Descriptors: Tests, Testing, Scores, Test Construction

Comparison of Two Test Methods for VIS: Paper-Pencil Test and CAT

Peer reviewed

Senel, Selma; Kutlu, Ömer – European Journal of Special Needs Education, 2018

This paper examines listening comprehension skills of visually impaired students (VIS) using computerised adaptive testing (CAT) and reader-assisted paper-pencil testing (raPPT) and student views about them. Explanatory mixed method design was used in this study. Sample is comprised of 51 VIS, in 7th and 8th grades. 9 of these students were…

Descriptors: Computer Assisted Testing, Adaptive Testing, Visual Impairments, Student Attitudes

Test Review: Computer-Based English Listening and Speaking Test (CELST) of National Matriculation English Test (NMET) Guangdong Version in China

Peer reviewed

Ying Xu; Xiaodong Li; Jin Chen – Language Testing, 2025

This article provides a detailed review of the Computer-based English Listening Speaking Test (CELST) used in Guangdong, China, as part of the National Matriculation English Test (NMET) to assess students' English proficiency. The CELST measures listening and speaking skills as outlined in the "English Curriculum for Senior Middle…

Descriptors: Computer Assisted Testing, English (Second Language), Language Tests, Listening Comprehension Tests

Somers' D as an Alternative for the Item-Test and Item-Rest Correlation Coefficients in the Educational Measurement Settings

Peer reviewed
PDF on ERIC

Metsämuuronen, Jari – International Journal of Educational Methodology, 2020

Pearson product-moment correlation coefficient between item g and test score X, known as item-test or item-total correlation ("Rit"), and item-rest correlation ("Rir") are two of the most used classical estimators for item discrimination power (IDP). Both "Rit" and "Rir" underestimate IDP caused by the…

Descriptors: Correlation, Test Items, Scores, Difficulty Level

Reliability. Improving Literacy Brief: Understanding Screening

Petscher, Y.; Pentimonti, J.; Stanley, C. – National Center on Improving Literacy, 2019

Reliability is the consistency of a set of scores that are designed to measure the same thing. Reliability is a statistical property of scores that must be demonstrated rather than assumed.

Descriptors: Scores, Measurement, Test Reliability, Error Patterns

Responsibilities of Users of Standardized Tests (Rust-4E)

Peer reviewed

Lenz, A. Stephen; Ault, Haley; Balkin, Richard S.; Barrio Minton, Casey; Erford, Bradley T.; Hays, Danica G.; Kim, Bryan S. K.; Li, Chi – Measurement and Evaluation in Counseling and Development, 2022

In April 2021, The Association for Assessment and Research in Counseling Executive Council commissioned a time-referenced task group to revise the Responsibilities of Users of Standardized Tests (RUST) Statement (3rd edition) published by the Association for Assessment in Counseling (AAC) in 2003. The task group developed a work plan to implement…

Descriptors: Responsibility, Standardized Tests, Counselor Training, Ethics

ACTFL Oral Proficiency Interview -- Computer (OPIc)

Peer reviewed

Isbell, Dan; Winke, Paula – Language Testing, 2019

The American Council on the Teaching of Foreign Languages (ACTFL) oral proficiency interview -- computer (OPIc) testing system represents an ambitious effort in language assessment: Assessing oral proficiency in over a dozen languages, on the same scale, from virtually anywhere at any time. Especially for users in contexts where multiple foreign…

Descriptors: Oral Language, Language Tests, Language Proficiency, Second Language Learning

Review of Recent Empirical Research (2011-2018) on Language Assessment in China

Peer reviewed

Min, Shangchao; He, Lianzhen; Zhang, Jie – Language Teaching, 2020

This article reviews a selected sample of 70 empirical studies in journal articles and doctoral dissertations on language assessment in China between 2011 and 2018. Following a brief introduction to the history and current state of language assessment in China, the article presents a critical review of language assessment research on six themes…

Descriptors: Language Tests, Test Reliability, Test Validity, Journal Articles

Measuring Scientific Reasoning of Fourth Graders: Validation of the Science-K Inventory in Paper-Based and Computer-Based Testing Environments

Peer reviewed
PDF on ERIC

Márió Tibor Nagy; Erzsébet Korom – Journal of Baltic Science Education, 2023

Nowadays, the assessment of student performance has become increasingly technology-based, a trend that can also be observed in the evaluation of scientific reasoning, with more and more of the formerly paper-based assessment tools moving into the digital space. The study aimed to examine the reliability and validity of the paper-based and…

Descriptors: Science Process Skills, Elementary School Students, Grade 4, Science Tests

Measuring Language Ability of Students with Compensatory Multidimensional CAT: A Post-Hoc Simulation Study

Peer reviewed

Ozdemir, Burhanettin; Gelbal, Selahattin – Education and Information Technologies, 2022

The computerized adaptive tests (CAT) apply an adaptive process in which the items are tailored to individuals' ability scores. The multidimensional CAT (MCAT) designs differ in terms of different item selection, ability estimation, and termination methods being used. This study aims at investigating the performance of the MCAT designs used to…

Descriptors: Scores, Computer Assisted Testing, Test Items, Language Proficiency

Examining Skills and Abilities during the Pandemic -- Psychology Students' and Examiners' Perceptions of a Digital OSCE

Peer reviewed

Hakelind, Camilla; Sundström, Anna E. – Psychology Learning and Teaching, 2022

Finding valid and reliable ways to assess complex clinical skills within psychology is a challenge. Recently, there have been some examples of applying Objective Structured Clinical Examinations (OSCEs) in psychology for making such assessments. The aim of this study was to examine students' and examiners' perceptions of a digital OSCE in…

Descriptors: Graduate Students, Masters Programs, Clinical Psychology, Student Evaluation

Source

Language Testing	8
ETS Research Report Series	7
ProQuest LLC	7
Educational Measurement:…	4
Partnership for Assessment of…	4
Applied Measurement in…	3
Applied Psychological…	3
Educational and Psychological…	3
GED Testing Service	3
Journal of Psychoeducational…	3
Language, Speech, and Hearing…	3
Measurement and Evaluation in…	3
Psychology in the Schools	3
Regional Educational…	3
Advances in Health Sciences…	2
American School Board Journal	2
Canadian Journal of School…	2
Education and Information…	2
Grantee Submission	2
International Journal of…	2
Journal of Clinical Psychology	2
Journal of Experimental…	2
Journal of Speech, Language,…	2
Language Assessment Quarterly	2
Multivariate Behavioral…	2

Author

Bennett, Randy Elliot	3
Gallas, Edwin J.	3
Koretz, Daniel	3
Booker, Kevin	2
Bruch, Julie	2
Ferguson, Richard L.	2
Gill, Brian	2
Hambleton, Ronald K.	2
Kapes, Jerome T.	2
Ling, Guangming	2
McNeil, Malcolm R.	2
Nese, Joseph F. T.	2
Sawaki, Yasuyo	2
Setzer, J. Carl	2
Sinharay, Sandip	2
Steinberg, Jonathan	2
Thurlow, Martha L.	2
Zimmerman, Donald W.	2
Airasian, Peter W.	1
Alkoby, Moty	1
Allen, Nancy L.	1
Allen, Thomas E.	1
Allison, Donald E.	1
Altman, Jason	1

Publication Type

Journal Articles	134
Reports - Research	133
Reports - Evaluative	49
Speeches/Meeting Papers	24
Reports - Descriptive	19
Numerical/Quantitative Data	15
Guides - Non-Classroom	11
Opinion Papers	10
Tests/Questionnaires	10
Information Analyses	8
Dissertations/Theses -…	7
Books	3
Collected Works - Proceedings	3
Guides - General	3
Collected Works - Serials	2
Guides - Classroom - Teacher	1
Reference Materials -…	1
Reference Materials - General	1
Reports - General	1

Assessments and Surveys

National Assessment of…	7
SAT (College Admission Test)	7
ACT Assessment	5
Stanford Achievement Tests	5
Test of English as a Foreign…	5
Wechsler Adult Intelligence…	5
California Achievement Tests	4
Iowa Tests of Basic Skills	4
Wechsler Intelligence Scale…	4
ACTFL Oral Proficiency…	3
Comprehensive Tests of Basic…	3
General Educational…	3
Metropolitan Achievement Tests	3
Peabody Picture Vocabulary…	3
Dynamic Indicators of Basic…	2
International English…	2
Marlowe Crowne Social…	2
Preliminary Scholastic…	2
Raven Progressive Matrices	2
Slosson Intelligence Test	2
Woodcock Johnson Tests of…	2
Armed Forces Qualification…	1
Autism Diagnostic Observation…	1
Block Design Test	1
Center for Epidemiologic…	1