ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	1
Since 2017 (last 10 years)	5
Since 2007 (last 20 years)	13

Descriptor

Standardized Tests	36
Statistical Analysis	36
Test Reliability	36
Test Validity	13
Scores	10
Correlation	9
Achievement Tests	8
Comparative Analysis	7
Item Analysis	7
Test Results	7
Academic Achievement	6
Evaluation Methods	6
Reading Tests	6
Test Construction	6
Test Interpretation	6
Test Items	6
Elementary Education	5
Equated Scores	5
Foreign Countries	5
Mathematical Models	5
Student Evaluation	5
Test Bias	5
Testing	5
Testing Problems	5
Achievement Gains	4
More ▼

Source

Behavioral Research and…	2
Regional Educational…	2
American Journal of…	1
Assessment in Education:…	1
Canadian Journal of…	1
Frontline Learning Research	1
Journal of Educational…	1
Journal of Speech, Language,…	1
Journal of Teacher Education	1
Mid-Western Educational…	1
National Center for Education…	1
Online Submission	1
Phi Delta Kappan	1
More ▼

Publication Type

Reports - Research	21
Journal Articles	10
Reports - Evaluative	6
Numerical/Quantitative Data	3
Collected Works - Proceedings	1
Guides - Non-Classroom	1
Information Analyses	1
Opinion Papers	1
Reports - Descriptive	1
Speeches/Meeting Papers	1
Tests/Questionnaires	1
More ▼

Education Level

Middle Schools	6
Elementary Education	5
Higher Education	3
Elementary Secondary Education	2
Grade 3	2
Grade 7	2
High Schools	2
Intermediate Grades	2
Postsecondary Education	2
Early Childhood Education	1
Grade 2	1
Grade 5	1
Grade 6	1
Grade 8	1
Primary Education	1
More ▼

Audience

Practitioners

Location

Germany	2
California	1
Canada	1
China	1
Netherlands	1
Texas (Houston)	1

Laws, Policies, & Programs

Assessments and Surveys

ACT Assessment	3
Iowa Tests of Basic Skills	3
Stanford Achievement Tests	3
Dynamic Indicators of Basic…	2
Preliminary Scholastic…	2
Adjective Check List	1
California Achievement Tests	1
Graduate Record Examinations	1
Metropolitan Achievement Tests	1
Modern Language Aptitude Test	1
Peabody Picture Vocabulary…	1
Raven Progressive Matrices	1
Stanford Binet Intelligence…	1
Wechsler Intelligence Scale…	1
Wonderlic Personnel Test	1
More ▼

What Works Clearinghouse Rating

Meets WWC Standards without Reservations	1
Meets WWC Standards with or without Reservations	1

Showing 1 to 15 of 36 results Save | Export

(In)Stability of Test Scores

Peer reviewed
PDF on ERIC

Download full text

Merchant, Stefan; Rich, Jessica; Klinger, Don A. – Canadian Journal of Educational Administration and Policy, 2022

Both school and district administrators use the results of standardized, large-scale tests to inform decisions about the need for, or success of, educational programs and interventions. However, test results at the school level are subject to random fluctuations due to changes in cohort, test items, and other factors outside of the school's…

Descriptors: Standardized Tests, Foreign Countries, Generalizability Theory, Scores

Development of Psychometrically Validated Standardized Test Instruments for Outcomes Assessment in Experiential Engineering Education

Peer reviewed
PDF on ERIC

Download full text

Ssemakula, Mukasa E.; Liao, Gene Y.; Sawilowsky, Shlomo – American Journal of Engineering Education, 2018

There is a major trend in engineering education to provide students with realistic hands-on learning experiences. This paper reports on the results of work done to develop standardized test instruments to use for student learning outcomes assessment in an experiential hands-on manufacturing engineering and technology environment. The specific…

Descriptors: Test Construction, Psychometrics, Test Validity, Standardized Tests

Loosening Psychometric Constraints on Educational Assessments

Peer reviewed

Direct link

Kane, Michael T. – Assessment in Education: Principles, Policy & Practice, 2017

In response to an argument by Baird, Andrich, Hopfenbeck and Stobart (2017), Michael Kane states that there needs to be a better fit between educational assessment and learning theory. In line with this goal, Kane will examine how psychometric constraints might be loosened by relaxing some psychometric "rules" in some assessment…

Descriptors: Educational Assessment, Psychometrics, Standards, Test Reliability

Research to Establish the Validity, Reliability, and Clinical Utility of a Comprehensive Language Assessment of Mandarin

Peer reviewed

Direct link

Liu, Xueman Lucy; de Villiers, Jill; Ning, Chunyan; Rolfhus, Eric; Hutchings, Teresa; Lee, Wendy; Jiang, Fan; Zhang, Yi Wen – Journal of Speech, Language, and Hearing Research, 2017

Purpose: With no existing gold standard for comparison, challenges arise for establishing the validity of a new standardized Mandarin language assessment normed in mainland China. Method: A new assessment, Diagnostic Receptive and Expressive Assessment of Mandarin (DREAM), was normed with a stratified sample of 969 children ages 2;6 (years;months)…

Descriptors: Mandarin Chinese, Correlation, Language Tests, Diagnostic Tests

All Sizzle and No Steak: Value-Added Model Doesn't Add Value in Houston

Direct link

Amrein-Beardsley, Audrey; Geiger, Tray – Phi Delta Kappan, 2017

Houston's experience with the Educational Value-Added Assessment System (R) (EVAAS) raises questions that other districts should consider before buying the software and using it for high-stakes decisions. Researchers found that teachers in Houston, all of whom were under the EVAAS gun, but who taught relatively more racial minority students,…

Descriptors: Value Added Models, School Districts, Computer Software, Educational Technology

Teachers' Professional Knowledge for Teaching English as a Foreign Language: Assessing the Outcomes of Teacher Education

Peer reviewed

Direct link

König, Johannes; Lammerding, Sandra; Nold, Günter; Rohde, Andreas; Strauß, Sarah; Tachtsoglou, Sarantis – Journal of Teacher Education, 2016

Despite an increasing research interest in subject-specific teacher knowledge, the scientific understanding regarding teachers' professional knowledge for teaching English as a foreign language (TEFL) is very limited. This study therefore applies standardized tests to directly assess content knowledge (CK), pedagogical content knowledge (PCK), and…

Descriptors: Knowledge Base for Teaching, English (Second Language), Second Language Learning, Second Language Instruction

Competence Assessment of Students with Special Educational Needs--Identification of Appropriate Testing Accommodations

Peer reviewed
PDF on ERIC

Download full text

Südkamp, Anna; Pohl, Steffi; Weinert, Sabine – Frontline Learning Research, 2015

Including students with special educational needs in learning (SEN-L) is a challenge for large-scale assessments. In order to draw inferences with respect to students with SEN-L and to compare their scores to students in general education, one needs to assure that the measurement model is reliable and that the same construct is measured for…

Descriptors: Disabilities, Special Education, Inclusion, Competence

Analyzing the Reliability of the easyCBM Reading Comprehension Measures: Grade 2. Technical Report #1201

Download full text

Lai, Cheng-Fei; Irvin, P. Shawn; Alonzo, Julie; Park, Bitnara Jasmine; Tindal, Gerald – Behavioral Research and Teaching, 2012

In this technical report, we present the results of a reliability study of the second-grade multiple choice reading comprehension measures available on the easyCBM learning system conducted in the spring of 2011. Analyses include split-half reliability, alternate form reliability, person and item reliability as derived from Rasch analysis,…

Descriptors: Reading Comprehension, Testing Programs, Statistical Analysis, Elementary School Students

Analyzing the Reliability of the easyCBM Reading Comprehension Measures: Grade 7. Technical Report #1206

Download full text

Irvin, P. Shawn; Alonzo, Julie; Lai, Cheng-Fei; Park, Bitnara Jasmine; Tindal, Gerald – Behavioral Research and Teaching, 2012

In this technical report, we present the results of a reliability study of the seventh-grade multiple choice reading comprehension measures available on the easyCBM learning system conducted in the spring of 2011. Analyses include split-half reliability, alternate form reliability, person and item reliability as derived from Rasch analysis,…

Descriptors: Reading Comprehension, Testing Programs, Statistical Analysis, Grade 7

Using Alternative Student Growth Measures for Evaluating Teacher Performance: What the Literature Says. REL 2013-002

Peer reviewed
PDF on ERIC

Download full text

Gill, Brian; Bruch, Julie; Booker, Kevin – Regional Educational Laboratory Mid-Atlantic, 2013

States are increasingly interested in including measures of student achievement growth, or "value- added," in evaluating teachers. Annual state assessments, however, which are the typical measure of student growth, usually cover only reading and math teachers and only in grades 4-8. These state assessments thus cannot …

Descriptors: Teacher Evaluation, Teacher Competencies, Evaluation Methods, Educational Testing

Using Alternative Student Growth Measures for Evaluating Teacher Performance: What the Literature Says. Summary. REL 2013-002

Peer reviewed
PDF on ERIC

Download full text

Gill, Brian; Bruch, Julie; Booker, Kevin – Regional Educational Laboratory Mid-Atlantic, 2013

States and school districts are exploring alternatives to state tests for measuring teachers' contributions to student learning. One approach applies statistical value-added methods to alternative student assessments such as commercially available tests and end-of course tests. The evidence suggests that these methods can reliably distinguish…

Descriptors: Teacher Evaluation, Teacher Competencies, Evaluation Methods, Educational Testing

Accommodations for English Language Learner Students: The Effect of Linguistic Modification of Math Test Item Sets. Final Report. NCEE 2009-4079

Peer reviewed
PDF on ERIC

Download full text

Sato, Edynn; Rabinowitz, Stanley; Gallagher, Carole; Huang, Chun-Wei – National Center for Education Evaluation and Regional Assistance, 2010

This study examined the effect of linguistic modification on middle school students' ability to show what they know and can do on math assessments. REL West's study on middle school math assessment accommodations found that simplifying the language--or linguistic modification--on standardized math test items made it easier for English Language…

Descriptors: Test Items, Standardized Tests, Mathematics Tests, Testing Accommodations

An Analysis of Homeschooled and Non-Homeschooled Students' Performance on an ACT Mathematics Achievement Test

Download full text

Qaqish, Basil – Online Submission, 2007

ACT college test publisher provided scores. On average, non-homeschoolers performed better than homeschoolers, by about two items, out of sixty items, on the ACT mathematics test that was analyzed. This result may be due to the different teaching/learning media used in teaching each of the two groups, to different teacher/student interaction, or…

Descriptors: Home Schooling, College Entrance Examinations, Standardized Tests, Achievement Tests

Efficiency of Linear Equating as a Function of the Length of the Anchor Test.

Peer reviewed

Budescu, David – Journal of Educational Measurement, 1985

An important determinant of equating process efficiency is the correlation between the anchor test and components of each form. Use of some monotonic function of this correlation as a measure of equating efficiency is suggested. A model relating anchor test length and test reliability to this measure of efficiency is presented. (Author/DWH)

Descriptors: Correlation, Equated Scores, Mathematical Models, Standardized Tests

Differences Between Kuder-Richardson Formula 20 and Formula 21 Reliability Coefficients for Short Tests with Different Item Variabilities.

Download full text

Lenke, Joanne M.; And Others – 1977

To investigate the effect of violating the assumption of equal item difficulty on Kuder-Richardson (KR) Formula 21 reliability coefficient, 670 eighth-and ninth- grade students were administered 26 short, homogeneous "tests" of mathematics concepts and skills. Both KR Formula 20 and KR Formula 21 were used to estimate reliability on each…

Descriptors: Comparative Analysis, Diagnostic Tests, Difficulty Level, Item Analysis

Previous Page | Next Page »

Pages: 1 | 2 | 3

Alonzo, Julie	2
Bashaw, W. L.	2
Booker, Kevin	2
Bruch, Julie	2
Gill, Brian	2
Irvin, P. Shawn	2
Lai, Cheng-Fei	2
Lenke, Joanne M.	2
Park, Bitnara Jasmine	2
Rentz, R. Robert	2
Tindal, Gerald	2
Amrein-Beardsley, Audrey	1
Barker, Pierce	1
Benson, Jeri	1
Budescu, David	1
Charters, Moire C.	1
Crocker, Linda	1
Dimitrov, Dimiter M.	1
Dizney, Henry	1
Ekstrom, Ruth B.	1
Gallagher, Carole	1
Geiger, Tray	1
Huang, Chun-Wei	1
Hutchings, Teresa	1
More ▼