Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 1 |
Since 2016 (last 10 years) | 6 |
Since 2006 (last 20 years) | 13 |
Descriptor
Standardized Tests | 36 |
Statistical Analysis | 36 |
Test Reliability | 36 |
Test Validity | 13 |
Scores | 10 |
Correlation | 9 |
Achievement Tests | 8 |
Comparative Analysis | 7 |
Item Analysis | 7 |
Test Results | 7 |
Academic Achievement | 6 |
More ▼ |
Source
Author
Alonzo, Julie | 2 |
Bashaw, W. L. | 2 |
Booker, Kevin | 2 |
Bruch, Julie | 2 |
Gill, Brian | 2 |
Irvin, P. Shawn | 2 |
Lai, Cheng-Fei | 2 |
Lenke, Joanne M. | 2 |
Park, Bitnara Jasmine | 2 |
Rentz, R. Robert | 2 |
Tindal, Gerald | 2 |
More ▼ |
Publication Type
Education Level
Middle Schools | 6 |
Elementary Education | 5 |
Higher Education | 3 |
Elementary Secondary Education | 2 |
Grade 3 | 2 |
Grade 7 | 2 |
High Schools | 2 |
Intermediate Grades | 2 |
Postsecondary Education | 2 |
Early Childhood Education | 1 |
Grade 2 | 1 |
More ▼ |
Audience
Practitioners | 1 |
Location
Germany | 2 |
California | 1 |
Canada | 1 |
China | 1 |
Netherlands | 1 |
Texas (Houston) | 1 |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Meets WWC Standards without Reservations | 1 |
Meets WWC Standards with or without Reservations | 1 |
Merchant, Stefan; Rich, Jessica; Klinger, Don A. – Canadian Journal of Educational Administration and Policy, 2022
Both school and district administrators use the results of standardized, large-scale tests to inform decisions about the need for, or success of, educational programs and interventions. However, test results at the school level are subject to random fluctuations due to changes in cohort, test items, and other factors outside of the school's…
Descriptors: Standardized Tests, Foreign Countries, Generalizability Theory, Scores
Ssemakula, Mukasa E.; Liao, Gene Y.; Sawilowsky, Shlomo – American Journal of Engineering Education, 2018
There is a major trend in engineering education to provide students with realistic hands-on learning experiences. This paper reports on the results of work done to develop standardized test instruments to use for student learning outcomes assessment in an experiential hands-on manufacturing engineering and technology environment. The specific…
Descriptors: Test Construction, Psychometrics, Test Validity, Standardized Tests
Kane, Michael T. – Assessment in Education: Principles, Policy & Practice, 2017
In response to an argument by Baird, Andrich, Hopfenbeck and Stobart (2017), Michael Kane states that there needs to be a better fit between educational assessment and learning theory. In line with this goal, Kane will examine how psychometric constraints might be loosened by relaxing some psychometric "rules" in some assessment…
Descriptors: Educational Assessment, Psychometrics, Standards, Test Reliability
Liu, Xueman Lucy; de Villiers, Jill; Ning, Chunyan; Rolfhus, Eric; Hutchings, Teresa; Lee, Wendy; Jiang, Fan; Zhang, Yi Wen – Journal of Speech, Language, and Hearing Research, 2017
Purpose: With no existing gold standard for comparison, challenges arise for establishing the validity of a new standardized Mandarin language assessment normed in mainland China. Method: A new assessment, Diagnostic Receptive and Expressive Assessment of Mandarin (DREAM), was normed with a stratified sample of 969 children ages 2;6 (years;months)…
Descriptors: Mandarin Chinese, Correlation, Language Tests, Diagnostic Tests
Amrein-Beardsley, Audrey; Geiger, Tray – Phi Delta Kappan, 2017
Houston's experience with the Educational Value-Added Assessment System (R) (EVAAS) raises questions that other districts should consider before buying the software and using it for high-stakes decisions. Researchers found that teachers in Houston, all of whom were under the EVAAS gun, but who taught relatively more racial minority students,…
Descriptors: Value Added Models, School Districts, Computer Software, Educational Technology
König, Johannes; Lammerding, Sandra; Nold, Günter; Rohde, Andreas; Strauß, Sarah; Tachtsoglou, Sarantis – Journal of Teacher Education, 2016
Despite an increasing research interest in subject-specific teacher knowledge, the scientific understanding regarding teachers' professional knowledge for teaching English as a foreign language (TEFL) is very limited. This study therefore applies standardized tests to directly assess content knowledge (CK), pedagogical content knowledge (PCK), and…
Descriptors: Knowledge Base for Teaching, English (Second Language), Second Language Learning, Second Language Instruction
Südkamp, Anna; Pohl, Steffi; Weinert, Sabine – Frontline Learning Research, 2015
Including students with special educational needs in learning (SEN-L) is a challenge for large-scale assessments. In order to draw inferences with respect to students with SEN-L and to compare their scores to students in general education, one needs to assure that the measurement model is reliable and that the same construct is measured for…
Descriptors: Disabilities, Special Education, Inclusion, Competence
Lai, Cheng-Fei; Irvin, P. Shawn; Alonzo, Julie; Park, Bitnara Jasmine; Tindal, Gerald – Behavioral Research and Teaching, 2012
In this technical report, we present the results of a reliability study of the second-grade multiple choice reading comprehension measures available on the easyCBM learning system conducted in the spring of 2011. Analyses include split-half reliability, alternate form reliability, person and item reliability as derived from Rasch analysis,…
Descriptors: Reading Comprehension, Testing Programs, Statistical Analysis, Elementary School Students
Irvin, P. Shawn; Alonzo, Julie; Lai, Cheng-Fei; Park, Bitnara Jasmine; Tindal, Gerald – Behavioral Research and Teaching, 2012
In this technical report, we present the results of a reliability study of the seventh-grade multiple choice reading comprehension measures available on the easyCBM learning system conducted in the spring of 2011. Analyses include split-half reliability, alternate form reliability, person and item reliability as derived from Rasch analysis,…
Descriptors: Reading Comprehension, Testing Programs, Statistical Analysis, Grade 7
Gill, Brian; Bruch, Julie; Booker, Kevin – Regional Educational Laboratory Mid-Atlantic, 2013
States are increasingly interested in including measures of student achievement growth, or "value-
added," in evaluating teachers. Annual state assessments, however, which are the typical measure of student
growth, usually cover only reading and math teachers and only in grades 4-8. These state assessments thus cannot
…
Descriptors: Teacher Evaluation, Teacher Competencies, Evaluation Methods, Educational Testing
Gill, Brian; Bruch, Julie; Booker, Kevin – Regional Educational Laboratory Mid-Atlantic, 2013
States and school districts are exploring alternatives to state tests for measuring teachers' contributions to student learning. One approach applies statistical value-added methods to alternative student assessments such as commercially available tests and end-of course tests. The evidence suggests that these methods can reliably distinguish…
Descriptors: Teacher Evaluation, Teacher Competencies, Evaluation Methods, Educational Testing
Sato, Edynn; Rabinowitz, Stanley; Gallagher, Carole; Huang, Chun-Wei – National Center for Education Evaluation and Regional Assistance, 2010
This study examined the effect of linguistic modification on middle school students' ability to show what they know and can do on math assessments. REL West's study on middle school math assessment accommodations found that simplifying the language--or linguistic modification--on standardized math test items made it easier for English Language…
Descriptors: Test Items, Standardized Tests, Mathematics Tests, Testing Accommodations
Qaqish, Basil – Online Submission, 2007
ACT college test publisher provided scores. On average, non-homeschoolers performed better than homeschoolers, by about two items, out of sixty items, on the ACT mathematics test that was analyzed. This result may be due to the different teaching/learning media used in teaching each of the two groups, to different teacher/student interaction, or…
Descriptors: Home Schooling, College Entrance Examinations, Standardized Tests, Achievement Tests

Budescu, David – Journal of Educational Measurement, 1985
An important determinant of equating process efficiency is the correlation between the anchor test and components of each form. Use of some monotonic function of this correlation as a measure of equating efficiency is suggested. A model relating anchor test length and test reliability to this measure of efficiency is presented. (Author/DWH)
Descriptors: Correlation, Equated Scores, Mathematical Models, Standardized Tests
Lenke, Joanne M.; And Others – 1977
To investigate the effect of violating the assumption of equal item difficulty on Kuder-Richardson (KR) Formula 21 reliability coefficient, 670 eighth-and ninth- grade students were administered 26 short, homogeneous "tests" of mathematics concepts and skills. Both KR Formula 20 and KR Formula 21 were used to estimate reliability on each…
Descriptors: Comparative Analysis, Diagnostic Tests, Difficulty Level, Item Analysis