NotesFAQContact Us
Collection
Advanced
Search Tips
Publication Date
In 20250
Since 20240
Since 2021 (last 5 years)0
Since 2016 (last 10 years)0
Since 2006 (last 20 years)7
What Works Clearinghouse Rating
Showing 1 to 15 of 281 results Save | Export
Rix, Samantha – Journal on English Language Teaching, 2012
This paper examines the utilization of construct validity in formative assessment for classroom-based purposes. Construct validity pertains to the notion that interpretations are made by educators who analyze test scores during formative assessment. The purpose of this paper is to note the challenges that educators face when interpreting these…
Descriptors: Construct Validity, Formative Evaluation, Scores, Tests
Peer reviewed Peer reviewed
Direct linkDirect link
Geisinger, Kurt F. – International Journal of Testing, 2012
This article sets the stage for the description of a variety of approaches to test reviewing worldwide. It describes the importance of test reviewing as a protection of the public and of society and also the benefits of this activity for test users, who must choose measures to use in particular situations with particular clients at a particular…
Descriptors: Test Reviews, Evaluation Methods, Evaluation Criteria, Global Approach
Peer reviewed Peer reviewed
Direct linkDirect link
Cabrera, Nolan L.; Cabrera, George A. – Educational Horizons, 2011
Just like all the high-stakes tests that determine students' futures nowadays, The Chorizo Test is a standardized test rooted in the culture of the test makers. It was originally created to be used with students in teacher training programs to sensitize them to the pitfalls inherent in standardized pencil-and-paper tests, such as linguistic bias…
Descriptors: Test Use, Standardized Tests, Social Sciences, High Stakes Tests
Mathis, Frankie Eubanks – ProQuest LLC, 2012
With increased emphasis on accountability, the use of low-stakes test data to make high-stakes decisions about program effectiveness is on the rise. In order to make valid inferences about what students know and can do, it is crucial to understand the consequences of low and high stakes in testing contexts. As a result, with a sample comprised of…
Descriptors: High Stakes Tests, Academic Achievement, Program Effectiveness, Grade 11
Peer reviewed Peer reviewed
Direct linkDirect link
Buckendahl, Chad W.; Plake, Barbara S.; Davis, Susan L. – Applied Measurement in Education, 2009
The National Assessment of Educational Progress (NAEP) program is a series of periodic assessments administered nationally to samples of students and designed to measure different content areas. This article describes a multi-year study that focused on the breadth of the development, administration, maintenance, and renewal of the assessments in…
Descriptors: National Competency Tests, Audits (Verification), Testing Programs, Program Evaluation
Dorans, Neil J.; Liang, Longjuan; Puhan, Gautam – Educational Testing Service, 2010
Scores are the most visible and widely used products of a testing program. The choice of score scale has implications for test specifications, equating, and test reliability and validity, as well as for test interpretation. At the same time, the score scale should be viewed as infrastructure likely to require repair at some point. In this report…
Descriptors: Testing Programs, Standard Setting (Scoring), Test Interpretation, Certification
Torrance, E. Paul – Creative Child and Adult Quarterly, 1976
The author of the Torrance Tests of Creative Thinking reviews uses and abuses of creativity tests, distinctions between intelligence and creativity (including racial factors and heritability issues), and methods of using the tests (such as therapy for test phobia and as an introduction to further training experiences). (CL)
Descriptors: Creativity, Creativity Tests, Intelligence Tests, Test Interpretation
Peer reviewed Peer reviewed
Riedel, James A.; Dodson, Janet D. – Educational and Psychological Measurement, 1977
GURU is a computer program developed to analyze data generated by open-ended question techniques such as ECHO or other semistructured data collection techniques in which data are categorized. The program provides extensive descriptive statistics and allows extensive flexibility in comparing data. (Author/JKS)
Descriptors: Computer Programs, Data Analysis, Essay Tests, Test Interpretation
Criscuolo, Nicholas P. – NJEA Review, 1972
Article describes problems encountered by school districts over publication of student reading test scores, and lists ways to insure correct test result interpretation. (SP)
Descriptors: Reading Level, Reading Tests, Test Interpretation, Test Results
LEIBERT, ROBERT E. – 1967
A STUDY DESIGNED TO IDENTIFY SOME OF THE DIFFERENCES BETWEEN THE RESPONSES ON THE GATES ADVANCED PRIMARY READING TEST AND THE KINDS OF RESPONSES OBTAINED FROM AN INFORMAL READING INVENTORY (IRI) IS REPORTED. SUBJECTS WERE 65 THIRD-GRADE PUPILS IN WEST BABYLON, NEW YORK. PUPILS AT THE SAME INSTRUCTIONAL LEVEL SCORED HIGHER IN THE RECOGNITION TEST…
Descriptors: Informal Reading Inventories, Reading Tests, Standardized Tests, Test Interpretation
Peer reviewed Peer reviewed
Rentz, R. Robert; Bashaw, W. L. – Journal of Educational Measurement, 1977
This paper presents the characteristics, properties and development of the National Reference Scale for reading. This new scale is the result of a reanalysis of the Anchor Test Study data using Rasch model procedures, in an effort to produce equated scores among all reading tests included in that study. (Author/JKS)
Descriptors: Equated Scores, Measurement, Reading Tests, Test Interpretation
Rose, Harriet A.; Elton, Charles F. – Journal of College Student Personnel, 1971
The authors discuss the advisability of making orientation test batteries voluntary. Though students are opposed to compulsory testing, the authors argue that the information gained from them is important to the educational process. (CG)
Descriptors: Activism, Orientation, Personality Measures, Test Interpretation
Peer reviewed Peer reviewed
Green, Donald Ross; Trimble, C. Scott; Lewis, Daniel M. – Educational Measurement: Issues and Practice, 2003
Describes the procedures by which Kentucky's state assessment program synthesized results from three standard setting procedures (Contrasting Groups, Bookmark, and Jaeger-Mills) for the 2000 state assessment. Shows the value of using multiple standard-setting approaches to gather information from each. (SLD)
Descriptors: Achievement Tests, Standard Setting, State Programs, Synthesis
Rhode Island Department of Elementary and Secondary Education, 2007
This handbook will assist principals and school testing coordinators in implementing the spring 2007 administration of the Developmental Reading Assessment (DRA). Information regarding administration timeline, reporting, process, online tools and contact personnel is discussed. Contents include: (1) Scheduling; (2) Identify Primary Test…
Descriptors: Testing Accommodations, Alternative Assessment, Educational Testing, Guidance Programs
Pruett, Kathy E. – 1982
The booklet was designed to help each recipient of the California Achievement Test (CAT) reports to understand the general format of the reports, the abbreviations and symbols used, and the types of scores presented. It was also intended to assist in interpreting CAT results and using these results at each level of the educational process to best…
Descriptors: Educational Planning, Scoring, Test Interpretation, Test Manuals
Previous Page | Next Page ยป
Pages: 1  |  2  |  3  |  4  |  5  |  6  |  7  |  8  |  9  |  10  |  11  |  ...  |  19