Showing 1 to 15 of 50 results
Peer reviewed
Liou, Gloria; Bonner, Cavan V.; Tay, Louis – International Journal of Testing, 2022
With the advent of big data and advances in technology, psychological assessments have become increasingly sophisticated and complex. Nevertheless, traditional psychometric issues concerning the validity, reliability, and measurement bias of such assessments remain fundamental in determining whether score inferences of human attributes are…
Descriptors: Psychometrics, Computer Assisted Testing, Adaptive Testing, Data
Flanagan, Agnes; Cormier, Damien C. – Communique, 2019
One of the areas subsumed under the data-based decision making and accountability practice identified in the National Association of School Psychologists' (NASP) "Model for Integrated School Psychological Services" is to collect information on psychological and educational variables to make decisions at a number of levels of service…
Descriptors: Test Bias, School Psychologists, Measurement, Data Collection
Center on Standards and Assessments Implementation, 2018
Reliability is a measure of consistency. It is the degree to which student results are the same when they take the same test on different occasions, when different scorers score the same item or task, and when different but equivalent tests are taken at the same time or at different times. Reliability is about making sure that different test forms…
Descriptors: Test Reliability, Test Validity, Student Evaluation, Test Bias
Shah, Harshini; Niland, Katherine; Kharsa, Miranda; Caronongan, Pia; Moiduddin, Emily – US Department of Health and Human Services, 2020
In 2017, the Office of Planning, Research, and Evaluation (OPRE) in the Administration for Children and Families (ACF) funded Mathematica to conduct the Infant and Toddler Teacher and Caregiver Competencies (ITTCC) project. The project aims to examine existing efforts across states, institutions of higher education, professional organizations, and…
Descriptors: Infants, Toddlers, Caregivers, Preschool Teachers
Moodie, Shannon; Daneri, Paula; Goldhagen, Samantha; Halle, Tamara; Green, Katie; LaMonte, Lauren – US Department of Health and Human Services, 2014
For children age birth to five, physical, cognitive, linguistic, and social-emotional growth and development occur at a rapid pace. While all children in this age range may not reach developmental milestones (e.g., smiling, saying first words, taking first steps) at the same time, development that does not happen within an expected timeframe can…
Descriptors: Young Children, Child Development, Screening Tests, Measurement Techniques
Bill & Melinda Gates Foundation, 2012
No one has a bigger stake in teaching effectiveness than students. Nor are there any better experts on how teaching is experienced by its intended beneficiaries. Only recently have many policymakers and practitioners come to recognize that--when asked the right questions, in the right ways--students can be an important source of information on the…
Descriptors: Student Surveys, Student Attitudes, Feedback (Response), Test Validity
Peer reviewed
Bush, Martin E. – Quality Assurance in Education: An International Perspective, 2006
Purpose: To provide educationalists with an understanding of the key quality issues relating to multiple-choice tests, and a set of guidelines for the quality assurance of such tests. Design/methodology/approach: The discussion of quality issues is structured to reflect the order in which those issues naturally arise. It covers the design of…
Descriptors: Multiple Choice Tests, Test Reliability, Educational Quality, Quality Control
Lembke, Erica S.; Stecker, Pamela M. – Center on Instruction, 2007
One of the best methods of formative assessment in academic areas and a method that exemplifies the characteristics of good measures is Curriculum-Based Measurement (CBM; Deno, 1985). Developed at the University of Minnesota in the early 1970's, CBM has been researched in academic areas including mathematics computation, concepts, and…
Descriptors: Curriculum Based Assessment, Formative Evaluation, Mathematics Education, Educational Research
Peer reviewed
Dyer, Henry S. – NASSP Bulletin, 1987
Reviews 12 studies to determine whether coaching improves student performance on the Scholastic Aptitude Test (SAT). While results are mixed, evidence suggests that coaching does not appreciably improve students' verbal or math scores. Factors such as test reliability and weighting SAT scores with high school records should be considered. Includes…
Descriptors: Aptitude Tests, Scores, Secondary Education, Standardized Tests
Peer reviewed
Kibblewhite, D. – Educational Studies, 1981
Describes a practical approach that teachers can use to check for test-item validity in test construction. The Kuder-Richardson Reliability Formula is used. Detailed instructions describe the procedure for evaluating items for difficulty and using statistical methods to determine test validity. (AM)
Descriptors: Elementary Secondary Education, Higher Education, Item Analysis, Statistical Analysis
Peer reviewed
Gross, Edward J.; And Others – Research in Developmental Disabilities, 1994
This study describes the development of the Active Treatment Client Rights checklist (ATCR), which was designed to facilitate the assessment, monitoring, and implementation of readily observable client active treatment services for adults with developmental disabilities. The ATCR was found to be highly reliable, valid, and useful in enhancing…
Descriptors: Adults, Check Lists, Developmental Disabilities, Evaluation Methods
Peer reviewed
Seibert, Jeffrey M.; And Others – Topics in Early Childhood Special Education, 1987
The paper describes the "Early Social Communication Scales" designed to assess social and communication skills typically acquired in the first 30 months of life. Stressed is the role of the tester as an interactive partner for the infant. Reliability data suggesting reliability over time with the same partner are presented. (Author/DB)
Descriptors: Behavior Rating Scales, Communication Skills, Evaluation Methods, Infants
Maguire, Thomas O.; And Others – 1983
A study was commissioned to develop and validate a test to assess the attitudes of Alberta students towards the world of work. A revised instrument was created that used 75 items grouped into 15 scales, of five items each, measuring perceptions about available opportunities. During the validation field trial the instrument was administered to 467…
Descriptors: Attitude Measures, Career Education, Foreign Countries, Secondary Education
Peer reviewed
Hoyle, John R. – Planning and Changing, 1987
Describes the Examination for Certification of Educators in Texas (ExCET) required of all Texans seeking certification as principals, supervisors, or superintendents. Discusses certain test controversies (including test development by an out-of-state firm), the quality of administrator training in Texas, test objectives, and the future of ExCET.…
Descriptors: Administrator Education, Administrator Qualifications, Administrators, Certification
Peer reviewed
Crocker, Linda – New Directions for Community Colleges, 1987
Examines reasons for using essay tests in the direct assessment of writing ability. Reviews the steps in developing a large-scale testing program; e.g., creating a pool of topics or prompts; developing scoring procedures; training raters; field-testing the system; scoring writing samples; assessing reliability; and assessing validity. (DMM)
Descriptors: Essay Tests, Postsecondary Education, Scoring, Test Construction