Showing 1 to 15 of 22 results
Peer reviewed
Jessica B. Koslouski; Sandra M. Chafouleas; Amy Briesch; Jacqueline M. Caemmerer; Brittany Melo – School Mental Health, 2024
We are developing the Equitable Screening to Support Youth (ESSY) Whole Child Screener to address concerns prevalent in existing school-based screenings that impede goals to advance educational equity using universal screeners. Traditional assessment development does not include end users in the early development phases, instead relying on a…
Descriptors: Screening Tests, Psychometrics, Validity, Child Development
Peer reviewed
Jessica B. Koslouski; Sandra M. Chafouleas; Amy Briesch; Jacqueline M. Caemmerer; Brittany Melo – Grantee Submission, 2024
We are developing the Equitable Screening to Support Youth (ESSY) Whole Child Screener to address concerns prevalent in existing school-based screenings that impede goals to advance educational equity using universal screeners. Traditional assessment development does not include end users in the early development phases, instead relying on a…
Descriptors: Screening Tests, Usability, Decision Making, Validity
College Board, 2023
Over the past several years, content experts, psychometricians, and researchers have been hard at work developing, refining, and studying the digital SAT. The work is grounded in foundational best practices and advances in measurement and assessment design, with fairness for students informing all of the work done. This paper shares learnings from…
Descriptors: College Entrance Examinations, Psychometrics, Computer Assisted Testing, Best Practices
Peer reviewed
Galeoto, Giovanni; D'Elpidio, Giuliana; Alvaro, Rosaria; Zicari, Anna Maria; Valente, Donatella; Riccio, Marianna – International Association for Development of the Information Society, 2021
The Italian Disciplinary section of Test of Competences (TECO-D) project is an important longitudinal study used to analyze learning outcomes of ungraded students and to measure quality of the educational process. The aim of the present study was to evaluate the psychometric properties of the TECO-D in students enrolled in the Bachelor's Degree in…
Descriptors: Case Studies, Nursing Education, Psychometrics, Longitudinal Studies
New Meridian Corporation, 2020
New Meridian Corporation has developed the "Quality Testing Standards and Criteria for Comparability Claims" (QTS) to provide guidance to states that are interested in including New Meridian content and would like to either keep reporting scores on the New Meridian Scale or use the New Meridian performance levels; that is, the state…
Descriptors: Testing, Standards, Comparative Analysis, Test Content
Peer reviewed
International Journal of Testing, 2019
These guidelines describe considerations relevant to the assessment of test takers in or across countries or regions that are linguistically or culturally diverse. The guidelines were developed by a committee of experts to help inform test developers, psychometricians, test users, and test administrators about fairness issues in support of the…
Descriptors: Test Bias, Student Diversity, Cultural Differences, Language Usage
National Council on Measurement in Education, 2012
Testing and data integrity on statewide assessments is defined as the establishment of a comprehensive set of policies and procedures for: (1) the proper preparation of students; (2) the management and administration of the test(s) that will lead to accurate and appropriate reporting of assessment results; and (3) maintaining the security of…
Descriptors: State Programs, Integrity, Testing, Test Preparation
Herman, Joan L.; Osmundson, Ellen; Dietel, Ronald – Assessment and Accountability Comprehensive Center, 2010
This report describes the purposes of benchmark assessments and provides recommendations for selecting and using benchmark assessments--addressing validity, alignment, reliability, fairness and bias and accessibility, instructional sensitivity, utility, and reporting issues. We also present recommendations on building capacity to support schools'…
Descriptors: Multiple Choice Tests, Test Items, Benchmarking, Educational Assessment
Peer reviewed
Rudner, Lawrence M.; And Others – Applied Measurement in Education, 1996
An analysis of data from the 1990 National Assessment of Educational Progress Trial State Assessment suggests that person-fit statistics may not provide additional information about results of psychometrically strong achievement tests. More research is needed before person-fit statistics can be used routinely in analysis of item response data.…
Descriptors: Achievement Tests, Individual Differences, Item Response Theory, Psychometrics
Thompson, Bruce; And Others – 1997
This study was conducted to investigate the construct validity of scores on the Personal Preferences Self-Description Questionnaire (PPSDQ), a measure of Jungian types. Confirmatory factor analysis methods were used to investigate the structures underlying PPSDQ responses of 641 university students. The model fit statistics were generally…
Descriptors: College Students, Construct Validity, Goodness of Fit, Higher Education
Martinez, Michael E.; Katz, Irvin R. – 1992
Contrasts between constructed response items and stem-equivalent multiple-choice counterparts typically have involved averaging item characteristics, and this aggregation has masked differences in statistical properties at the item level. Moreover, even aggregated format differences have not been explained in terms of differential cognitive…
Descriptors: Architecture, Cognitive Processes, Construct Validity, Constructed Response
Burstein, Leigh – 1994
Issues in alternative assessment for accountability purposes are discussed. Most new forms of performance assessment are linked in the literature, but not all alternative forms of assessment have the same attributes in terms of technical and feasibility criteria. Tradeoffs in the validity of inferences that can be drawn from alternative…
Descriptors: Accountability, Alternative Assessment, Costs, Educational Assessment
Stansfield, Charles W.; And Others – 1992
The development of the Polish Proficiency Test, a standardized, nationally-normed test of listening and reading comprehension for English-speaking learners of Polish, is reported. An introductory chapter provides background information about the test's development, including discussion of the relationship between the test and Polish language…
Descriptors: Language Proficiency, Language Tests, Listening Comprehension, Polish
Peer reviewed
Patton, Wendy; Noller, Patricia – Journal of Youth and Adolescence, 1994
Reliability and validity of the Offer Self-Image Questionnaire (OSIQ) for adolescents were studied in an Australian sample of 72 male and 144 female high school students. The OSIQ was found to be reliable for appropriate research with adolescents. Further work is needed to improve item content and the present subscales. (SLD)
Descriptors: Adolescents, Factor Analysis, Factor Structure, Foreign Countries
North Carolina State Dept. of Public Instruction, Raleigh. Div. of Accountability/Testing. – 1996
The North Carolina End-of-Grade Testing Program is based on the assessment of higher level skills in the context of specific subject-area content. These tests inform students, parents, the community, and educators about the achievement of North Carolina students in grades three through eight in given areas. This report describes the development…
Descriptors: Achievement Tests, Elementary Education, Elementary School Students, Mathematics Tests