Showing 1 to 15 of 17 results
Peer reviewed
Shun-Fu Hu; Amery D. Wu; Jake Stone – Journal of Educational Measurement, 2025
Scoring high-dimensional assessments (e.g., > 15 traits) can be a challenging task. This paper introduces the multilabel neural network (MNN) as a scoring method for high-dimensional assessments. Additionally, it demonstrates how MNN can score the same test responses to maximize different performance metrics, such as accuracy, recall, or…
Descriptors: Tests, Testing, Scores, Test Construction
Peer reviewed
Ng, Zi Jia; Willner, Cynthia J.; Mannweiler, Morgan D.; Hoffmann, Jessica D.; Bailey, Craig S.; Cipriano, Christina – Educational Psychology Review, 2022
Many emotion regulation assessments have been developed for research purposes, but few are frequently used in schools despite the rapid growth of social and emotional learning programs with an explicit focus on emotion regulation in schools. This systematic review provides an overview of emotion regulation assessments that have been utilized with…
Descriptors: Emotional Response, Self Control, Elementary School Students, Secondary School Students
Peer reviewed
Schmitz, Florian; Wilhelm, Oliver – Journal of Intelligence, 2019
Current taxonomies of intelligence comprise two factors of mental speed, clerical speed (Gs), and elementary cognitive speed (Gt). Both originated from different research traditions and are conceptualized as dissociable constructs in current taxonomies. However, previous research suggests that tasks of one category can be transferred into the…
Descriptors: Taxonomy, Intelligence Tests, Testing, Test Format
Peer reviewed
Briggs, Derek C. – Assessment in Education: Principles, Policy & Practice, 2017
In the United States, students have historically taken large-scale assessments for many different purposes. One purpose that is shared with many other countries is a desire to monitor aggregate trends in educational attainment in core subject domains such as literacy, mathematics, and science. In this commentary, the author examines testing,…
Descriptors: Educational Assessment, Learning Theories, Learning, Psychometrics
New Meridian Corporation, 2020
New Meridian Corporation has developed the "Quality Testing Standards and Criteria for Comparability Claims" (QTS) to provide guidance to states that are interested in including New Meridian content and would like to either keep reporting scores on the New Meridian Scale or use the New Meridian performance levels; that is, the state…
Descriptors: Testing, Standards, Comparative Analysis, Test Content
Peer reviewed
International Journal of Testing, 2019
These guidelines describe considerations relevant to the assessment of test takers in or across countries or regions that are linguistically or culturally diverse. The guidelines were developed by a committee of experts to help inform test developers, psychometricians, test users, and test administrators about fairness issues in support of the…
Descriptors: Test Bias, Student Diversity, Cultural Differences, Language Usage
Peer reviewed
Nichols, Paul D.; Williams, Natasha – Educational Measurement: Issues and Practice, 2009
This article has three goals. The first goal is to clarify the role that the consequences of test score use play in validity judgments by reviewing the role that modern writers on validity have ascribed for consequences in supporting validity judgments. The second goal is to summarize current views on who is responsible for collecting evidence of…
Descriptors: Tests, Test Validity, Scores, Data Collection
National Council on Measurement in Education, 2012
Testing and data integrity on statewide assessments is defined as the establishment of a comprehensive set of policies and procedures for: (1) the proper preparation of students; (2) the management and administration of the test(s) that will lead to accurate and appropriate reporting of assessment results; and (3) maintaining the security of…
Descriptors: State Programs, Integrity, Testing, Test Preparation
Peer reviewed
Shohamy, Elana – Current Issues in Language Planning, 2008
In the past decade, major attention has been given to the power of tests and to the pivotal roles tests play in societies in shaping the definitions of language, affecting learning and teaching, and maintaining and creating social class. Accordingly, the quality of tests is not judged merely by their psychometric traits but also in relation to…
Descriptors: Language Planning, Social Class, Testing, Language Tests
Camara, Wayne J. – 1988
Psychological testing has played a major role in the American Psychological Association (APA) because testing and assessment are important aspects of what psychologists do; tests assist psychologists in diagnosis and treatment. From its earliest years, APA has had one or more committees concerned with testing. The present Committee on…
Descriptors: Agency Role, Educational Policy, Psychological Testing, Psychometrics
Peer reviewed
van der Vleuten, C. P. M.; Swanson, David B. – Teaching and Learning in Medicine, 1990
Large scale studies of the psychometric characteristics of standardized-patient clinical examinations are reviewed, and recommendations for test improvement are offered, including (1) assessment of history taking, physical examination, and communication skills separately from diagnostic and management skills; (2) a mastery-testing framework for…
Descriptors: Educational Trends, Higher Education, Medical Education, Patients
Peer reviewed
Miller, George E. – Teaching and Learning in Medicine, 1990
A review (HE 528 450) of large-scale studies of the psychometric characteristics of standardized-patient clinical examinations is commended for its clarity, thoroughness, and relevance. Issues of reliability, validity, scoring, reporting, and use of the tests are discussed further. (MSE)
Descriptors: Educational Trends, Higher Education, Medical Education, Patients
Leclercq, Dieudonne – Evaluation in Education: An International Review Series, 1982
In a confidence weighting situation, the examinee is asked to indicate the correct answer, and how certain he or she is of the correctness of that answer. This paper reviews the bases for confidence marking, its validity and accuracy in evaluating students, and its use in research. (BW)
Descriptors: Confidence Testing, Educational Research, Measurement Techniques, Models
Peer reviewed
Woodburn, Jim; Sutcliffe, Nick – Assessment & Evaluation in Higher Education, 1996
The Objective Structured Clinical Examination (OSCE), initially developed for undergraduate medical education, has been adapted for assessment of clinical skills in podiatry students. A 12-month pilot study found the test had relatively low levels of reliability, high construct and criterion validity, and good stability of performance over time.…
Descriptors: Clinical Teaching (Health Professions), Higher Education, Medical Education, Podiatry
Figueroa, R. A. – Diagnostique, 1991
This article reviews literature asserting that legal mandates eliminating overrepresentation in special education classes may have hurt minority children and argues that such a position ignores the impact of bilingualism on psychometric test performance. The article proposes that psychometric tests be excluded from any aspect of decision making…
Descriptors: Bilingual Students, Bilingualism, Court Litigation, Decision Making