ERIC - Search Results

Showing 1 to 15 of 16 results

Psychometry: Cutting-Off Points and Standardization of the Jefferson Empathy Scale Adapted for Students of Kinesiology

Peer reviewed

Reyes-Reyes, Alejandro; Calzadilla-Núñez, Aracelis; Torres-Martínez, Pilar; Díaz-Calzadilla, Patricia; Pastén-Hidalgo, Wilson; Bracho-Milic, Fanny; Díaz-Narváez, Víctor – SAGE Open, 2021

Currently, the most common measurement of empathy is obtained using scales that offer a continuum between a minimum and a maximum value. The objectives of this study were to establish a norm and estimate cut-off points that would make it possible to assess the Jefferson Scale of Empathy (JSE) version for Health Professions students (HPS-version),…

Descriptors: Attitude Measures, Empathy, Psychometrics, Cutting Scores

Development and Validation of the Systematic Assessment of Book Reading (SABR-2.2)

Peer reviewed

Pentimonti, Jill M.; Bowles, Ryan P.; Zucker, Tricia A.; Tambyraja, Sherine R.; Justice, Laura M. – Grantee Submission, 2021

Measuring the quality of classroom-based interactive shared book reading within the early childhood classroom represents a specific dimension of teacher-child interactions that is of great interest to researchers. This interest reflects decades of research demonstrating the benefit of reading to young children in both the home and the classroom.…

Descriptors: Standardized Tests, Test Construction, Construct Validity, Predictive Validity

Validation of a Standardized Multiple-Choice Multicultural Competence Test: Implications for Training, Assessment, and Practice

Peer reviewed

Gillem, Angela R.; Bartoli, Eleonora; Bertsch, Kristin N.; McCarthy, Maureen A.; Constant, Kerra; Marrero-Meisky, Sheila; Robbins, Steven J.; Bellamy, Scarlett – Journal of Multicultural Counseling and Development, 2016

The Multicultural Counseling and Psychotherapy Test (MCPT), a measure of multicultural counseling competence (MCC), was validated in 2 phases. In Phase 1, the authors administered 451 test items derived from multicultural guidelines in counseling and psychology to 32 multicultural experts and 30 nonexperts. In Phase 2, the authors administered the…

Descriptors: Counseling Techniques, Cultural Relevance, Counselor Qualifications, Expertise

Between Scylla and Charybdis: Reflections on and Problems Associated with the Evaluation of Teachers in an Era of Metrification

Peer reviewed

Berliner, David C. – Education Policy Analysis Archives, 2018

The Scylla and Charybdis in this discussion of teacher evaluation are standardized achievement test data on the one hand, and classroom observational systems on the other. These are the two most common methods used to judge teachers' competency. Both have serious flaws: the former primarily with validity, the latter primarily with reliability. At…

Descriptors: Teacher Evaluation, Evaluation Problems, Standardized Tests, Achievement Tests

The Cognitive Validity of Child English Language Tests: What Young Language Learners and Their Native-Speaking Peers Can Reveal

Peer reviewed

Winke, Paula; Lee, Shinhye; Ahn, Jieun Irene; Choi, Ina; Cui, Yaqiong; Yoon, Hyung-Jo – TESOL Quarterly: A Journal for Teachers of English to Speakers of Other Languages and of Standard English as a Second Dialect, 2018

This study investigated the cognitive validity of two child English language tests. Some teachers maintain that these types of tests may be cognitively invalid because native-English-speaking children would not do well on them (Winke, 2011). So the researchers had native speakers and learners of English aged 7 to 9 take sample versions of two…

Descriptors: Language Tests, English, English (Second Language), Second Language Learning

A Standardized Tool for Assessing the Quality of Classroom-Based Shared Reading: Systematic Assessment of Book Reading (SABR)

Peer reviewed

Pentimonti, Jill M.; Zucker, Tricia A.; Justice, Laura M.; Petscher, Yaacov; Piasta, Shayne B.; Kaderavek, Joan N. – Early Childhood Research Quarterly, 2012

Participation in shared-reading experiences is associated with children's language and literacy outcomes, yet few standardized assessments of shared-reading quality exist. The purpose of this study was to describe the psychometric characteristics of the Systematic Assessment of Book Reading (SABR), an observational tool designed to characterize…

Descriptors: Test Validity, Construct Validity, Interrater Reliability, Factor Structure

A "Conditional" Sense of Fairness in Assessment

Peer reviewed

Mislevy, Robert J.; Haertel, Geneva; Cheng, Britte H.; Ructtinger, Liliana; DeBarger, Angela; Murray, Elizabeth; Rose, David; Gravel, Jenna; Colker, Alexis M.; Rutstein, Daisy; Vendlinski, Terry – Educational Research and Evaluation, 2013

Standardizing aspects of assessments has long been recognized as a tactic to help make evaluations of examinees fair. It reduces variation in irrelevant aspects of testing procedures that could advantage some examinees and disadvantage others. However, recent attention to making assessment accessible to a more diverse population of students…

Descriptors: Testing Accommodations, Access to Education, Testing, Psychometrics

Item-Level and Construct Evaluation of Early Numeracy Curriculum-Based Measures

Peer reviewed

Lee, Young-Sun; Lembke, Erica; Moore, Douglas; Ginsburg, Herbert P.; Pappas, Sandra – Assessment for Effective Intervention, 2012

The present study examined the technical adequacy of curriculum-based measures (CBMs) of early numeracy. Six 1-min early mathematics tasks were administered to 137 kindergarten and first-grade students, along with an omnibus test of early mathematics. The CBM measures included Count Out Loud, Quantity Discrimination, Number Identification, Missing…

Descriptors: Numeracy, Curriculum Based Assessment, Mathematics Tests, Kindergarten

Reliability for the Greek Version of the "Test of Everyday Reasoning (TER)"

Peer reviewed

Malamitsa, Katerina; Kasoutas, Michael; Kokkotas, Panagiotis – Journal of Instructional Psychology, 2008

The core critical thinking skills, identified in "The Delphi Report" as essential elements for workplace and educational success, are targeted in a standardized 35 item multiple-choice assessment tool entitled the "Test of Everyday Reasoning (TER)" which is designed to provide a representation of a person's overall critical…

Descriptors: Critical Thinking, Thinking Skills, Greek, Test Reliability

Developing a Test of Pragmatics of Japanese as a Foreign Language

Itomitsu, Masayuki – ProQuest LLC, 2009

This dissertation reports development and validation studies of a Web-based standardized test of Japanese as a foreign language (JFL), designed to measure learners' off-line grammatical and pragmatic knowledge in multiple-choice format. Targeting Japanese majors in the U.S. universities and colleges, the test is designed to explore possible…

Descriptors: Sentences, Speech Acts, Grammar, Second Language Learning

Problems in Examining the Validity of the ACTFL Oral Proficiency Interview.

Peer reviewed

Bachman, Lyle F. – Studies in Second Language Acquisition, 1988

Discusses the problem of measuring the validity of interview ratings in the American Council on the Teaching of Foreign Languages (ACTFL) Oral Proficiency Interviews (OPI), proposes frameworks to distinguish abilities from testing methods, and considers factors affecting test performance. Suggestions for research and development on the ACTFL OPI…

Descriptors: Communicative Competence (Languages), Construct Validity, Content Validity, Interviews

Rasch Analysis of the Standardization Data of the Bayley Mental Scale of Infant Development.

Snyder, Scott; Sheehan, Robert – Diagnostique, 1992

Rasch calibration procedures were applied to item-response data for the 1,262 infants and toddlers comprising the standardization sample for the Mental Scale of the Bayley Scales of Infant Development. Analyses tend to confirm the psychometric integrity of the instrument. (Author)

Descriptors: Child Development, Cognitive Tests, Concurrent Validity, Construct Validity

A Long-Term Research Agenda for the Test of Written English.

Stansfield, Charles W.; Ross, Jacqueline – 1988

An overview of the research needed on the new Test of Written English (TWE), a section of the Test of English as a Foreign Language (TOEFL), looks at research needs in the areas of test validity, test reliability, topic development, and equating. Suggested topics for study include: the uniqueness of the construct measured by the test, in…

Descriptors: Construct Validity, English (Second Language), Essays, Language Tests

The Problem of Measuring SES on Educational Assessments

Merola, Stacey S. – Online Submission, 2005

In this article, we review some of the ways socioeconomic status has been measured on assessments and the issues associated with measuring SES of students, issues which are not limited to statistical concerns. We also present possible proxy measures that could be used as a means of potentially overcoming some of the problems with current measures…

Descriptors: Academic Achievement, Construct Validity, Test Reliability, Test Validity

Keylist Items for the Measurement of Verbal Aptitude. Research Report.

Ward, William C.; And Others – 1986

The keylist format (rather than the conventional multiple-choice format) for item presentation provides a machine-scorable surrogate for a truly free-response test. In this format, the examinee is required to think of an answer, look it up in a long ordered list, and enter its number on an answer sheet. The introduction of keylist items into…

Descriptors: Analogy, Aptitude Tests, Construct Validity, Correlation
