Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 2 |
Since 2016 (last 10 years) | 5 |
Since 2006 (last 20 years) | 10 |
Descriptor
Construct Validity | 16 |
Standardized Tests | 16 |
Test Reliability | 16 |
Test Validity | 9 |
Test Construction | 7 |
Psychometrics | 6 |
Factor Analysis | 5 |
Language Tests | 4 |
Correlation | 3 |
Difficulty Level | 3 |
Elementary School Students | 3 |
More ▼ |
Source
Author
Publication Type
Journal Articles | 11 |
Reports - Research | 10 |
Reports - Evaluative | 5 |
Opinion Papers | 2 |
Speeches/Meeting Papers | 2 |
Dissertations/Theses -… | 1 |
Education Level
Early Childhood Education | 3 |
Elementary Education | 3 |
Preschool Education | 3 |
Elementary Secondary Education | 2 |
Higher Education | 2 |
Kindergarten | 2 |
Primary Education | 2 |
Grade 1 | 1 |
Postsecondary Education | 1 |
Audience
Researchers | 1 |
Laws, Policies, & Programs
No Child Left Behind Act 2001 | 1 |
Assessments and Surveys
Bayley Scales of Infant… | 1 |
Test of Written English | 1 |
What Works Clearinghouse Rating
Reyes-Reyes, Alejandro; Calzadilla-Núñez, Aracelis; Torres-Martínez, Pilar; Díaz-Calzadilla, Patricia; Pastén-Hidalgo, Wilson; Bracho-Milic, Fanny; Díaz-Narváez, Víctor – SAGE Open, 2021
Currently, the most common measurement of empathy is obtained using scales that offer a continuum between a minimum and a maximum value. The objectives of this study were to establish a norm and estimate cut-off points that would make it possible to assess the Jefferson Scale of Empathy (JSE) version for Health Professions students (HPS-version),…
Descriptors: Attitude Measures, Empathy, Psychometrics, Cutting Scores
Pentimonti, Jill M.; Bowles, Ryan P.; Zucker, Tricia A.; Tambyraja, Sherine R.; Justice, Laura M. – Grantee Submission, 2021
Measuring the quality of classroom-based interactive shared book reading within the early childhood classroom represents a specific dimension of teacher-child interactions that is of great interest to researchers. This interest reflects decades of research demonstrating the benefit of reading to young children in both the home and the classroom.…
Descriptors: Standardized Tests, Test Construction, Construct Validity, Predictive Validity
Gillem, Angela R.; Bartoli, Eleonora; Bertsch, Kristin N.; McCarthy, Maureen A.; Constant, Kerra; Marrero-Meisky, Sheila; Robbins, Steven J.; Bellamy, Scarlett – Journal of Multicultural Counseling and Development, 2016
The Multicultural Counseling and Psychotherapy Test (MCPT), a measure of multicultural counseling competence (MCC), was validated in 2 phases. In Phase 1, the authors administered 451 test items derived from multicultural guidelines in counseling and psychology to 32 multicultural experts and 30 nonexperts. In Phase 2, the authors administered the…
Descriptors: Counseling Techniques, Cultural Relevance, Counselor Qualifications, Expertise
Berliner, David C. – Education Policy Analysis Archives, 2018
The Scylla and Charybdis in this discussion of teacher evaluation are standardized achievement test data on the one hand, and classroom observational systems on the other. These are the two most common methods used to judge teachers' competency. Both have serious flaws: the former primarily with validity, the latter primarily with reliability. At…
Descriptors: Teacher Evaluation, Evaluation Problems, Standardized Tests, Achievement Tests
Winke, Paula; Lee, Shinhye; Ahn, Jieun Irene; Choi, Ina; Cui, Yaqiong; Yoon, Hyung-Jo – TESOL Quarterly: A Journal for Teachers of English to Speakers of Other Languages and of Standard English as a Second Dialect, 2018
This study investigated the cognitive validity of two child English language tests. Some teachers maintain that these types of tests may be cognitively invalid because native-English-speaking children would not do well on them (Winke, 2011). So the researchers had native speakers and learners of English aged 7 to 9 take sample versions of two…
Descriptors: Language Tests, English, English (Second Language), Second Language Learning
Pentimonti, Jill M.; Zucker, Tricia A.; Justice, Laura M.; Petscher, Yaacov; Piasta, Shayne B.; Kaderavek, Joan N. – Early Childhood Research Quarterly, 2012
Participation in shared-reading experiences is associated with children's language and literacy outcomes, yet few standardized assessments of shared-reading quality exist. The purpose of this study was to describe the psychometric characteristics of the Systematic Assessment of Book Reading (SABR), an observational tool designed to characterize…
Descriptors: Test Validity, Construct Validity, Interrater Reliability, Factor Structure
Mislevy, Robert J.; Haertel, Geneva; Cheng, Britte H.; Ructtinger, Liliana; DeBarger, Angela; Murray, Elizabeth; Rose, David; Gravel, Jenna; Colker, Alexis M.; Rutstein, Daisy; Vendlinski, Terry – Educational Research and Evaluation, 2013
Standardizing aspects of assessments has long been recognized as a tactic to help make evaluations of examinees fair. It reduces variation in irrelevant aspects of testing procedures that could advantage some examinees and disadvantage others. However, recent attention to making assessment accessible to a more diverse population of students…
Descriptors: Testing Accommodations, Access to Education, Testing, Psychometrics
Lee, Young-Sun; Lembke, Erica; Moore, Douglas; Ginsburg, Herbert P.; Pappas, Sandra – Assessment for Effective Intervention, 2012
The present study examined the technical adequacy of curriculum-based measures (CBMs) of early numeracy. Six 1-min early mathematics tasks were administered to 137 kindergarten and first-grade students, along with an omnibus test of early mathematics. The CBM measures included Count Out Loud, Quantity Discrimination, Number Identification, Missing…
Descriptors: Numeracy, Curriculum Based Assessment, Mathematics Tests, Kindergarten
Malamitsa, Katerina; Kasoutas, Michael; Kokkotas, Panagiotis – Journal of Instructional Psychology, 2008
The core critical thinking skills, identified in "The Delphi Report" as essential elements for workplace and educational success, are targeted in a standardized 35 item multiple-choice assessment tool entitled the "Test of Everyday Reasoning (TER)" which is designed to provide a representation of a person's overall critical…
Descriptors: Critical Thinking, Thinking Skills, Greek, Test Reliability
Itomitsu, Masayuki – ProQuest LLC, 2009
This dissertation reports development and validation studies of a Web-based standardized test of Japanese as a foreign language (JFL), designed to measure learners' off-line grammatical and pragmatic knowledge in multiple-choice format. Targeting Japanese majors in the U.S. universities and colleges, the test is designed to explore possible…
Descriptors: Sentences, Speech Acts, Grammar, Second Language Learning

Bachman, Lyle F. – Studies in Second Language Acquisition, 1988
Discusses the problem of measuring the validity of interview ratings in the American Council on the Teaching of Foreign Languages (ACTFL) Oral Proficiency Interviews (OPI), proposes frameworks to distinguish abilities from testing methods, and considers factors affecting test performance. Suggestions for research and development on the ACTFL OPI…
Descriptors: Communicative Competence (Languages), Construct Validity, Content Validity, Interviews
Snyder, Scott; Sheehan, Robert – Diagnostique, 1992
Rasch calibration procedures were applied to item-response data for the 1,262 infants and toddlers comprising the standardization sample for the Mental Scale of the Bayley Scales of Infant Development. Analyses tend to confirm the psychometric integrity of the instrument. (Author)
Descriptors: Child Development, Cognitive Tests, Concurrent Validity, Construct Validity
Stansfield, Charles W.; Ross, Jacqueline – 1988
An overview of the research needed on the new Test of Written English (TWE), a section of the Test of English as a Foreign Language (TOEFL), looks at research needs in the areas of test validity, test reliability, topic development, and equating. Suggested topics for study include: the uniqueness of the construct measured by the test, in…
Descriptors: Construct Validity, English (Second Language), Essays, Language Tests
Merola, Stacey S. – Online Submission, 2005
In this article, we review some of the ways socioeconomic status has been measured on assessments and the issues associated with measuring SES of students, issues which are not limited to statistical concerns. We also present possible proxy measures that could be used as a means of potentially overcoming some of the problems with current measures…
Descriptors: Academic Achievement, Construct Validity, Test Reliability, Test Validity
Ward, William C.; And Others – 1986
The keylist format (rather than the conventional multiple-choice format) for item presentation provides a machine-scorable surrogate for a truly free-response test. In this format, the examinee is required to think of an answer, look it up in a long ordered list, and enter its number on an answer sheet. The introduction of keylist items into…
Descriptors: Analogy, Aptitude Tests, Construct Validity, Correlation
Previous Page | Next Page »
Pages: 1 | 2