Publication Date
| In 2026 | 7 |
| Since 2025 | 690 |
| Since 2022 (last 5 years) | 3191 |
| Since 2017 (last 10 years) | 7432 |
| Since 2007 (last 20 years) | 15070 |
Descriptor
| Test Reliability | 15055 |
| Test Validity | 10290 |
| Reliability | 9763 |
| Foreign Countries | 7150 |
| Test Construction | 4828 |
| Validity | 4192 |
| Measures (Individuals) | 3880 |
| Factor Analysis | 3826 |
| Psychometrics | 3532 |
| Interrater Reliability | 3126 |
| Correlation | 3040 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
| More ▼ | |
Location
| Turkey | 1329 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 253 |
| Taiwan | 234 |
| Netherlands | 224 |
| Spain | 218 |
| California | 215 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Katz, Irvin R.; Elliot, Norbert; Attali, Yigal; Scharf, Davida; Powers, Donald; Huey, Heather; Joshi, Kamal; Briller, Vladimir – ETS Research Report Series, 2008
This study presents an investigation of information literacy as defined by the ETS iSkills™ assessment and by the New Jersey Institute of Technology (NJIT) Information Literacy Scale (ILS). As two related but distinct measures, both iSkills and the ILS were used with undergraduate students at NJIT during the spring 2006 semester. Undergraduate…
Descriptors: Information Literacy, Information Skills, Skill Analysis, Case Studies
Zhuang, Xiaohua; MacCann, Carolyn; Wang, Lijuan; Liu, Lydia; Roberts, Richard D. – ETS Research Report Series, 2008
Various policy papers and research studies assert that teamwork is one of the most important skills for students to learn if they are to become meaningful contributors to the 21st century workforce. However, outside of organizational psychology and adult populations, few reliable assessments of this construct exist, with suitable validity evidence…
Descriptors: Teamwork, Cooperative Learning, Evaluation Methods, Student Evaluation
Garet, Michael S.; Cronen, Stephanie; Eaton, Marian; Kurki, Anja; Ludwig, Meredith; Jones, Wehmah; Uekawa, Kazuaki; Falk, Audrey; Bloom, Howard S.; Doolittle, Fred; Zhu, Pei; Sztejnberg, Laura – National Center for Education Evaluation and Regional Assistance, 2008
To help states and districts make informed decisions about the professional development (PD) they implement to improve reading instruction, the U.S. Department of Education commissioned the Early Reading PD Interventions Study to examine the impact of two research-based PD interventions for reading instruction: (1) a content-focused teacher…
Descriptors: Early Reading, Reading Instruction, Professional Development, Intervention
US Department of Education, 2008
This report presents the results of an audit by the Office of the Inspector General to determine whether the Department of Education's Office of Elementary and Secondary Education (OESE) provided sufficient oversight of graduation and dropout rates submitted by states in their Consolidated State Performance Reports to ensure the rates were…
Descriptors: Agencies, Federal Government, Audits (Verification), Inspection
Stone, Gregory Ethan; Beltyukova, Svetlana; Fox, Christine M. – International Journal of Testing, 2008
Judge-mediated examinations are defined as those for which expert evaluation (using rubrics) is required to determine correctness, completeness, and reasonability of test-taker responses. The use of multifaceted Rasch modeling has led to improvements in the reliability of scoring such examinations. The establishment of criterion-referenced…
Descriptors: Interrater Reliability, High Stakes Tests, Standard Setting, Minimum Competencies
Vlachopoulos, Symeon P.; Kaperoni, Maria; Moustaka, Frederiki C.; Anderson, Dean F. – Research Quarterly for Exercise and Sport, 2008
The present study reported on translating the Exercise Identity Scale (EIS: Anderson & Cychosz, 1994) into Greek and examining its psychometric properties and cross-cultural validity based on U.S. individuals' EIS responses. Using four samples comprising 33, 103, and 647 Greek individuals, including exercisers and nonexercisers, and a similar…
Descriptors: Test Reliability, Test Validity, Factor Structure, Measures (Individuals)
Moyer-Packenham, Patricia S.; Bolyard, Johnna J.; Kitsantas, Anastasia; Oh, Hana – Peabody Journal of Education, 2008
The purpose of this study was to examine the types of instruments being used to document mathematics and science teacher quality characteristics in 48 nationally funded mathematics and science education awards. Each of the 48 projects operationalized teacher quality and determined how to assess it. The main research questions examined the…
Descriptors: Teacher Effectiveness, Teacher Characteristics, Awards, Psychometrics
Sales, Jessica McDermott; Milhausen, Robin R.; Wingood, Gina M.; DiClemente, Ralph J.; Salazar, Laura F.; Crosby, Richard A. – Health Education & Behavior, 2008
This study reports on the validation of a scale to assess adolescent girls' frequency of sexual communication with their parents. The Parent-Adolescent Communication Scale (PACS) was administered to 522 African American female adolescents ranging in age from 14 to 18. The PACS demonstrated satisfactory internal consistency (across multiple…
Descriptors: Self Efficacy, Adolescents, Measures (Individuals), Sexuality
Feinberg, Mark E.; Gomez, Brendan J.; Puddy, Richard W.; Greenberg, Mark T. – Health Education & Behavior, 2008
Community coalitions (CCs) have labored with some difficulty to demonstrate empirical evidence of effectiveness in preventing a wide range of adolescent problem behaviors. Training and technical assistance (TA) have been identified as important elements in promoting improved functioning of CCs. A reliable, valid, and inexpensive method to assess…
Descriptors: Prevention, Construct Validity, Risk, Questionnaires
Strong, Gregory – Thought Currents in English Literature, 1995
This paper traces developments in educational psychology and measurement that led to the Test of English as a Foreign Language (TOEFL) and the test of English for International Communication (TOEIC) and the application of educational measurement terms such as validity and reliability to testing. Use of a table of specifications for planning…
Descriptors: Cloze Procedure, Difficulty Level, English (Second Language), Foreign Countries
Carlson, Sybil B.; And Others – 1985
Four writing samples were obtained from 638 foreign college applicants who represented three major foreign language groups (Arabic, Chinese, and Spanish), and from 60 native English speakers. All four were scored holistically, two were also scored for sentence-level and discourse-level skills, and some were scored by the Writer's Workbench…
Descriptors: Arabic, Chinese, College Entrance Examinations, Computer Software
Shiflett, Samuel; And Others – 1985
A study was undertaken to improve the measurement of small team performance within the Army. A provisional taxonomy of team-level performance functions was field-validated; criteria and measures of the functions were developed; and their reliability was examined. The provisional taxonomy, used for observing Army field training exercises, was used…
Descriptors: Behavior Rating Scales, Classification, Evaluation Criteria, Evaluators
Peer reviewedPolio, Charlene G. – Language Learning, 1997
Investigates the reliability of measures of linguistic accuracy in second language writing. The study uses a holistic scale, error-free T-units, and an error classification system on the essays of English-as-a-Second-Language students and discusses why disagreements arise within a rater and between raters. (24 references) (Author/CK)
Descriptors: College Students, English (Second Language), Error Analysis (Language), Error of Measurement
Almond, Russell G. – ETS Research Report Series, 2007
Over the course of instruction, instructors generally collect a great deal of information about each student. Integrating that information intelligently requires models for how a student's proficiency changes over time. Armed with such models, instructors can "filter" the data--more accurately estimate the student's current proficiency…
Descriptors: Markov Processes, Decision Making, Student Evaluation, Learning Processes
Lembke, Erica S.; Stecker, Pamela M. – Center on Instruction, 2007
One of the best methods of formative assessment in academic areas and a method that exemplifies the characteristics of good measures is Curriculum-Based Measurement (CBM; Deno, 1985). Developed at the University of Minnesota in the early 1970's, CBM has been researched in academic areas including mathematics computation, concepts, and…
Descriptors: Curriculum Based Assessment, Formative Evaluation, Mathematics Education, Educational Research

Direct link
