Ozen Kutanis, Rana; Tunc, Tulin; Tunc, Murat – Educational Sciences: Theory and Practice, 2011
This study aimed to explore whether a single-step examination is adequate for ranking medical graduates for specialty training in medicine, which is practically similar to doctoral training (PhD) in other disciplines. For this purpose, semi-structured interview-based qualitative research was carried out at a university medical…
Descriptors: Medical Education, Qualitative Research, Trainees, Competitive Selection
Porter, Stephen R.; Rumann, Corey; Pontius, Jason – New Directions for Institutional Research, 2011
Survey data are widely used in higher education for purposes such as assessment and strategic planning. One of the most common ways of using surveys has been to assess student learning outcomes by means of proxy questions on a survey, assuming that students who engage in specific behaviors (called engagement) have learned more during college than…
Descriptors: Institutional Research, Student Surveys, Outcomes of Education, Academic Achievement
Rogers, W. Todd; Lin, Jie; Rinaldi, Christia M. – Applied Measurement in Education, 2011
The evidence gathered in the present study supports the use of the simultaneous development of test items for different languages. The simultaneous approach used in the present study involved writing an item in one language (e.g., French) and, before moving to the development of a second item, translating the item into the second language (e.g.,…
Descriptors: Test Items, Item Analysis, Achievement Tests, French
Advantages of the Rasch Measurement Model in Analysing Educational Tests: An Applicator's Reflection
Tormakangas, Kari – Educational Research and Evaluation, 2011
Educational achievement is a very important issue for parents, teachers, and the government. An accurate measurement plays a very important role in evaluating achievement fairly, and, therefore, analysis methods have been developed considerably in recent years. Education based on long-time learning processes forms a fruitful base for item tests,…
Descriptors: Test Items, Item Analysis, Learning Processes, Item Response Theory
Mathers, Carrie; Oliva, Michelle – National Comprehensive Center for Teacher Quality, 2008
The purpose of this Research and Policy Brief is to provide state and local policymakers with a comprehensive understanding of the measures used in teacher evaluation--their strengths, limitations, and current use in policy and practice. This brief will underscore aspects of evaluation policies currently aligned with best practices as well as…
Descriptors: Teacher Evaluation, Student Evaluation, Test Reliability, Test Validity
Park, Sohee; McLean, Gary N.; Yang, Baiyin – Online Submission, 2008
With the increasing attention on managerial coaching as an effective leadership initiative in organizations, there have been increasing needs for reliable and valid tools to assess managers' coaching skills. This study reviewed and revised an existing instrument measuring coaching skills in organizations created by McLean, Yang, Kuo, Tolbert, and…
Descriptors: Leadership Effectiveness, Test Validity, Career Development, Interprofessional Relationship
Lee, Yi-Hsuan; Ip, Edward H.; Fuh, Cheng-Der – Educational and Psychological Measurement, 2008
Although computerized adaptive tests have enjoyed tremendous growth, solutions for important problems remain unavailable. One problem is the control of item exposure rate. Because adaptive algorithms are designed to select optimal items, they choose items with high discriminating power. Thus, these items are selected more often than others,…
Descriptors: Adaptive Testing, Computer Assisted Testing, Test Items, Test Validity
Korkmaz, Ozgen; Kaya, Sinan – Turkish Online Journal of Distance Education, 2012
The purpose of this study is to determine the online self-regulated learning levels of students by adapting the "Online Self-Regulated Learning Scale" designed by Barnard and his colleagues into Turkish. The present study, in addition to being a scale analysis, is at the same time a qualitative study. It is executed via scan model. Study group…
Descriptors: Foreign Countries, Educational Technology, Test Construction, Test Validity
Preckel, Franzis; Lipnevich, Anastasiya A.; Boehme, Katharina; Brandner, Lena; Georgi, Karsten; Konen, Tanja; Mursin, Katharina; Roberts, Richard D. – British Journal of Educational Psychology, 2013
Background: Chronotype refers to individuals' preference for morning or evening activities. Its two dimensions (morningness and eveningness) are related to a number of academic outcomes. Aims: The main goal of the study was to investigate the incremental validity of chronotype as a predictor of academic achievement after controlling for a number…
Descriptors: High School Students, Foreign Countries, Grade 9, Grade 10
Baly, Michael W. – ProQuest LLC, 2013
This dissertation is comprised of three manuscripts and presents a line of research aimed at improving the measurement of bullying in schools. The first manuscript investigated the impact of an educational video on self-reports of bullying. A sample of 1,283 middle school students in randomly assigned classrooms either watched or did not watch an…
Descriptors: Bullying, Measurement Techniques, Video Technology, Self Disclosure (Individuals)
ACT, Inc., 2013
This manual contains information about the American College Test (ACT) Plan® program. The principal focus of this manual is to document the Plan program's technical adequacy in light of its intended purposes. This manual supersedes the 2011 edition. The content of this manual responds to requirements of the testing industry as established in the…
Descriptors: College Entrance Examinations, Formative Evaluation, Evaluation Research, Test Bias
Serafini, Ellen Johnson – ProQuest LLC, 2013
This study examined the second language (L2) development of adult learners of Spanish at three levels of proficiency during and after a semester of instruction. A fundamental goal was to identify cognitive and psychosocial individual differences (IDs) that can explain between-learner variation over time in order to expand our understanding of the…
Descriptors: Second Language Learning, Spanish, Language Aptitude, Language Processing
Morgan, Deanna L. – National Center for Postsecondary Research, 2010
Cut scores are used in a variety of circumstances to aid in decision making through the establishment of a clear cut line between adjacent categories. Community colleges regularly use cut scores on placement tests to decide the appropriate course for each beginning student: the first college-level course or a developmental course, depending on…
Descriptors: Standard Setting (Scoring), Cutting Scores, Psychometrics, Best Practices
Hendrickson, Amy; Patterson, Brian; Ewing, Maureen – College Board, 2010
The psychometric considerations and challenges associated with including constructed response items on tests are discussed along with how these issues affect the form assembly specifications for mixed-format exams. Reliability and validity, security and fairness, pretesting, content and skills coverage, test length and timing, weights, statistical…
Descriptors: Multiple Choice Tests, Test Format, Test Construction, Test Validity
Abedi, Jamal – Stanford Center for Opportunity Policy in Education, 2010
Standardized achievement tests that are used for assessment and accountability purposes may not provide reliable and valid outcomes for English language learners (ELLs) because extraneous sources may confound the outcome of assessments for these students. Performance assessments, by contrast, may offer opportunities for these students to present a…
Descriptors: English Language Learners, Performance Based Assessment, Evaluation Methods, Student Evaluation

