Teacher Evaluation Kit 

GLOSSARY S thru Z

Glossary Overview

[ COMPLETE GLOSSARY] [ A-D] [ E-L] [ M-R] [ Menu]

  • Sampling of Performance - the selection of an array of teaching performances and settings to be evaluated so that they are wide enough in range and large enough in numbers both to: (1) cover the scope of the performances addressed by the evaluation system in terms of representativeness and comprehensiveness, and (2) permit valid inferences about performances to be made. See   Error of Measurement.

  • Sanction - that which is given to encourage improvement in less than satisfactory performance or service, or as a penalty for poor performance. See  Reward.

  • Scale (Rating) - a series of numerical or descriptive ratings on a continuum used to assess or judge specific levels of performance. See  Behaviorally-Anchored Rating Scale, Rating.

  • Schedule - the designated dates and times for various activities related to the evaluation (e.g., when the classroom will be observed, when the portfolio is to be submitted, when the teacher will be interviewed). The schedule, which is part of the procedures, should also indicate the location of such activities and the people who will be involved. See  Frequency of Evaluation, Timeline.

  • Scope of Content - the extent of coverage, by an instrument or process, of all knowledge, skills, abilities, and behaviors to be measured, in terms of both breadth and depth of coverage.

  • Score (noun)  - the number of points earned on a measure or the degree of success on an assessment of teacher attributes or performance. Scores are usually expressed in numerical terms, but sometimes in descriptive terms or graphically. See  Estimate, Obtained Score.

  • Scorer - an assessor who summarizes the results of an assessment for use by an evaluator or decision maker. See  Assessor, Evaluator, Scoring.

  • Scoring - the process of determining the value of a performance on an indicator or criterion. See  Analytic Scoring, Holistic Scoring, Primary Trait Scoring, Score, Scorer.

  • Scoring Dimension - an attribute or facet of behavior or performance in a domain. Dimensions are usually determined through logical or statistical analysis, and sometimes are reported as part scores. See   Criterion-Domain.

  • Scoring Rubric - a set of rules, guidelines, or benchmarks at different levels of performance, or prescribed descriptors for use in quantifying measures of teacher attributes and performance. See  Analytic Scoring, Holistic Scoring, Primary Trait Scoring, Protocol.

  • Scripting - the writing of evidence and notes throughout an assessment activity (e.g., interview, classroom observation) about what is happening. Scripting does not include making judgments or interpreting the evidence and notes. Scripting may be continuous throughout the activity or intermittent at pre-specified time intervals (e.g., 3 minutes on, 2 minutes off). See  Evidence, Notes.

  • Secondary Standards - those standards that apply to the evaluation process and assessment methods rather than to the teachers being evaluated or to their performance levels, and for which it is desirable, but not crucial or important, that they be met or addressed before the assessments are administered and the evaluation process is implemented. See   Primary Standards, Standards (Legal), Standards (Professional), Standards (Technical).

  • Self-Assessment - the process of judging one's own teaching performance for the purpose of self-improvement. A teacher may use such techniques as self-viewing on a videotape, observing and modeling exemplary teachers, filling out self-rating forms, completing open-ended self-reports, keeping a log, compiling a portfolio, or using self-study handbooks and materials. See   Reflection, Self-Evaluation, Self-Study Materials.

  • Self-Evaluation - the process of reviewing one's own behavior and student learning outcomes for the purpose of monitoring and changing one's own teaching performance. See  Reflection, Self-Assessment, Student Learning Outcomes.

  • Self-Report Measures - those instruments in which teachers record their own recollections of events, feelings, judgments, and attitudes. See  Self-Evaluation.

  • Self-Study Materials - the programs designed for use by a teacher in assessing his/her own teaching behaviors. See  Self-Assessment.

  • Sensitivity - the awareness and understanding of other people's feelings, attitudes, social and cultural backgrounds, ethnic traditions and customs, languages, interests, rights, and needs.

  • Setting - the temporal and physical environment of an event or activity. See  Context (Teaching), Learning Environment.

  • Similar-to-Me Effect - a type of rater effect in which an assessor or an evaluator judges more favorably those people seen as similar to himself/herself. See   Rater Effect.

  • Simulation - an imitation of a typical job task or situation to assess how well a teacher might implement such a task or perform in an actual situation (e.g., asking a science teacher to prepare slides for biology, having a music teacher listen to a tape and show how he/she would conduct that piece of music using the musical score in front of him/her, asking a fourth-grade teacher to develop and present a learning activity on interpreting map symbols). See  Assessment Center, Surrogate Task, Task, Work Simulation.

  • Situational Specificity - the extent to which it is appropriate to use an assessment conducted in one setting or context for other settings or contexts. See  Context (Teaching), Generalizability, Transportability, Validity.

  • Skill - the ability to use knowledge in a practical manner. See  Ability, Capacity, Competency, Knowledge, Talent.

  • Specifications (Assessment) - a delineation of the major attributes of an assessment to be developed, including breadth and depth of content to be covered, level of difficulty, format of the assessment materials, supplies and equipment needed, level of complexity, administrative process (e.g., individual or group, location, timing), scoring procedures, and numbers and types of items and tasks.

  • Spontaneous Performance Assessment - a measure based on observing, without prompting or preannouncing, what a teacher does during non-assessment activities (e.g., the principal oversees one teacher helping another teacher prepare a plan for working with a difficult student).

  • Stakeholders - those individuals who have a substantial interest in teacher evaluation and in the quality of teaching. These include not only the teacher and the principal, but also other teachers, school and district staff, students, parents, school board members, future employers, taxpayers, and community members. See  Audience.

  • Standard - the level of performance on the criterion being assessed that is considered satisfactory in terms of the purpose of the evaluation. There are three major categories of standards, related to various purposes. (1) Developmental standards specify improvement levels to be attained and may be used for professional development and self-assessment. (2) Minimum standards designate the level below which performance is not acceptable and are used for such purposes as licensure and job assignments. (3) Desired performance standards reflect what is regarded as accomplished or effective teaching and typically are used for such purposes as promotions, awards, and certification. See   Criterion.

  • Standard Score - an indicator of the relative standing of a score within a normal distribution of scores, defined by its mean and standard deviation. By transforming raw scores to standard scores, the user can interpret intervals between any two score points in relation to a reference population. Z scores are a commonly used standard score transformation, providing a normal distribution with a mean of zero and a standard deviation of one. A teacher evaluation system may use several different tests or assessmnets. If it is important to compare a teacher's performance on the different assessments, then the scores from each test or assessment need to be standardized. Given that each of the assessments may have a different number of items or rating scale points and each may have been administered to different groups of teachers, standardizing the scores within each distribution becomes a necessity for purposes of comparing a teahcer's performance on the different assessments. By transforming the raw scores for each assessment into standard scores, it becomes possible to determine a teacher's relative strengths and weaknesses within the groups of teachers upon which the scores were standardized.
  • See
  •   Raw Score.

  • Standard Setting - the determination of the teaching performance level considered acceptable in terms of the purpose of the evaluation. Standards are usually determined using empirical or judgmental techniques, or a combination of these. See  Standard.

  • Standardization - the use of consistent procedures for administering, scoring, reviewing, interpreting, and reporting the results of the teacher evaluation.

  • Standardized Conditions - the administration of an assessment process or instrument to all teachers in the identical manner (e.g., same instructions and timing, comparable settings, use of trained assessors and evaluators). See   Irregularity.

  • Standards (Legal) - those guidelines and requirements related to an assessment and evaluation that are specified in the law, governmental policies and regulations, school district policies, and court decisions. See  Primary Standards, Secondary Standards.

  • Standards (Professional) - those guidelines related to an assessment and evaluation that are specified by the individuals and associations in the career area affected, directly or indirectly, by the assessment. See  Primary Standards, Secondary Standards.

  • Standards (Technical) - those guidelines related to an assessment and evaluation that are specified by psychometricians, statisticians, test publishers, and specialists in the domain covered by the assessment. See  Primary Standards, Secondary Standards.

  • Stringency - a type of rater effect in which an assessor or an evaluator tends to rate a teacher too low or to judge the performance level as poorer than it actually is. Sometimes stringency is referred to as "negative leniency." See  Leniency, Rater Effect.

  • Structured Performance Assessment - a measure based on the administration of an assessment instrument or task at a specified time and in a designated manner.

  • Student Learning Outcomes - the measures of student achievement of knowledge and skills and other educational outcomes such as improved student attitudes and behaviors that should have been taught to them by the teacher being evaluated. This term covers acquisition, retention, application, transfer, and adaptability of knowledge and skills. See  Outcome Variables, Teacher-Outcomes Evaluation.

  • Style-Based Evaluation - a teacher evaluation approach that uses criteria which specify in what way or how teachers are to perform their job (e.g., announced objectives at the start of each lesson, use of cooperative learning techniques) rather than what teachers are responsible for doing. See  Duty, Foundation, Responsibility.

  • Subjective (adjective)  - a characteristic of an assessment, observation, or conclusion that yields results which cannot be empirically verified by another person. See  High Inference, Judgment, Objective, Replicable, Verification.

  • Summary - a concise report encompassing the major results of an assessment or an evaluation of a teacher's performance.

  • Summative Teacher Evaluation - an evaluation conducted primarily for the purpose of making personnel decisions about the teacher (e.g., merit pay, reassignment, promotion, dismissal, tenure). Summative evaluation usually is done by an administrator rather than a supervisor or another teacher. See  Formative Teacher Evaluation.

  • Supervisor - the person responsible for overseeing the work of a teacher and for ensuring that the teacher performs his/her duties and professional responsibilities. See  Administrator.

  • Surrogate Task - a substitute or replacement for an actual activity or task, but one that is very similar to the actual task and that uses the same types of skills, behaviors, materials, and equipment. An example is asking a candidate for a teaching position to present a lesson to a group of ten students. See   Simulation, Task, Work Sample, Work Simulation.

  • Synthesis - the process of combining data and information from multiple sources, or of ratings and judgments on separate scoring dimensions in order to arrive at a conclusion or result. See   Analysis, Configural Scoring Rule, Review, Scoring Dimension, Weighted Score.

  • Systemic Validity - See   Validity.

  • Talent - an unusually high level of proficiency in performing a task or using a skill. Talent can be in the affective, cognitive, and psychomotor domains. See  Ability, Affective Domain, Cognitive Domain, Merit, Proficiency, Psychomotor Domain, Skill.

  • Task - an assessment activity or assignment to be completed by the teacher being evaluated. See  Simulation, Surrogate Task, Work Sample, Work Simulation.

  • Teachable - the practicality or feasibility of providing instruction on a topic, knowledge area, or skill in terms of the readiness of the learners, the expertise of the teachers, and the availability of resources (e.g., equipment, time).

  • Teacher Effectiveness - the attribute of a teacher who has the capability or potential of having a positive impact on student learning, behavior, and attitudes. See   Effective Teaching.

  • Teacher Evaluation - the systematic appraisal of a teacher's performance and/or qualifications in relation to the teacher's defined professional role and responsibities as well as to the school's and district's missions. See Assessment, Evaluation, Performance Evaluation. See  Assessment, Evaluation, Performance Evaluation.

  • Teacher Evaluation System - a complete approach to the evaluation of teachers including its purpose, the rules and regulations that apply, the target group to be evaluated, the domains to be covered, the procedures and methods to be employed, the instruments to be used, the persons to be involved, and the types of reports and feedback to be provided.

  • Teacher Improvement - the accomplishment of goals and objectives for professional development, growth in knowledge, acquisition of skills, and changes in practices. See  Plan of Assistance, Professional Development, Remediation.

  • Teacher Norms - the expectations of teacher behavior in a given context or setting that are usually learned during student teaching or in the first year of teaching in a new school. See  Induction.

  • Teacher-Outcomes Evaluation - an approach to evaluating teachers that is based on student performance, usually standardized achievement test results. See  Student Learning Outcomes.

  • Teachers' Rights - the privileges of teachers guaranteed by law and under their collective bargaining agreement. They include the rights to form and organize a teachers' association, to select their representatives, and to bargain collectively. See  Collective Bargaining Agreement, Collective Bargaining Unit.

  • Temporary Teacher - a teacher who is filling a position for a specified period of time, whether to fill a vacancy on a short-term basis (one year or less) or for an experimental teaching position with a designated ending date.

  • Tenure - an employment status conferred upon a teacher by state law or institutional regulation after successful completion of a probationary period. Tenure provides substantial, but not complete, protection against arbitrary or capricious dismissal and termination, and entitles the teacher to due process procedures and other protections that may not be available to the non-tenured teacher. See  Dismissal, Due Process, Permanent Teacher, Probationary Teacher, Tenured Teacher.

  • Tenured Teacher - a teacher who has completed a probationary period (usually two to five years) and is now considered a permanent employee of the school district with all applicable benefits and rights as specified in state law, district policy, and the collective bargaining agreement. See   Collective Bargaining Agreement, Permanent Teacher, Probationary Teacher, Tenure.

  • Termination - See Dismissal

  • Test (noun)  - an assessment instrument consisting of a sample of items or tasks from a particular domain and that can provide an estimate of performance in that domain. See  Instrument, Measure (noun), Sampling of Performance.

  • Test (verb)  - to administer an instrument or implement an assessment process. See  Assessment, Instrument, Measure (verb).

  • Test Score Pollution - an inflation of test scores that is the results of practices, usually associated with high-stakes testing, which are designed to increase test scores or performance ratings without improvement of actual performance on the attributes being assessed. See  Coaching, High-Stakes Testing.

  • Theory-Based Evaluation - a teacher evaluation approach that is based on certain theories of teaching or of learning. See  Foundation.

  • Timeline - a calendar or list of dates showing the evaluation stages and activities, and indicating the dates by which they should be implemented and be completed. See   Frequency of Evaluation.

  • Timeliness - coming at an opportune time, or providing information at a point when it can readily inform the teacher evaluation process.

  • Track Record - a summary of past events and accomplishments related to a teacher's performance (e.g., education and training completed, conferences attended, awards received by the teacher and/or students). See  Experience.

  • Training - the provision of instruction and planned activities to facilitate the learning of specific knowledge, skills, abilities, attitudes, and behaviors. See  Coaching, Instruction, Mentoring, On-the-Job Training.

  • Transferability - (1) the degree to which the knowledge and skills demonstrated in solving an assessment task can be used in solving other work-related tasks and real-world activities. (2) one of several characteristics used to evaluate assessments. See  Generalizability.

  • Transportability - the appropriateness of extending the use of a policy, instrument, assessment procedure, or evaluation system across different teachers, student groups, subject areas, instructional approaches, learning activities, school settings, states, etc. See   Generalizability, Situational Specificity, Validity Generalization Study.

  • Triangulation - the attempt to obtain more valid results by using multiple sources of data about one aspect of performance, multiple methods of collecting data, and/or multiple interpretations of the same data. See  Congruence Analysis, Corroborating Evidence, Multiple Measures, Validity, Verification.

  • True Score - a hypothetical score that represents an assessment result which is entirely free of error. Sometimes true score is thought of as the average score of an infinite series of assessments with the same or exactly equivalent instruments, but with no practice effect or change in the person being assessed across the series of assessments. See   Error of Measurement, Estimate, Obtained Score.

  • Trust - a common understanding of the purpose and potential of teacher evaluation, and a cooperative spirit between the teacher and the evaluator for maximizing the benefits of doing the evaluation. Trust is related to such factors as confidentiality of communication, careful consideration of the accuracy of evidence from such sources as hearsay or complaints, honesty, openness, sharing, and sincerity on the part of both the teacher and the evaluator. See  Credible.

  • Unintended Consequences - any unplanned or unanticipated outcomes that occur as a result of implementing an assessment or evaluation. For example, the use of student test scores as part of the teacher evaluation data results in test score pollution practices and more use of teaching-to-the-test activities accompanied by less reliance on the district curriculum guide for planning instructional activities and lessons. See   Consequences, Validity-Consequential Basis of.

  • USED - United States Department of Education, part of the executive branch of the United States Government.

  • Utility (of Teacher Evaluation System) - (1) the practical value of a teacher evaluation system with respect to such factors as time requirements, logistics, resources needed, costs and benefits, and district and state policies, as well as the technical concerns of validity and reliability. (2) one of four areas of standards in The Personnel Evaluation Standards: How To Assess Systems for Evaluating Educators  by the Joint Committee on Standards for Educational Evaluation. This area has five sets of standards: Constructive Orientation, Defined Uses, Evaluator Credibility, Functional Reporting, Follow-up and Impact. See  Accuracy, Administrative Feasibility, Feasibility, Propriety.

  • Validation - the process of determining the appropriateness, meaningfulness, and usefulness of a measure, an instrument, or an assessment process, and of the inferences made from the results of it. See   Validity, Validity Coefficient.

  • Validity - the extent to which the test scores or responses measure the attribute(s) that they were designed to measure. Several types of validity are described below. See  Generalizability, Validation, Validity Coefficient.

    • Concurrent - the relationship of one measure to another simultaneous measure or variable assessing the same or a related attribute.
    • Consequential Basis of Validity - the assemblage of information on the theoretical and value implications of the way that the results of testing are used, and the appraisal of both the potential and actual social consequences of the testing, including side effects.
    • Construct - the degree of fit of a measure and its interpretation with its underlying explanatory concepts, theoretical rationales, or foundations.
    • Content - (1) the appropriateness of the domain definition and the sampling of content. (2) the extent of congruence between the scope of a content area that an instrument or process claims to cover and what it actually does cover. Both definitions are aspects of construct validity.
    • Criterion-Related - the correlation or extent of agreement of the test score from an assessment with one or more external variables that measure the attribute being assessed.
    • Curricular - the extent to which the items on the assessment or test measure the content of a local curriculum, or the extent of agreement between the test coverage (topics, breadth and depth, skills, cognitive complexity) and the goals and objectives of the curriculum.
    • Evidential Basis of Validity - the assemblage of information about the construct validity of the test scores and measurements, as well as the relevance of the measurement to its applied purpose and its utility in an applied setting. Construct validity, in this case, includes information collected from content and criterion-related validation procedures.
    • Face - the perceived extent of acceptability or legitimacy of an instrument or process to teachers, administrators, policymakers, students, parents, the general public, and other stakeholders concerned with teacher evaluation and the quality of teaching.
    • Instructional - the degree to which the items on a test measure: (a) what is actually being taught, and (b) what the individuals being assessed have had an opportunity to learn.
    • Predictive - the relationship of a measure to performance in a future context such as a new work setting or to the results obtained on a future measure assessing a similar or a different (but presumably related) attribute.
    • Systemic - the negative and positive consequences of testing that should be monitored in order to evaluate the long-range value of the test.

  • Validity Coefficient - a measure of the degree of validity, usually expressed as the correlation between the measure in question and another measure or a variable. See   Correlation, Validation, Validity.

  • Validity Generalization Study - an investigation of the degree of generalizability or of transportability of a policy, instrument, process, procedure, or evaluation system. See   Generalizability, Transportability.

  • Value - an estimation or a measure of the merit and/or worth of a teacher in terms of the intrinsic qualities of the individual teacher (merit) and of the teacher's potential benefit to the school and its students (worth). See  Merit, Worth.

  • Value-Added - the change in an attribute or product that can be linked to an intervention. Examples are the change in student test scores before and after completion of a course or the change in the effectiveness of a teacher's classroom management skills as a result of participation in a workshop series on these skills.

  • Variable - a behavior, characteristic, or event that can change in value from one context to another or over time, or from one individual or group to another (e.g., gender, fluency in a second language, course units completed, test scores, years of teaching experience). See  Input Variables, Outcome Variables, Process Variable.

  • Verification - the process of checking the accuracy of data and information about the teacher's experience, training, performance, and other attributes. See   Accuracy, Audit, Congruence Analysis, Quality Check, Triangulation.

  • Weighted Score - a score adjusted by such factors as the importance of the attribute assessed to teaching performance, or the reliability and validity of the assessment from which the score was derived, or a combination of such factors. See  Composite Score, Configural Scoring Rule, Preponderance, Synthesis.

  • Wisdom of Practice - that which a teacher learns about teaching through direct experience and/or shared, detailed descriptions of teaching (e.g., case studies).

  • Work Sample - an assessment method that uses actual and typical on-the-job activities or tasks (e.g., asking a teacher to write a report to parents on a student's progress during the past semester). See  Surrogate Task, Task, Work Simulation.

  • Work Simulation - a surrogate or imitation of a work sample task (e.g., asking a teacher to list what features he/she would look for when grading student book reports and to provide the scoring rubric). See  Simulation, Task.

  • Worth - the system-related or extrinsic value of a teacher to the school and its students. For example, being able to speak Spanish could be of high worth in a school that does not have enough Spanish-speaking teachers, whereas being able to coach basketball may be of low worth to the school if there are already several teachers who can coach basketball. Knowing Spanish and being able to coach basketball still reflect teacher merit and have value, and may or may not have worth at another school or at another time in the same school. See  Merit, Value.

Return to the Menu

Glossary References

Agencies and Associations with resources related to teacher evaluation.

Jump to Top of Page