S5(R3) (MHLW Step 5)S5(R3) (ICH Step 4) | メディカリンガル株式会社

薬生薬審発0129第８号
令和３年１月29日

Adopted on 18 February 2020
This Guideline has been developed by the appropriate ICH Expert Working Group and has been subject to consultation by the regulatory parties, in accordance with the ICH Process. At Step 4 of the Process the final draft is recommended for adoption to the regulatory bodies of ICH regions.

薬品の生殖発生毒性評価に係るガイドラン

DETECTION OF REPRODUCTIVE AND DEVELOPMENTAL TOXICITY FOR HUMAN PHARMACEUTICALS

1. 緒言及び一般原則

本ガイドラインの目的は、医薬品の臨床試験及び製造販売承認申請に必要とされる生殖発生毒性の評価に関する国際標準を推奨し、調和を促進することである。本ガイドラインでは、リスクを特定、評価及び伝達する上で利用可能なデータを補完するためにとり得る戦略及び試験計画を記載する。さらに、試験データを解釈する際に検討すべき概念や推奨事項も提供する。
本ガイドラインは、1993年に発出されたICHガイドライン「S5 Detection of Toxicity to Reproduction for Medicinal Products」（医薬品の生殖発生毒性試験）の改定版である。本改定版では、他のICHガイドラインとの整合をとるとともに、用量設定における曝露マージンの利用について詳述し、リスク評価に関する項を設け、さらに適用範囲を拡大してワクチン及びバイオテクノロジー応用医薬品（以下、「バイオ医薬品」）も対象とする。また、代替試験法（以下、「代替法」）に関する適格性確認や使用可能なシナリオについて記載し、生殖発生毒性試験の延期に関する選択肢も提供する。医薬品の生殖発生への影響を評価するためには、一般的に、医薬品及び適切な場合はその代謝物（ICH M3 (1)、ICH S6 (2)）の曝露による生殖発生の全ステージへの潜在的影響に関する情報を利用することができる。どのようなガイドラインにおいても、起こりうるすべての事例をカバーするために十分な情報は提供できないため、試験戦略には柔軟性が必要である。

1. INTRODUCTION & GENERAL PRINCIPLES

The purpose of this document is to recommend international standards for, and promote harmonization of, the assessment of nonclinical developmental and reproductive toxicity (DART) testing required to support human clinical trials and marketing authorization for pharmaceuticals. The guideline describes potential strategies and study designs to supplement available data to identify, assess, and convey risk. General concepts and recommendations are also provided that should be considered when interpreting study data.
This is a revision of the ICH guideline “S5 Detection of Toxicity to Reproduction for Medicinal Products” that was originally published in 1993. This revision brings the guideline into alignment with other ICH guidelines, elaborates on the use of exposure margins in dose level selection, incorporates a section on risk assessment, and expands the scope to include vaccines and biopharmaceuticals. It also describes qualification of alternative assays, potential scenarios of use, and provides options for deferral of developmental toxicity studies.
To assess a human pharmaceutical’s effect on reproduction and development, there should generally be information available that addresses the potential impact of exposure to a pharmaceutical and, when appropriate, its metabolites (ICH M3 (1), ICH S6 (2)) on all stages of reproduction and development. No guideline can provide sufficient information to cover all possible cases, and flexibility in testing strategy is warranted.

1.1 試験の目的
生殖発生毒性試験の目的は、ヒトでのリスク評価に資する情報となる哺乳類の生殖発生に対する医薬品の影響を明らかにすることである。曝露による即時的及び遅発的な作用を検出するためには、必要に応じて、一連の試験で完全なライフサイクル（即ち、受精から次世代の受精までの期間）を通じて観察すべきである。一般的には以下の生殖発生ステージでの評価が実施される。

A) 交尾前～受精（成熟雌雄動物の生殖機能、配偶子の発生及び成熟、交尾行動、受精）
B) 受精～着床（成熟雌動物の生殖機能、着床前発生、着床）
C) 着床～硬口蓋閉鎖（成熟雌動物の生殖機能、胚発生、主要な器官の形成）
D) 硬口蓋閉鎖～妊娠終了（成熟雌動物の生殖機能、胎児の発生と成長、器官の発生と発達）
E) 出生～離乳（分娩と授乳、新生児の子宮外生存への適応、離乳前の発生と成長）
F) 離乳～性成熟（離乳後の発生と成長、自立生存への適応、性成熟の開始と完全な性機能の確立、次世代への影響）

対象集団に関連しない生殖発生ステージを除き、全てのステージにおけるリスクを評価すべきである。各試験でカバーする生殖発生ステージは申請者の判断に委ねられるが、医薬品開発における試験の実施時期については、対象集団や薬剤の開発段階に依存する（ICH M3、ICH S6及びICH S9(3)参照）。

1.1. Aim of Studies
The aim of DART studies is to reveal any effect of the pharmaceutical on mammalian reproduction relevant for human risk assessment. As appropriate, the set of studies conducted should encompass observations through one complete life cycle (i.e., from conception in one generation through conception in the following generation), and permit detection of immediate and latent adverse effects. The following stages of reproduction are generally assessed:
A) Premating to conception (adult male and female reproductive functions, development and maturation of gametes, mating behavior, fertilization).
B) Conception to implantation (adult female reproductive functions, preimplantation development, implantation).
C) Implantation to closure of the hard palate (adult female reproductive functions, embryonic development, major organ formation).
D) Closure of the hard palate to the end of pregnancy (adult female reproductive functions, fetal development and growth, organ development and growth).
E) Birth to weaning (parturition and lactation, neonate adaptation to extrauterine life, pre-weaning development and growth).
F) Weaning to sexual maturity (post-weaning development and growth, adaptation to independent life, onset of puberty and attainment of full sexual function, and effects on second generation).

The risks to all stages should be assessed, unless the stage is not relevant to the intended population. The stages covered in individual studies are left to the discretion of the Sponsor, although the timing of studies within the pharmaceutical development process is dependent on study populations and phase of pharmaceutical development (see ICH M3, ICH S6 and ICH S9 (3)).

2. ガイドラインの適用範囲

本ガイドラインは、バイオ医薬品、感染症ワクチン（及び、ワクチンに含まれる新規構成成分）を含むすべての医薬品、及び新添加剤に適用される。なお、本ガイドラインでは、「医薬品」という用語を、これらすべての治療モダリティを含むものとして使用する。本ガイドラインは、細胞加工製品及び遺伝子治療用製品には適用されない。本ガイドラインで概説する方法論の原則（試験計画、用量設定及び動物種選択など）は、生殖発生毒性試験の実施が適切なすべての化合物に適用される。生殖発生毒性試験の要否及び実施時期を検討するにあたっては、本ガイドラインとICH M3、ICH S6及びICH S9を参照すべきである。

2. SCOPE OF THE GUIDELINE

This guideline applies to all pharmaceuticals, including biopharmaceuticals, vaccines (and their novel constitutive ingredients) for infectious diseases, and novel excipients that are part of the final pharmaceutical product. For the purposes of this guideline, the term “pharmaceutical” is used to encompass all of these treatment modalities. This guideline does not apply to cellular therapies, gene therapies and tissue-engineered products. The methodological principles (e.g., study design, dose selection and species selection, etc.) outlined in this guideline apply to all compounds for which the conduct of reproductive and/or developmental toxicity studies is appropriate. This guideline should be read in conjunction with ICH M3, ICH S6, and ICH S9 regarding whether and when nonclinical DART studies are warranted.

3. 生殖毒性評価に関する一般的考慮事項

開発中のほとんどの医薬品については、いくつかの例外がありうるものの、上述した全ての生殖発生ステージを評価すべきである。臨床開発を進めるためには、一般的に以下の3種類のin vivo試験を用いて生殖発生ステージの評価が行われている：1）受胎能及び着床までの初期胚発生に関する試験（FEED試験）（ステージA～B）、2）2種の動物種を用いた胚・胎児発生に関する試験（EFD試験）（ステージC～D）、及び3）出生前及び出生後の発生並びに母体の機能に関する試験（PPND試験）（ステージC～F）。化合物ごとに評価する生殖発生ステージを決定し、実施すべき最も適切な試験を特定すべきである。生殖発生への影響を評価するにあたり、総合的に試験戦略を構築する上で考慮すべき重要な事項を以下に示す。

• 対象患者集団及び使用条件（特に生殖能力及び疾患の重篤度との関連性）
• 医薬品の剤型と臨床適用経路
• 毒性（in vitro、ex vivo及び非哺乳類を用いた試験、並びに構造活性相関も含む）、薬力学、薬物動態及び他の医薬品との薬理学的類似性に関連するデータ
• 医薬品の標的に関する生物学的特性や生殖発生における既知の役割

上記の考慮すべき事項については、本ガイドライン中で詳細に言及している。
総合的なリスク評価を損なわない範囲で、動物の使用を最小限に抑える試験戦略をとるべきである。そのアプローチとしては、一般的なデザインの試験を組み合わせた試験の実施（7項参照）や、適切に適格性を確認された代替法（附属書2参照）を用いたリスク評価がある。第Ⅲ相臨床試験前に開発が断念される医薬品も多いことから、ICH M3に示されるように、検討中の臨床試験をサポートする試験（妊娠可能な女性を組み入れるための胚・胎児発生毒性データなど）を適切な時期に実施することで、動物の使用を減らすことが可能である。

生殖発生毒性試験はリスク評価に利用されることから、原則としてGLPに従って実施すべきである。しかしながら、非GLP試験において適切な生殖発生毒性リスクが明らかになった場合には、当該知見を確認するためにGLP試験を繰り返す必要はない。適切なリスクとは、臨床での曝露量又はそれに近い曝露量で生じるものであり、疑いなくヒトへ外挿できる事象である（9項参照）。試験の種類や状況によっては、特別な試験系や試験方法を用いた試験については、必ずしもGLP下での実施が求められない場合がある。しかし、このような場合でも、科学的に質の高い水準を適用すべきであり、データの収集記録を容易に確認できるようにすべきである。また、GLPに準拠しない部分を試験報告書で特定するとともに、それによる試験の結果／データの解釈が安全性評価全体に及ぼす影響を考慮すべきである。

3. GENERAL CONSIDERATIONS ON REPRODUCTIVE TOXICITY ASSESSMENT

The majority of pharmaceuticals being developed should be assessed for all stages of the reproductive cycle identified above, although there can be some exceptions which should be justified, as indicated below. To support clinical development, these stages have typically been evaluated using three in vivo study types: 1) a fertility and early embryonic development study (FEED – stages A and B), 2) embryo-fetal development studies in two species (EFD – stages C and D), and 3) a pre- and a postnatal development study (PPND – stages C through F). For each compound, the stages that are to be evaluated should be determined and the most appropriate studies to conduct should be identified. Key factors to consider when developing an overall integrated testing strategy to evaluate effects on reproduction and development include:
• The targeted patient population and conditions of use (especially in relation to reproductive potential and severity of disease);
• The formulation of the pharmaceutical and route(s) of administration intended for humans;
• Relevant data on toxicity (which can also include data from in vitro, ex vivo and non-mammalian studies, and structure-activity relationships), pharmacodynamics, pharmacokinetics, and pharmacological similarity to other pharmaceuticals;
• Aspects of the general biology of the pharmaceutical target, or known roles of the target in reproduction or development.
These concepts are discussed in more detail throughout the guideline.
To the extent that it does not diminish the overall risk assessment, the experimental strategy should minimize the use of animals. Approaches towards this goal can include the conduct of studies that combine typical study types (see Section 7), as well as appropriately qualified alternative assays for risk assessment (see Annex 2). Since many clinical development programs are terminated prior to Phase 3, animal use can also be reduced by appropriately timing studies to support ongoing clinical development (e.g., embryo-fetal developmental toxicity data to support enrollment of women of childbearing potential) as per ICH M3.
DART studies should, in general, be conducted according to Good Laboratory Practice (GLP) regulations, as they will contribute to the risk assessment. However, if a relevant DART risk is identified in a non-GLP study, repetition of the study to confirm the finding(s) under GLP conditions is not necessarily warranted. A relevant risk is one that occurs at or near intended clinical exposures and is of a nature that is reasonably likely to translate to humans (see Section 9). It is recognized that GLP compliance is not expected for some study types, or aspects of some studies, employing specialized test systems or methods. However, high quality scientific standards should be applied with data collection records readily available. Areas of non-compliance should be identified within the study report and their impact on study results/data interpretation should be considered relative to the overall safety assessment.

3.1 対象患者集団／適応症に関する考慮事項
対象患者集団や適応症によって、生殖発生毒性試験の実施範囲が影響されることがある。対象集団において、生殖発生毒性が医薬品のリスク評価にほとんど影響を及ぼすことがないと考えられる疾患の場合、生殖発生の全ステージを評価する試験は必要ない。例えば、閉経後の女性のみの患者集団、小児や思春期前の若年集団、妊娠の可能性を排除することができる入院環境の患者集団を対象とする場合には、全ステージをカバーする試験は必ずしも必要ではない。

3.1. Target Patient Population/ Therapeutic Indication Considerations
The intended patient population or therapeutic indication can influence the extent of DART testing. Studies evaluating all stages of reproduction and development are not warranted if the disease indicates that DART will have minimal impact on the risk of the pharmaceutical in the target population. For example, studies covering all stages are not necessarily appropriate for an exclusively post-menopausal female patient population, for use in the pediatric or juvenile pre-pubescent population, or for patient populations in hospitalized settings where pregnancy can be excluded.

3.2 薬理学的考慮事項
試験戦略を検討する前に、まず医薬品の意図する薬理作用が、受胎能、正常な胚・胎児発生、あるいは特定のエンドポイントの評価に適さないこと（例えば、全身麻酔剤における交尾行動の評価など）を確認すべきである。当該評価においては、同様の薬理作用を持つ他の医薬品のデータ、標的に関する既知の作用、あるいはヒトの遺伝性疾患に関連する知見が根拠になりうる。例えば、早産を防ぐために開発されている医薬品では、PPND試験のデザインを修正することが適切となる。意図する薬理作用が試験のエンドポイントにおいて適切でない場合、その根拠を示した上で、特定の生殖に関するエンドポイントの評価は必要ではない。

3.2. Pharmacology Considerations
Before designing a testing strategy, it should be determined if the intended pharmacologic effects of a pharmaceutical are known to be incompatible with fertility, normal EFD, or assessment of particular endpoints (e.g., a general anesthetic and assessment of mating behavior). This assessment can be based on data with other pharmaceuticals with similar pharmacology, known effects of target engagement, or on knowledge of effects in humans with related genetic diseases. For example, it would be appropriate to modify the design of a PPND study for a pharmaceutical developed to prevent pre-term labor. If the intended pharmacologic effects are incompatible with the study endpoints, testing for a particular reproductive endpoint is not warranted, with justification.

3.3 毒性に関する考慮事項
性成熟に達した動物を用いた反復投与毒性試験では、生殖器毒性に関する重要な情報が得られる場合があり、生殖発生毒性試験のデザインに影響を及ぼす可能性がある。化合物に関する既存の毒性データを考察する際には、用量段階、トキシコキネティクスに関するプロファイル、投与期間を常に考慮すべきである。例えば、精巣に影響を与える化合物では、標準的な受胎能試験のデザインを改変して、投与期間や同居開始時期を変更することができる。

3.3. Toxicity Considerations
Repeated–dose toxicity studies with sexually mature animals can provide important information on toxicity to reproductive organs that can affect the design of a DART study. The existing toxicology data for the compound should always be considered, taking into account the dose levels, toxicokinetic profile, and dosing duration. For example, the standard fertility study design can be modified to alter the duration of dosing, or the start of cohabitation, for a compound that affects testicular tissue.

3.4 実施時期に関する考慮事項
生殖発生毒性試験の実施時期については、ICH M3、ICH S6及びICH S9に一般的なガイダンスが記載されている。特定の生殖発生毒性を評価する時期は、臨床試験又は対象患者集団において、当該医薬品を安全に使用するために、関連するデータが必要か否かを踏まえて検討すべきである。その結果、特定の生殖発生ステージへの影響を評価する時期を変更することが適切となる場合がある。その他の選択肢については4.2.2項及び4.2.3項で述べる。

3.4. Timing Considerations
General guidance on the timing for conduct of studies assessing reproductive and developmental endpoints is described in ICH M3, ICH S6, and ICH S9. The timing for when to conduct specific DART assessments should take into consideration the need for these data to support the safe use of the pharmaceutical in clinical trials or the intended patient population. Consequently, it can be appropriate to consider altering the timing of the assessment of specific reproductive stages. Additional options are discussed in Section 4.2.2 and 4.2.3.

3.5 トキシコキネティクス（TK）
曝露データは生殖発生毒性試験（用量設定試験又は本試験）、あるいは反復投与毒性試験のいずれかで得ることができる。しかしながら、妊娠によってTKパラメータに意味のある変化が生じる可能性があることから、妊娠によって曝露量が変化するかどうかを確認することが推奨される。用量設定が曝露量比に基づく場合（6.1.3項参照）、妊娠動物におけるTKデータはGLP下で得ることが望ましい。サンプリング時期については、その適切性を示すべきである。生殖発生ハザードに関する矛盾点や不確かな試験データを解釈する上で、胚又は胎児の薬物濃度に関する情報が有用な場合もある。その場合には、別試験で実際の曝露量を測定することも可能である。しかしながら、その結果をヒト胎児における推定薬物濃度と直接比較することは適切ではない。
乳汁移行の確認が必要な場合、乳汁又は離乳前の出生児における曝露量から確認することができる
TKデータの収集に関する一般的な考え方については、ICH S3A (4)に記載されている。

3.5. Toxicokinetics (TK)
Exposure data can be generated in either reproductive (dose range finding (DRF) or pivotal) or repeated-dose toxicity studies. However, given the potential for meaningful changes in TK parameters induced by pregnancy, it is recommended to determine if pregnancy alters exposure. If dose selection is based on exposure ratio (see section 6.1.3), GLP-compliant TK data in pregnant animals is expected. Sampling day(s) should be justified.
When warranted, determination of the pharmaceutical’s concentration in the embryo or fetus can facilitate interpretation of discordant or equivocal evidence of developmental hazard. This information can be collected in a separate study to determine the actual exposure. However, a direct comparison to the potential levels in the human conceptus is not appropriate.
Evidence of lactational excretion can be obtained, when warranted, by sampling milk or by demonstrating exposure in offspring during the pre-weaning period.
General concepts regarding TK data collection are discussed in ICH S3A (4).

4. 哺乳類を用いたin vivo試験のデザインと評価

医薬品の潜在的な生殖発生毒性リスクを評価するための戦略には、一般に、1種以上のin vivo試験が含まれる。一部の動物種（ヒト以外の霊長類（以下、「NHP」）など）では実施不可能であるが、全体としては、生殖発生の全ステージを網羅して評価することが重要である。ほとんどの医薬品では、通常、三試験計画法が適切となろうが、特定の製品ニーズに対応し、また使用動物数を削減するためには、これらの試験デザインを様々に組み合わせることも可能である。FEED試験、EFD試験及びPPND試験の詳細、又はそれらの組合せによる試験については附属書1を参照のこと。各試験でカバーする生殖発生ステージは申請者の判断に委ねられる。医薬品に関して入手し得るすべての薬理学的データ、トキシコキネティクス及び毒性学的データを考慮して、どの試験デザインを選択すべきか判断しなければならない。

4. DESIGN AND EVALUATION OF IN VIVO MAMMALIAN STUDIES

The strategy to evaluate the potential reproductive and developmental risk of a pharmaceutical generally includes one or more in vivo studies. The key factor is that, in total, they leave no gaps between stages and allow for evaluation of all stages of the reproductive process, although in some species (e.g., the non-human primate (NHP)) it is not possible to evaluate all stages. For most pharmaceuticals, the 3-study design will usually be appropriate, although various combinations of these study designs can be conducted to address specific product needs and to reduce animal use. Study details for the FEED, EFD, and PPND studies, and combinations thereof, can be found in Annex 1. The stages covered in individual studies are left to the discretion of the sponsor. All available pharmacological, toxicokinetic, and toxicological data for the pharmaceutical should be considered in determining which study design(s) should be used.

4.1 受胎能及び初期胚発生（FEED）に関する戦略
FEED試験の目的は、雄動物及び雌動物において交配前から交尾、着床に至るまでの投与による有害作用を検討することである。これは生殖発生ステージA～Bの評価となる。試験期間が短く、すべての有害作用を明らかにするには不十分な場合もあるが、少なくとも2週間の投与期間による反復投与毒性試験の成績を利用することで、用量設定試験を別途実施することなく、受胎能試験をデザインすることができる場合も多い。
対象集団への曝露を許容する上で、FEED試験が必要とされる場合には、ほとんどの場合、交配による評価が求められる。このような試験は一般的にげっ歯類を用いて行われる。受胎能に対する有害作用が予測されない場合、同一の試験で雌雄ともに投与を行い、交配させることができる。当該試験で受胎能に対する影響が認められた場合には、どちらの性が被験物質の投与により影響を受けたかを明らかにすべきである。一方、作用機序や反復投与試験の結果から有害作用が予測される場合は、片性の動物のみに投与し無処置の動物と交配させることができる。この試験は、1つのFEED試験の中で異なる投与群を用いても、2つの異なる試験で実施することも可能である。受胎能及び初期胚発生に対する有害作用の回復性は、リスク評価に重要な影響を与え得る。
げっ歯類を用いたFEED試験デザイン（附属書1参照）では、雌動物で性周期、卵管内輸送、着床、着床前段階の胚発生に及ぼす影響を検出することができる。性周期を評価する際には、被験物質投与に関連した影響と動物間／個体内のばらつきを区別するために、性周期のベースラインデータ（最低2～3サイクル）を取得することが重要である。性周期観察は交尾確認まで継続すべきである。
同居前2～4週間に投与されたげっ歯類を用いたFEED試験デザインでは、雄動物で精子形成及び精巣上体内精子輸送に対する影響を検出することができる。反復投与試験のデータから精巣に対する毒性が示唆される場合は、交配前の投与期間を10週間に延長することが適切な場合もある。これにより、精巣上体内精子輸送に加えて、すべての精子形成期間に対する影響が評価可能となる。FEED試験では、雄動物の生殖器における病理組織学的検査では検出されない機能的な影響（交尾行動、精巣上体内の精子成熟、射精など）も検出できる。
被験物質の作用機序や既存の試験データに基づく懸念がある場合には、受胎能に対する影響をより特徴づけるため、反復投与試験や受胎能試験で追加の検査（精子数及び形態／運動性評価のための精子の採取、ホルモンレベルの測定、性周期観察など）を実施することができる。

4.1.1 バイオ医薬品に関する考慮事項
げっ歯類又はウサギにおいて薬理学的活性を有するバイオ医薬品の場合は、これらの動物種のいずれかを用いたFEED試験が推奨される。通常イヌやNHPなどの非げっ歯類を用いて交配による評価を行うことは現実的ではない。例えば、NHPが薬理学的に適切な唯一の動物種である場合（多くのモノクローナル抗体の場合など、ICH S6参照）には、少なくとも3カ月間の反復投与毒性試験で得られた生殖器の病理組織学的検査を受胎能の評価の代わりとして使用することができる。この際には、雌雄両方の生殖器の包括的な病理組織学的検査を含めるべきである
（注1）。進行がんの治療を目的としたバイオ医薬品ではFEED試験は必要とされないが、それ以外のバイオ医薬品では、生殖器の適切な評価を行うため、動物は試験開始時に性成熟に達しているべきである。これらの病理組織学的検査データは、生殖組織の構造に関する情報を提供するだけで受胎能の機能的評価はできないことから、必ずしも病理組織学的評価の結果だけで受胎能及び初期胚発生に対する影響を予測することが可能とは限らない。

4.1. Strategy to Address Fertility and Early Embryonic Development (FEED)
The aim of the FEED study is to test for adverse effects resulting from treatment initiated prior to mating of males and/or females and continued through mating and implantation. This comprises evaluation of Stages A and B of the reproductive process. Results from repeated-dose toxicity studies of at least two weeks duration can often be used to design the fertility study without conducting further dose ranging studies, although studies of such short duration can be insufficient to reveal all adverse effects.
A mating phase is expected in most cases when a FEED study is warranted to support exposure of the target population. Such studies are typically performed in rodents. If no adverse effects on fertility are anticipated, both sexes can be treated and cohabited together in the same study. If effects on fertility are identified in the study, the affected sex should then be determined. In contrast, if adverse effects are anticipated based on mode of action or on the results of repeated-dose studies, each treated sex can be cohabited with untreated animals of the opposite sex. This can be achieved using separate treatment arms within a single study or by the conduct of two separate FEED studies. Reversibility of adverse effects on fertility and early embryonic development can have an important impact on risk assessment.
The FEED study design in female rodents (see Annex 1) allows for the detection of effects on the estrous cycle, tubal transport, implantation, and development of preimplantation stages of the embryo. When estrous/menstrual cycles are evaluated, it is important to obtain baseline cycle data (over 2 or 3 cycles minimum) to distinguish between treatment-related effects and inter/intra animal variability. The monitoring of estrous cyclicity should continue through the time of confirmation of mating.
The FEED study design for male rodents that includes 2 to 4 weeks of treatment prior to cohabitation allows for the detection of effects on spermatogenesis and epididymal transport. When data from repeated-dose studies suggest toxicity to the testis, it can be appropriate to extend the duration of pre-cohabitation treatment to 10 weeks; this permits assessment of effects on the full spermatogenic cycle as well as epididymal transport. The FEED study additionally permits detection of functional effects (e.g., on libido, epididymal sperm maturation, ejaculation) that cannot be detected by histological examinations of the male reproductive organs.
When there is cause for concern based on mode of action or data from previous studies, additional examinations can be included in repeated-dose toxicity and/or fertility studies (e.g., sperm collection for counts and morphology/motility assessments, measuring hormone levels, or monitoring of the estrous/menstrual cycle) to further characterize potential effects on fertility.

4.1.1. Considerations for Biopharmaceuticals
If the biopharmaceutical is pharmacologically active in rodents or rabbits, a FEED study in one of these species is recommended. Mating evaluations are not generally feasible in non-rodents such as dogs and NHPs. For example, if NHPs are the only pharmacologically relevant species (as for many monoclonal antibodies, see ICH S6), histopathological examinations of the reproductive tissues from the repeated-dose toxicity studies of at least three months duration can serve as a substitute for the fertility assessments. Such an approach should include a comprehensive histopathological examination of the reproductive organs from both male and female animals (Note 1). Unless the biopharmaceutical is intended to treat advanced cancer, in which case FEED studies are not warranted, animals should be sexually mature at study initiation in order for an adequate evaluation of the reproductive tissues to be made. These data would only provide information on the structure of the reproductive tissues, as no functional assessment of fertility can be made and predicting effects on fertility and early embryonic development is not always possible based solely on the results of histopathology assessments.

4.2 胚・胎児発生（EFD）に関する戦略
EFD試験の目的は、胎児器官形成期（ステージC）の妊娠雌動物に投与し、母動物及び胚・胎児の発生への有害作用を検出することである。EFD試験には、胎児の発生及び生存に関する評価が含まれる（ステージC～D）。
ほとんどの低分子化合物では、通常、EFDに対する影響の評価は2種の動物種（げっ歯類及び非げっ歯類［通常はウサギ］）を用いて実施される。試験動物種のうち少なくとも1種は、意図する薬力学的反応を示す動物種を用いるべきである。通常用いられる試験動物種（5.1項参照）のいずれにおいても、薬力学的に活性を示さない医薬品の場合には、通常用いられない動物種（5.2項参照）、遺伝子改変動物（5.3項参照）、種特異的なサロゲート分子（5.3項参照）（オリゴヌクレオチドの場合など）の使用を検討することができるが、その際には、薬理学的に適切なモデルの特性評価が十分に行われていることが前提となる。通常、遺伝子改変動物とサロゲート分子は、ハザードの特定には最も有用であるが、リスク評価に使用する場合は限界がある。適切なモデルがない場合、正常であれ病的状態であれ、薬理学的な標的がヒトにしか発現していない場合などであっても、オフターゲット作用や副次的薬理作用による毒性を検出するため、2種の動物種を用いたEFD試験を実施すべきである。
最大推奨臨床用量（Maximum Recommended Human Dose：MRHD）における推定臨床曝露量と同程度の曝露量で、形態異常や胚・胎児致死性（Malformations or Embryo-Fetal Lethality：MEFL）の誘発に関する明らかに陽性の結果が得られれば、開発している医薬品のリスク評価は当該動物種1種で十分と考えられる。
限定された条件下では、EFD本試験の代わりに他のアプローチを用いることもできる（附属書2参照）。あるいは、EFD試験を実施しなくても、リスクを伝えるにあたり適切な情報が得られる場合もある。意図する薬理作用によるEFDへの有害作用を示唆するエビデンス（作用機序、遺伝子改変動物の表現型など）があれば、リスクを伝えるには十分な可能性がある。

4.2.1 バイオ医薬品に関する考慮事項
バイオ医薬品のEFDへの影響は、2種の動物種（げっ歯類1種及び非げっ歯類1種）がいずれも薬理学的に適切であれば、通常、2種の動物種を用いて評価すべきである。しかしながら、げっ歯類は薬理学的に適切でない場合が多く、その場合には、薬理学的に適切な1種類の非げっ歯類のみを用いてEFDを評価することが可能である。適切な動物種が唯一NHPの場合には、EFD試験の代わりに、ePPND試験を実施することもできる。進行がんの治療を目的としたバイオ医薬品は、通常、薬理学的に適切な動物種1種を用いて評価するだけでよい（ICH S9参照）。生殖発生毒性試験に適したいずれの動物種を用いても、ヒトの標的分子と相同な配列を有する分子（オーソログ）にバイオ医薬品が相互作用せず、適切な動物種が特定できない場合には、ICH S6に記載されているように、サロゲート分子や遺伝子改変動物の使用を検討することが可能である。サロゲート分子を用いて臨床曝露量に対する安全域を算出することは適切ではない。適切な動物種、遺伝子改変動物又はサロゲート分子が利用できない場合には、in vivo生殖発生毒性試験の実施意義はない。その場合は、リスク評価に使用したアプローチ、又は試験を実施しないことの適切性を説明すべきである。

4.2.2 EFDリスクに対処するための代替アプローチ
4.2.2.1 代替法の利用
胚・胎児発生に対する潜在的ハザードを検出するために、in vitro、ex vivoや非哺乳類を用いたin vivoなどのいくつかの代替法が開発されている。これらの代替法はEFDに対する有害作用に関する創薬スクリーニングに使用され、毒性メカニズムの理解を深める一助となっており、（特にヒト特異的な標的について）非臨床データをヒトでのリスクに外挿する上で役立つ場合もある。これらの目的で代替法を継続的に利用することが推奨される。適格性が確認された場合、その代替法は、従来のin vivo試験の実施を延期又は（特定の状況において）代替する可能性がある。これには、使用動物数を削減できる可能性があるという更なる利点もある。代替法の適格性を確認する際に考慮すべき事項や、代替法の利用が適切なシナリオの例を附属書2に示す。代替法を取り入れたアプローチは、ヒトでの安全性を担保するにあたり、上述した現行試験の枠組みと比較して、少なくとも同等の信頼度を有するべきである。本文書作成時点での科学の進捗を考えると、規制当局の受入れを目的とする場合、段階的なアプローチや組合せによるアプローチの中で、複数の代替法が使用されることが想定される。各代替法の化学的な適用領域、及び代替法の対象となる生物学的メカニズムの特性評価によって使用の範囲が定められ、試験戦略の適格性（当局の受入れの可能性）は、各々の適用範囲内で判断される。

4.2.3 総合的試験戦略の一環としてin vivo本試験を延期することが可能なアプローチ

適切な試験戦略は、科学的根拠の重み付け（Weight-of-Evidence）の積み重ねにより成り立つ。
ICH M3では、2種の動物において予備的な胚・胎児発生（Preliminary EFD：pEFD）毒性データが得られている場合、EFD本試験を実施する前であっても、妊娠可能な女性（Women of Childbearing Potential：WOCBP）を限定的（最大150人のWOCBPを最長3カ月）に臨床試験に組み入れることが可能とされている。これらの考慮事項を踏まえ、本ガイドラインではICH M3を拡大し、第Ⅲ相臨床試験の前にWOCBPを臨床試験に組み入れることが許容されうる2つの追加的オプションを以下に記載する。
1) 1種の動物種における結果を予測する適格性が確認された代替法（附属書2参照）を、第二の動物種のpEFD試験データと組み合わせることで、WOCBPを限定的（最大150人のWOCBPを最長3カ月）に臨床試験に組み入れることを可能とする。その場合、通常、代替法と第二の動物種のpEFD試験データによって、げっ歯類と非げっ歯類の両方の動物種を評価されることになる。
2) 薬理学的に適切な動物種を用いてエンドポイントを追加し（特に、1群あたりの評価可能な同腹児数を増やし、胎児の骨格検査を含める）、GLP下で実施した少なくとも1つのpEFD試験が利用可能な場合、第二の動物種を用いたpEFD試験と組み合わせることにより、すべての地域において、第Ⅱ相までの臨床試験に組み入れるWOCBPの人数に制限を設けないことが可能となる。

4.2. Strategies to Address Embryo-Fetal Development (EFD)
The aim of the EFD studies is to detect adverse effects on the pregnant female and development of the embryo and fetus following treatment (Stage C) of the pregnant female during organogenesis. EFD studies include evaluation of fetal development and survival (Stages C through D).
For most small molecules, effects on EFD are typically evaluated in two species (i.e., rodent and non-rodent (typically rabbit)). At least one of the test species should exhibit the desired pharmacodynamic response. If the pharmaceutical is not pharmacodynamically active in any routinely used species (Section 5.1) then non-routine species (Section 5.2), genetically modified animals, or use of a species-specific surrogate molecule (Section 5.3) (e.g., in the case of oligonucleotides) can be considered, provided there is sufficient characterization of the model to ensure pharmacologic relevance. Genetically modified animals and surrogate molecules are generally most useful for hazard identification, but have limitations when used for risk assessment. Even when there are no relevant models (e.g., the pharmacological target only exists in humans, either normally or in the diseased state), EFD studies should be conducted in two species to detect the adversity of off-target effects or secondary pharmacology.
Clearly positive results for the induction of malformations or embryo-fetal lethality (MEFL), in a single species, at exposures similar to that at the projected clinical exposure at the maximum recommended human dose (MRHD) can be sufficient for risk assessment.
Under limited circumstances, other approaches can be used in place of definitive EFD studies (see Annex 2). Alternatively, there can be adequate information to communicate risk without conducting EFD studies. Evidence suggesting an adverse effect of the intended pharmacological mechanism on EFD (e.g., mechanism of action, phenotypic data from genetically modified animals) can be sufficient to communicate risk.

4.2.1. Considerations for Biopharmaceuticals
The effect of biopharmaceuticals on EFD should typically be assessed in two species (one rodent and one non-rodent) if both are pharmacologically relevant. However, the rodent is often not pharmacologically relevant, in which case EFD assessment in a single pharmacologically relevant non-rodent species can be conducted. In cases where the NHP is the only relevant species, an enhanced pre-and postnatal development (ePPND) study can be conducted instead of an EFD study. Biopharmaceuticals intended for the treatment of advanced cancer typically need only be assessed in a single pharmacologically relevant species (ICH S9).
When no relevant species can be identified because the biopharmaceutical does not interact with the orthologous target in any species relevant to reproductive toxicity testing, use of surrogate molecules or transgenic models can be considered, as described in ICH S6. Calculating safety margins relative to human exposures with surrogate molecules is not appropriate. If there are no relevant species, genetically modified animals or surrogates available, in vivo reproductive toxicity testing is not meaningful. In this case, the approach used for risk assessment, or rationale for not conducting studies, should be justified.

4.2.2. Alternative Approaches for Addressing EFD Risk
4.2.2.1. Use of Alternative Assays
A number of alternative in vitro, ex vivo, and non-mammalian in vivo assays (alternative assays) have been developed to detect potential hazards to embryo-fetal development. They have been used as drug discovery screens for adverse effects on EFD and have assisted in the understanding of the mechanism of toxicity, which can be useful for translating nonclinical data to human risk (especially for human-specific targets).
The continued use of alternative assays for these purposes is encouraged.
If properly qualified, alternative assays have the potential to defer or replace (in certain circumstances) conventional in vivo studies. This has the added benefit of potentially reducing animal use. Concepts to consider when qualifying these assays, and examples when the use of such assays could be appropriate, appear in Annex 2. Approaches that incorporate alternative assays should provide a level of confidence for human safety assurance at least equivalent to that provided by the current testing paradigms described above. Based on the direction of scientific development as of the writing of this document, it is expected that for regulatory purposes multiple alternative assays will be used within a tiered or battery approach. These testing strategies will be qualified within a certain context of use, which is defined by the chemical applicability domain of the assay, and by characterization of the biological mechanisms covered by the assay.

4.2.3. Potential Approaches to Defer Definitive In Vivo Testing as Part of an Integrated Testing Strategy
The design of an appropriate testing strategy relies on a cumulative weight-of-evidence approach. ICH M3 allows preliminary embryo-fetal developmental (pEFD) toxicity data from two species to support the limited inclusion of women of childbearing potential (WOCBP) (up to 150 WOCBP for up to 3 months) before conducting definitive EFD studies. Based on these considerations, this guideline expands on ICH M3 by allowing two additional options to support inclusion of WOCBP prior to Phase 3 clinical trials:
1) Qualified alternative assays which predict the outcome in one species (see Annex 2), can be combined with a pEFD from a second species to enable the limited inclusion of WOCBP (up to 150 WOCBP for up to 3 months). The alternative assay and the second species should generally cover both a rodent and a non-rodent species.
2) Additional endpoints incorporated into at least one GLP pEFD study (specifically increasing the group size of evaluable litters with inclusion of skeletal examinations) performed in a pharmacologically relevant species, if available, combined with a pEFD in a 2nd species allows all regions to include an unlimited number of WOCBP in clinical trials through Phase 2.

4.3 出生前及び出生後の発生並びに母体の機能（PPND）に関する戦略
PPND試験の目的は、着床から離乳までの間、母動物に曝露したときの有害作用を検出し、妊娠中あるいは授乳中の雌動物及び児の発生に対する影響を評価することである。この期間における有害作用は遅発性の場合があることから、児の発達は性成熟完了まで評価する（ステージC～F）。PPND試験には通常、げっ歯類が用いられるが、必要に応じて、他の動物種も利用可能である（附属書1参照）。ほとんどの場合、それまでに実施した他の試験結果から必要な情報が入手できることから、予備的（用量設定）PPND試験は必要とされない。しかしながら、出生児を離乳前あるいは離乳時まで観察する予備的PPND試験によって、用量設定、試験デザインのための情報、あるいは出生児の曝露データが得られることもある。
小児用医薬品の開発にあたり、改変されたPPNDやePPND試験デザインを検討している場合は、ICH S11(5)を参照すること。

4.3.1 バイオ医薬品に関する考慮事項
NHPのみで評価可能な医薬品に関しては、ePPND試験により限定的な出生後評価が可能であるが、出生児を成熟までの期間を通して評価することは現実的ではない（附属書1及びICH S6参照）。

4.3. Strategy to Address Effects on Pre- and Postnatal Development (PPND)
The aim of the PPND study is to detect adverse effects following exposure of the maternal animal from implantation through weaning to evaluate effects on the pregnant or lactating female and development of the offspring. Since manifestations of effects induced during this period can be delayed, development of the offspring is monitored through sexual maturity (i.e., Stages C to F). The rodent is usually used to assess PPND; however, other species can be used as appropriate (See Annex 1).
In most cases, a preliminary (dose range finding) PPND study is not warranted, because the appropriate information is generally available from prior studies. However, a preliminary PPND study with termination of the pups before or at weaning can be used to select dose levels or inform study design and/or to provide pup exposure data.
If a modified PPND/ePPND study design is being considered to support pediatric development, see ICH S11 (5).

4.3.1. Considerations for Biopharmaceuticals
For pharmaceuticals that can only be tested in the NHP, the ePPND study can provide a limited assessment of postnatal effects, but it is not generally feasible to follow the offspring through maturity (See Annex 1 and ICH S6).

5. 試験系の選択

5.1 通常の試験動物種
生殖発生毒性の検出には哺乳類を使用すべきである。先行して実施された毒性試験と同じ動物種・系統を使用することにより、追加の動物使用、薬物動態や代謝の特徴づけ、及び用量設定のための追加試験を避けることができる。使用する動物種は、特性が明確になっており、特定の試験のエンドポイント（健康状態、受胎能、繁殖能、形態異常及び胚・胎児死亡の自然発生率など）に対する影響を検出するのに適したものを選ぶべきである。

5.1.1 生殖発生毒性試験の動物種の選択
ラットは一般的に生殖発生毒性試験に適しており、実用的で薬理学的にもよく理解されていること、非臨床所見を解釈する上で広範な毒性データが利用可能であること、さらに背景データが多く存在することなどの理由により、げっ歯類で最もよく使用されている。同様の理由で、マウスもげっ歯類としてよく使用される。
EFD試験では、例外はあるものの（ワクチンやバイオ医薬品など、5.1.2項及び5.2項参照）、第二の動物種として、通常、非げっ歯類を用いた試験が行われる。ウサギは、げっ歯類では検出できなかったヒト催奇形性物質の特定に役立つことが明らかになっており、豊富な背景データ、動物の入手しやすさ及び実用性から、通常、非げっ歯類として使用される。

5.1.2 予防用及び治療用ワクチンのための動物種選択
ワクチンの非臨床試験に用いられる動物種は、（アジュバントの有無によらず）ワクチンに対して免疫反応を示さなければならない。実施する生殖発生毒性試験の種類及び動物種の選択については、観察される免疫反応と適切な投与量の投与可否に基づいて適切性が示されるべきである。通常、ワクチンの生殖発生毒性試験にはウサギ、ラット及びマウスが使用される。免疫反応には質的及び量的な種差（液性免疫及び細胞性免疫など）が存在する可能性があるが、通常、1種の動物種を用いた生殖発生毒性試験の実施で十分である。母体抗体の胎盤通過の程度及び経時的変化は動物種により異なるものの、ウサギ、ラット又はマウスを用いた生殖発生毒性試験を実施することで、ワクチン構成成分／製剤における潜在的な胚・胎児毒性や妊娠中の安全性に関する重要な情報が得られる。NHPの使用については、免疫反応を示す適切な動物種が他にない場合に限るべきである。
適切な動物モデル（NHPを含む）がない場合でも、ウサギ、ラット又はマウスを用いたEFD試験を実施することで、ワクチン構成成分／製剤における潜在的な胚・胎児毒性や妊娠中の安全性に関する重要な情報が得られる。

5.2 通常用いられない動物種
様々な生殖ステージに対する医薬品の影響を評価する上で、ラット、マウス及びウサギ以外の動物種を用いることができる。他の動物種の使用を検討する際には、被験物質、試験デザインと選択したエンドポイント、試験結果の臨床への外挿性の観点から、利点と欠点（附属書1、表1参照）を考慮すべきである。
NHPは通常用いられない試験動物種と考えるべきである。ICH S6に記載されているとおり、霊長類でのみ薬理学的活性を有するバイオ医薬品では、胚・胎児発生及び出生後早期の発達に対する影響の評価に通常、NHPが用いられる。ただし、NHPを用いて生殖発生毒性リスクを評価する際には、制約のあるエンドポイントがあることにも考慮する。（附属書1及びICH S6参照）。

5.3 病態モデル動物、遺伝子改変動物及びサロゲート分子の使用
意図する薬理作用が生殖発生に及ぼす影響を調べる際には、病態モデル動物、遺伝子改変動物及びサロゲート分子が有用な場合がある。病態モデル動物を用いた試験は、正常動物から得られたデータでは誤解を生じる場合や、病態モデル動物以外では臨床での病態生理に適用できない場合に有用である。当該モデルは、評価しようとする生殖発生エンドポイントに対して薬理学的に適切なものであるべきである。また、そのモデルにおける病態生理の経時的変化も明らかにすべきである。ヒトの病態生理との差異が認められる場合であっても、データの解釈に混乱を生じる懸念が高くなければ、その使用を除外するものではない。動物間のばらつきを明確にした上で、試験の背景を踏まえて、当該データを適切に取り扱うべきである。背景データが限られる場合は、データの解釈を裏付けるために、エンドポイントに関する文献等のデータを利用するか、試験実施期間中に当該情報を取得すべきである。
遺伝子改変動物は、生殖発生毒性パラメータに対する医薬品のオンターゲット作用に関する情報を得るために利用可能である。これらのモデルから、標的の生物学的特性が通常の試験動物種における生殖発生への有害作用と密接に関連するかどうかに関する情報が得られる。
通常の試験動物種において、医薬品が標的に対して十分な活性を示さない場合には、生殖発生への潜在的な有害作用を評価するために、サロゲート分子を用いることが可能である。

5. TEST SYSTEM SELECTION

5.1. Routine Test Species
Mammalian species should be used to detect DART. The use of the same species and strain as in already completed toxicity studies can eliminate the need to use additional animals or conduct additional studies to characterize pharmacokinetics and metabolism, and/or for dose range finding. The species used should be well-characterized and relevant for detecting effects on the endpoints in a particular study (e.g., with respect to health, fertility, fecundity, background rates of malformation and embryo-fetal death, etc.).

5.1.1. Selection of Species for DART Testing
The rat is generally appropriate for DART testing and is the most often used rodent species for reasons of practicality, general knowledge of pharmacology in this species, the extensive toxicology data usually available for interpretation of nonclinical observations and the large amount of historical background data. The mouse is also often used as the rodent species for many of the same reasons.
For assessment of EFD only, a second mammalian non-rodent species is typically evaluated, although there are exceptions (e.g., vaccines and biopharmaceuticals, see Sections 5.1.2 and 5.2, respectively). The rabbit has proven to be useful in identifying human teratogens that have not been detected in rodents and is routinely used as the non-rodent species based on the extensive historical background data, availability of animals, and practicality.

5.1.2. Species Selection for Preventative and Therapeutic Vaccines
The animal species selected for testing of vaccines (with or without adjuvants) should demonstrate an immune response to the vaccine. The type of developmental toxicity study conducted, and the choice of the animal model, should be justified based on the immune response observed and the ability to administer an appropriate dose. Typically, rabbits, rats, or mice are used in developmental toxicity studies for vaccines. Even though quantitative and qualitative differences can exist in the responses (e.g., in humoral and cellular endpoints) between species, it is usually sufficient to conduct developmental toxicity studies in a single species. Although the degree and time course of transfer of maternal antibodies across the placenta varies between species, a developmental toxicity study in rabbits, rats, or mice can still provide important information regarding potential embryo-fetal toxicity of the vaccine components/formulation and safety of the product during pregnancy. NHP should be used only if no other relevant animal species demonstrates an immune response.
When there is a lack of an appropriate animal model (including NHP), an EFD toxicity study in rabbits, rats, or mice can still provide important information regarding potential embryo-fetal toxicity of the vaccine components/formulation and safety of the product during pregnancy.

5.2. Non-routine Test Species
Species other than the rat, mouse or rabbit can be used to evaluate the effects of pharmaceuticals on various reproductive stages. When considering the use of other species, their advantages and disadvantages (summarized in Table 1 of Annex 1) should be considered in relation to the pharmaceutical being tested, the study design and selected endpoints, and the ability to extrapolate results to the human situation.
NHPs should be considered a non-routine test species. They are most typically used for evaluating effects on embryo-fetal development and early postnatal development for biopharmaceuticals that are only pharmacologically active in primates, as described in ICH S6. However, there are additional considerations that limit the utility of studies in NHPs for assessing some endpoints for DART risk assessment (see Annex 1 and ICH S6).

5.3. Use of Disease Models, Genetically Modified Models, and Surrogate Molecules
Animal models of disease, genetically modified models, and surrogate molecules can be valuable for investigating the effect of the intended pharmacology on development and reproduction. Studies in disease models can be of value in cases where the data obtained from healthy animals could be misleading or otherwise not apply to the disease conditions in the clinical setting. The model should be pharmacologically relevant and appropriate for the development and reproductive endpoints being assessed. The pathophysiology of the disease course in the model should be characterized. Some differences from the human pathophysiology would not preclude its use if these are unlikely to confound data interpretation. Animal-to-animal variability should be characterized and appropriate within the context of the study. If historical control information is limited, reference data for the study endpoints should be available or should be generated during the study to aid data interpretation.
Genetically modified models can be used to provide information about on-target effects of a pharmaceutical on DART parameters through permanent or conditional alterations in target activity. Such models can inform on whether the biology of the target is closely linked to adverse effects on reproduction and development in routine test species.
When the pharmaceutical does not have adequate activity against the target in the routine test species, surrogate molecules can be used to assess potential adverse effects on reproduction and development.

6. 用量設定、投与経路及び投与スケジュール

生殖発生毒性試験での用量、投与スケジュール及び投与経路は、試験デザインを検討する上で重要な事項であり、入手可能なあらゆる情報（薬理作用、反復投与毒性、薬物動態、用量設定試験など）に基づいて設定すべきである。低分子及びバイオ医薬品の用量設定の原則に関するガイダンスは、それぞれICH M3及びICH S6に示されている。試験系における忍容性に関する十分な情報が入手できない場合には、用量設定試験を実施することが望ましい。

6.1 用量設定
生殖発生毒性試験には、数多くの利用可能な用量設定指標がある。本項で検討するすべての用量設定指標は、試験デザインの観点からはいずれも適切と考えられる。本試験での高用量は、6.1.1項から6.1.5項に記載する設定根拠の1つ以上を満たすことが想定される用量とすべきである。用量設定は、それまでの試験（反復投与試験、TK試験、用量設定試験など）で観察された影響を考慮して行うべきである。リスク評価に必要な情報を得るにあたり、3用量より少ない用量段階でも十分な場合もある。
以下に述べる用量設定指標を用いない場合でも、高用量選択の適切性はケースバイケースで示すことが可能である。

6.1.1 毒性に基づく用量設定指標
毒性に基づく用量設定指標は、高用量群の親動物における、ごく軽度の毒性の発現に基づいて決定される。先行して実施された試験から決定される高用量の規定要因を以下に示すが、これらに限定されるものではない。

• 体重変化（変化量又は絶対値；増加・減少）。一過性のわずかな体重増加量又は体重の変化は、用量設定の根拠として適切ではない。体重変化の影響を評価する際には、試験における投与期間全体を考慮すべきである。
• 過剰な薬理作用（過度な鎮静や低血糖など）
• 毒性学的反応（けいれん、著しく高い胚・胎児死亡率、臨床病理学的な変動など）。計画した生殖発生毒性試験の評価に影響を及ぼす可能性がある特定の標的臓器毒性。

6.1.2 全身曝露の飽和に関する用量設定指標
投与薬物に関連する物質の全身性利用率（アベイラビリティ）を測定し、全身曝露の飽和によって高用量設定の適切性を示すことは可能である。用量を増加させたとしても、未変化体又は代謝物の血漿中濃度が上昇しない場合には、用量を増加する意義はない。

6.1.3 曝露マージンに基づく用量設定指標
MRHDにおける曝露量に対して予測される曝露マージンを示すことで、用量設定の適切性を示すことは可能である。低分子の場合、MRHDにおけるAUC又はCmaxを十分に上回る全身曝露が得られるのであれば、高用量設定において曝露量を用量設定指標とすることは可能である。妊娠動物における曝露量が、MRHDにおける曝露量の25倍を超えるのであれば、通常、生殖発生毒性試験における最大用量として適切である（注2）。25倍の曝露マージンは、GLPに準拠した用量設定試験／pEFD試験又は本試験で確定すべきである。通常、当該曝露マージンは未変化体に基づいて計算するが、ヒトでの主要な代謝物についても十分な曝露マージンを確保することを検討すべきである（ICH M3及び ICH M3 Q&A参照）。プロドラッグにおいて、特に未変化体に対する活性代謝物の曝露量の比率がヒトと比べて低い試験動物種の場合、活性代謝物に基づいて曝露マージンを確定することがより適切である。当該曝露量の比較においては、未変化体又は代謝物を選択した根拠を示すべきである。
MRHDにおける曝露量の25倍を超える曝露でのみ、試験動物種において薬力学的活性がみられる医薬品では、過度な薬理作用の有害作用を評価する目的で、より高用量での評価が求められる場合があるが、リスク評価をする上で不適切なオフターゲット作用が発現しやすい。曝露量に基づくエンドポイントをEFD試験における用量設定の根拠とする場合には、GLP試験における妊娠動物のTKデータが求められる。当該曝露量として、結合型と非結合型を合わせた曝露量と非結合型の曝露量のいずれを選択するかについては適切性を示すべきであり、ICH S3Aに概説されているとおり、非臨床開発プログラム全体との整合性がなければならない。

6.1.3.1 バイオ医薬品における曝露量に基づくアプローチ
ICH S6に示されている通り、曝露量に基づくマージンを示すことにより、バイオ医薬品の用量設定の適切性を示すことが可能である。一般に、非臨床試験に用いる動物種において意図する薬理作用が最大となる用量、あるいはMRHDでの曝露量の10倍程度の曝露が得られる用量のいずれか高い方とすべきである。標的結合親和性の種差及びその他の関連要因による用量調節についてはICH S6を参照のこと。

6.1.4 投与可能な最大用量（MFD）に基づく用量設定指標
投与経路／投与頻度、及び試験動物種の解剖学的／生理学的特性に関連した原薬（又は製剤）の物理化学的特性によって、投与可能な医薬品の用量が制限される場合には、MFDを高用量の設定に用いることが可能である。ICH M3 Q&A (1)に示されている通り、MFDを用いて高用量を設定する際には、投与量を最大にするよりも、試験動物種での曝露が最大となるよう設定すべきである。なお、1日あたりの可能な総曝露量を増やすために、投与頻度の変更を検討することもできる（6.3項参照）。

6.1.5 限界量に基づく用量設定指標
1 g/kg/日未満の用量段階で、高用量設定のための要素が満たされない場合、一般に限界量として1 g/kg/日を適用することができる（その他の考慮事項についてはICH M3参照）。

6.1.6 高用量以外の用量設定
生殖発生毒性に関しては、通常、無毒性量（NOAEL）を求めることが望ましい。高用量以外の用量設定では、曝露量、薬理作用及び毒性所見を考慮して、適切な場合、所見の用量反応が見られるようにすべきである。低用量はMRHDにおける曝露の数倍（1～5倍など）となる用量とすべきである。ヒトにおける治療域よりも低い曝露の用量を設定する場合は、その適切性を示すべきである。

6.2 投与経路
投与経路は、通常、ヒトでの予定適用経路とすべきである。ただし、ヒトでの予定適用経路で十分な曝露が得られない場合や、ヒトでの予定適用経路が使用できない場合は、別の経路を検討すべきである。ヒトで複数の投与経路が検討される場合には、すべての臨床適用経路での全身曝露に対して十分な全身曝露が得られ、かつ、代謝物が十分にカバーされるのであれば、当該動物種に関しては単一の投与経路で十分である。

6.3 投与スケジュール
毒性試験における投与スケジュールは、曝露プロファイルを決定するため、リスク評価において重要になる。多くの場合はヒトでの投与スケジュールと同様にすれば十分であるが、頻度を増減させることが適切な場合もある。例えば、試験動物種において速やかに代謝される化合物では1日2回投与が求められることがあるが、より高い頻度の投与スケジュールが想定される場合は、実際的な要素（試験の実行性、動物へのストレスなど）を考慮すべきである。また、試験で評価する生殖発生の重要なステージのすべてにおいて、十分な曝露が得られることを確実にするために、投与スケジュールを変更することも重要である。

6.4 ワクチンの用量設定及び試験デザイン
本ガイドラインは、感染症に対する予防又は治療に用いられるワクチン（アジュバントの有無を問わない）にも適用される。本ガイドラインの適用範囲には含まれないが、その他の適応症
（がんなど）に対するワクチンにおける非臨床試験にも、本ガイドラインの原則を適用可能である。予防又は治療用ワクチンの生殖発生毒性試験の種類は、ワクチンの対象集団及び関連する生殖発生リスクによって決まる。一般的に、新生児、思春期前の小児又は高齢者を対象に開発されているワクチンについては、生殖発生毒性試験は必要とされない。ワクチンの生殖発生毒性試験では、通常、臨床での投与経路を用いて、動物で免疫反応を惹起することができる単一用量で評価すれば十分である（5.1.2項）。この用量は、体重換算をしないヒトでの最大用量（すなわち、ヒトでの1回投与量＝動物での1回投与量）とすべきである。投与できる容量の限界又は用量制限毒性（局所又は全身）により、動物にヒトでの最大用量を投与することができない場合、体重換算（mg/kg）を用いて、ヒトの体重換算用量を上回る用量を選択することが可能である。ヒトでの1回投与量より少ない用量を用いる場合には、ヒトでの1回投与量を動物に投与できない根拠を示すべきである。
ワクチンの投与計画は、母動物での抗体価や胚・胎児及び出生後早期にわたる免疫反応を最大化すべきである。投与のタイミングや投与回数は、ワクチンごとの免疫反応発現までの時間又は継続時間によって決まる。妊娠中に投与するワクチンを開発する場合には、目的とする用途（妊婦や出生後早期の出生児の予防など）に基づいて、具体的な試験デザインの適切性を示すべきである。
連日投与では、ワクチンの構成成分の過剰な曝露になる場合がある。妊娠動物に対しては、連日投与よりも間歇投与が推奨される。また、間歇投与のほうが、感染症を適応とする多くの予防及び治療用ワクチンに関して、予定している臨床での免疫スケジュールに近いものとなる。通常の試験動物種における妊娠期間が短いことを考慮し、妊娠中の胎児に影響を及ぼしやすい時期（すなわち器官形成期）に十分な免疫反応が得られているよう、交配の数日前あるいは数週間前に動物に初回免疫投与を実施することが通常推奨される。投与計画はヒトで予定される接種スケジュールに従って変更できる。器官形成期の初期に少なくとも1回は投与を実施すべきである。これはワクチン製剤の構成成分による直接的な胚毒性への影響を評価するとともに、残りの妊娠期間を通じて高い抗体価を維持するためである。もし胚・胎児毒性が認められれば、所定の複数時点において投与される動物のサブグループを用いて、さらに評価してもよい。ワクチンに新規活性構成成分（新規アジュバントなど）が含まれる場合、非ワクチン製品の試験と同様な追加の評価方法を検討することが適切であろう。

6. DOSE LEVEL SELECTION, ROUTE OF ADMINISTRATION AND SCHEDULE

The choice of dose levels, schedule and route of administration are important study design considerations and should be based on all available information (e.g., pharmacology, repeated-dose toxicity, pharmacokinetics, and dose range finding studies). Guidance on the principles of dose selection for small molecules and biopharmaceuticals is given in ICH M3 and ICH S6, respectively. When sufficient information on tolerability in the test system is not available, dose range finding studies are advisable.

6.1. Dose Selection
There are a number of dose selection endpoints that can be used for DART studies. All endpoints discussed in this section are considered equally appropriate in terms of study design. The high dose in the definitive studies should be one that is predicted to comply with one or more of the concepts set forth in sections 6.1.1 to 6.1.5 below. The selected doses should take into account observations made in previous studies (e.g., repeated-dose, TK, DRF, etc.). There can be instances where fewer than three dose levels are sufficient to provide the necessary information for risk assessment.
Justification for high dose selection using endpoints other than those discussed below can be made on a case-by-case basis.

6.1.1. Toxicity–based Endpoint
This endpoint is based on inducing a minimal level of toxicity in the parental animals at the high dose. Factors limiting the high dose determined from previously conducted studies could include, but are not limited to:
• Alterations in body weight (gain or absolute; either reductions or increases). Minor, transient changes in body weight gain or body weight are not appropriate for dose selection. When assessing weight change effects, the entire dosing duration of the study should be considered.
• Exaggerated pharmacological responses (e.g., excessive sedation or hypoglycemia)
• Toxicological responses (e.g., convulsions, excessive embryo-fetal lethality, clinical pathology perturbations). Specific target organ toxicity that would interfere with the study endpoints within the duration of the planned DART study.

6.1.2. Saturation of Systemic Exposure Endpoint
High dose selection based on saturation of systemic exposure measured by systemic availability of pharmaceutical-related substances can be appropriate. There is little value in increasing the administered dose if it does not result in increased plasma concentration of parent or metabolites.

6.1.3. Exposure Margin Based Endpoint
It can be appropriate to select doses based on predicted exposure margins relative to the exposure at the MRHD. For small molecules, a systemic exposure representing a large multiple of the human AUC or Cmax at the MRHD can be an appropriate endpoint for high dose selection. Doses providing an exposure in pregnant animals > 25˗fold the exposure at the MRHD are generally considered appropriate as the maximum dose for DART studies (Note 2). The 25-fold exposure margin should be established in a GLP-compliant dose range finding/pEFD or definitive study. Usually this multiple should be determined based on parent drug levels; however, consideration should also be given to ensuring an adequate exposure margin to major human metabolites (see ICH M3 and ICH M3 Q&A). In the case of prodrugs, it can be more appropriate to establish the exposure multiple on the basis of the active metabolite, particularly if the test species has a lower ratio of active metabolite to prodrug, compared to humans. The basis for the moiety used for comparison (parent drug or metabolite) should be justified.
For pharmaceuticals that have demonstrated pharmacodynamic activity in the test species only at exposures > 25-fold that projected at the MRHD, higher doses can be warranted to assess adverse effects of exaggerated pharmacology. However, irrelevant off-target effects are more likely to be observed.
When exposure-based endpoints are used as the basis for selection of the dose levels for EFD studies, TK data from pregnant animals in a GLP-compliant study is expected. The choice for the use of total vs. fraction unbound pharmaceutical exposures should be justified and consistent with the entire nonclinical development program as outlined in ICH S3A.

6.1.3.1. Exposure-based Approach for Biopharmaceuticals
Exposure-based margins can be appropriate to select doses for biopharmaceuticals as per ICH S6. Generally, the dose should provide the maximum intended pharmacological effect in the preclinical species or provide an approximately 10-fold exposure multiple over the maximum exposure to be achieved in the clinic, whichever is higher. ICH S6 should be consulted with regard to dose adjustment for differences in target binding affinity and other relevant factors.

6.1.4. Maximum Feasible Dose (MFD) Endpoint
The MFD can be used for high dose selection when the physico-chemical properties of the pharmaceutical (or formulation) associated with the route/frequency of administration and the anatomical/physiological attributes of the test species limit the amount of the pharmaceutical that can be administered. Use of the MFD should maximize exposure in the test species, rather than maximize the administered dose, as per ICH M3 Q&A (1). Note that changes to the frequency of dose administration can be considered to increase the total feasible daily exposure (see Section 6.3).

6.1.5. Limit Dose Endpoint
A limit dose of 1 g/kg/day can generally be applied when other dose selection factors have not been attained with lower dose levels (see also ICH M3 for other considerations).

6.1.6. Selection of Lower Dose Levels
It is generally desirable to establish a no observed adverse effect level (NOAEL) for DART. The selection of lower dose levels should take into account exposure, pharmacology, and toxicity, such that the dose-response of findings can be established when appropriate. The low dose should generally provide a low multiple (e.g., 1 to 5-fold) of the human exposure at the MRHD. Dose levels that yield exposures that are sub-therapeutic in humans should be justified.

6.2. Route
In general, the route of administration should be the clinical route. If, however, sufficient exposure cannot be achieved using the clinical route or the clinical route is not feasible, a different route should be considered. When multiple routes of administration are being evaluated in humans, a single route in the test species can be adequate provided that sufficient systemic exposure is achieved compared to that of all clinical routes and that there is adequate coverage for the metabolites.

6.3. Schedule
Dosing schedules used in the toxicity studies determine the exposure profile, which can be important in the risk assessment. Although mimicking the clinical schedule is often sufficient, a more or a less frequent schedule can be appropriate. For example, twice daily dosing can be warranted with compounds that are quickly metabolized in the test species, although pragmatic factors (e.g., study logistics, stress on animals) should be considered when a more frequent schedule is contemplated. It can also be important to alter the dosing schedule to ensure that adequate exposure is obtained at all critical stages of reproduction and/or development being evaluated in a given study.

6.4. Dose Selection and Study Designs for Vaccines
This guideline covers vaccines (adjuvanted or not) used in both preventative and therapeutic indications against infectious diseases. While not within the scope of this guideline, the principles outlined can be applicable to the nonclinical testing of vaccines for other indications as well (e.g., cancer).
The types of reproductive and/or developmental toxicity studies used for preventative and therapeutic vaccines depend on the target population for the vaccine and the relevant reproductive risk. Generally, DART studies are not warranted for vaccines being developed for neonates, pre-pubertal children, or geriatric populations.
For reproductive toxicity studies of vaccines, it is typically sufficient to assess a single dose level capable of eliciting an immune response in the animal model (Section 5.1.2), using the clinical route of administration. This single dose level should be the maximum human dose without correcting for body weight (i.e., 1 human dose = 1 animal dose). If it is not feasible to administer the maximum human dose to the animal because of a limitation in total volume that can be administered, or because of dose-limiting toxicity, whether local or systemic, a dose that exceeds the human dose on a mg/kg basis can be used. To use a reduced dose, justification as to why a full human dose cannot be used in an animal model should be provided.
The vaccination regimen should maximize maternal antibody titers and/or immune response throughout the embryonic, fetal, and early postnatal periods. Timing and number of doses will depend on the onset and duration of the immune response of the particular vaccine. When developing vaccines to be given during pregnancy, a justification should be provided for the specific study design, based upon its intended use (e.g., protecting the mother during pregnancy or protecting the child early postnatally).
Daily dosing regimens can lead to overexposure to the vaccine constituents. Episodic dosing of pregnant animals rather than daily dosing is recommended. Also, episodic dosing better approximates the proposed clinical immunization schedule for most preventive and therapeutic vaccines. Considering the short gestational period of routine animal species, it is generally recommended to administer a priming dose(s) to the animals several days or weeks prior to mating in order to elicit peak immune response during the critical phases of pregnancy (i.e., the period of organogenesis). The dosing regimen can be modified according to the intended vaccination schedule in humans.
At least one dose should be administered during early organogenesis to evaluate potential direct embryotoxic effects of the components of the vaccine formulation and to maintain a high antibody response throughout the remainder of gestation. If embryo-fetal toxicity is observed, this can be further assessed using subgroups of animals that are dosed at certain time points.
In cases where a vaccine includes a novel active constitutive ingredient (including novel adjuvants), consideration of additional testing strategies similar to those for non-vaccine products can be appropriate.

7. げっ歯類を用いた組合せによる試験計画法

ほとんどの医薬品開発では、三試験計画法（FEED試験［ステージA～B］、EFD試験［ステージC～D］及びPPND試験［ステージC～F］）が用いられてきたが、使用動物数を削減する目的で、これらの試験デザインを様々に組み合わせることも可能である。組合せによる試験計画法の主な利点は、より少数の動物を用いて、関連するすべての生殖発生ステージを評価できることである。また、特に半減期の長い医薬品では、臨床での曝露期間に近い曝露を実現することができる。広く用いられている組合せによる試験計画法として、受胎能試験とEFD試験を統合した試験（ステージA～D）に個別のPPND試験（ステージC～F）を組み合わせるものがある。
FEED試験、EFD試験、PPND試験、あるいはそれらを統合した試験のデザイン及び試験の詳細については、附属書1を参照のこと。雌雄の受胎能への影響が懸念されない場合、あるいは反復投与毒性試験において生殖器に毒性が確認され、投与期間を延長することが適切と考えられる場合には、反復投与毒性試験とFEED試験を組み合わせた試験デザインを考慮してもよい。反復投与毒性試験で規定された投与期間を満了した後、雄動物を性成熟に達した雌動物（無処置又は交配前に2週間以上投与されたもの）と交配することも可能である。この組合せによる試験では使用動物数を削減できるが、各群の交配数を16以上とすべきである。交配前に2週間以上投与された雌動物への投与を、更に器官形成期終了まで延長して、EFDのエンドポイントを評価できるようにすることも可能である（附属書1）。

7. POSSIBLE COMBINATION STUDY DESIGNS IN RODENTS

Although three separate study designs, i.e., FEED (stages A and B), EFD (stages C through D) and PPND (stages C through F) have been employed to develop the majority of pharmaceuticals, various combinations of these study designs can be conducted to reduce animal use. The main advantage of combination designs is that all relevant stages of the reproductive process can be assessed using fewer animals. Combination studies can also better mimic the exposure duration in the clinic, especially for drugs with long half-lives. A common combination study design is a combined Fertility and EFD study (stages A through D) with a separate PPND study (stages C through F).
Designs and study details for FEED, EFD, and PPND studies, and combinations thereof, can be found in Annex 1.
In cases where no effects on male or female fertility are anticipated, or where extending the dosing period is appropriate due to observation of reproductive organ toxicity in a repeated-dose toxicity study, a combination design of repeated-dose and fertility studies can be considered. After a defined dosing period within the repeated-dose toxicity study, males can be paired with sexually mature females (whether untreated, or dosed for at least two weeks prior to mating). This combination study can reduce the number of animals used, but the number of mating pairs per group should be at least 16. Further, if treated, dosing of females can be extended until the end of organogenesis, thereby allowing evaluation of EFD endpoints (Annex 1).

8. データの報告及び統計

8.1 データの報告
試験に用いたすべての動物の成績を説明できるように、明確かつ簡潔に個々の値を表に示すべきである。データの表は、個々の動物とその受胎産物を、試験開始から試験終了まで容易に追跡できるものとすべきである。
胎児の形態学的異常所見は、業界で統一された用語を用いて記述すべきである。各同腹児のすべての所見を受胎産物別に明確に記載すべきである。異常所見を種類別に要約した一覧表を作成すべきである。妊娠していなかった動物のデータが要約表に含まれているか否かを明記すべきである。試験データの解釈は、主に同試験内の対照群との比較によって行われる。データ解釈の補助として、背景データを使用することができる。実施施設で得られた最近の背景データが望ましい。一般に直近の5年間のデータが望ましく、このようなデータであれば遺伝的浮動を確認することが可能である。

8.2 統計
本試験では、投与群と対照群の間の有意差を評価する統計的検定が求められる。生殖発生毒性試験のデータセットの多くは正規分布に従わないため、ノンパラメトリックな統計手法を用いる必要がある。帝王切開、胎児及び出生後のデータの要約統計量は、同腹児を解析単位として算出すべきである。統計学的有意差がある場合に必ずしも毒性学的な意義があるとは限らず、統計学的有意差がない場合でも必ずしも影響がないとも限らない。生物学的妥当性の判断には、入手可能なすべての薬理学的及び毒性学的データが有用であることも多い。

8. DATA REPORTING AND STATISTICS

8.1. Data Reporting
Individual values should be tabulated in a clear concise manner to account for all animals in the study. The data tables should allow ready tracking of individual animals and their conceptuses, from study initiation through study conclusion.
Fetal morphologic abnormalities should be described using industry-harmonized terminology. All findings for each litter should be clearly listed by conceptus. Summary listings should be prepared by type of abnormality. The inclusion or exclusion of data from non-pregnant animals in summary tables should be clearly indicated.
Interpretation of study data relies primarily on comparison with the concurrent control group. Historical control/reference data can be used to assist data interpretation. Recent historical control data from the performing laboratory is preferable. Contemporary data typically from a five-year period is desirable and permits identification of genetic drift.

8.2. Statistics
Statistical testing to assess the significance of differences between the treated and control groups is expected in definitive studies. Many of the datasets from DART studies do not follow a normal distribution, necessitating the use of non-parametric statistical methods. Cesarean, fetal and postnatal data summary statistics should be calculated using the litter as the unit of analysis. Statistical significance need not convey a positive signal, nor lack of statistical significance impute absence of effect. Determination of biological plausibility, based on all available pharmacologic and toxicologic data, is often useful.

9. リスク評価の原則

これまでの項で述べたように、臨床試験及び製造販売承認後において、使用条件下でのヒトにおける潜在的な生殖発生リスクに対処するにあたっては、当該医薬品、関連化合物、ヒト遺伝学から得られた入手可能なデータ、及び当該医薬品の標的分子がもつヒトの生殖における役割に関する知識をすべて利用すべきである。制限事項（試験系の適切性、最大曝露量など）、不確実性、非臨床における生殖発生毒性データパッケージ内のデータの相違点については、いずれもその影響を評価すべきである。一般的に、十分な曝露量下で適切な動物種を用いて実施されたin vivo本試験の結果は、代替法や予備試験から得られる結果よりも重視される。随時新たな情報が得られるため、リスク評価は製品の開発期間を通じて継続的に行われる。生殖発生毒性試験で報告されるすべての所見が有害というわけではない。所見が有害だと思われる場合は、科学的根拠の重み付けにより、いくつかの要素を検討しながらリスク評価すべきである。これには、曝露マージン、生物学的妥当性、用量反応関係、回復性、用量を制限するような親動物毒性の可能性及び動物種間での一致が含まれる。稀な形態異常が認められた場合、用量相関性がないとしても、必ずしも懸念が低くなるとは限らない。
試験動物種のNOAELにおける薬物曝露量とMRHDにおける薬物曝露量の比較は、リスク評価において重要である。この比較は最も適切な指標（AUC、Cmax、Cmin、体表面積換算した用量など）に基づいて行うべきである。一般的に、NOAELでの曝露量がMRHDでの曝露量の10倍未満である場合には懸念は増大し、10倍を超える場合には減少する。通常、MRHDにおける曝露量の25倍を超える曝露量でのみ生じる影響は、医薬品の臨床使用において懸念は小さい。他に適切な根拠がない限り、通常、最も感受性の高い動物種における曝露マージンを指標とする。生物学的妥当性の評価は、薬理学的作用機序と生殖発生における標的分子の既知の役割との比較によって行う。薬理作用の結果と解釈できる所見は、ヒトにとって懸念となることが示唆され、発現率又は重篤度の増加に用量相関性があれば因果関係はより明確になる。ある所見が生物学的に妥当と判断されない場合でも、明らかな用量反応関係がある場合には、オフターゲット毒性は否定できない。
回復性の有無によって、リスク評価の方法は変わる。例えば、投与中止後に回復するような雄動物と雌動物の受胎能に対する影響は、懸念が小さい。一方、死亡や形態異常などの重大で回復性のない発生に関わるエンドポイントは、懸念が大きい。その他の発生毒性（発育遅延、機能障害など）の回復性は所見次第である。一般的に、一過性の所見（げっ歯類における波状肋骨のような骨格変異など）は、単独で発現する場合には懸念が小さい。同様に、胎児体重の減少と共に生じた発育遅延も懸念が小さい。しかしながら、変異所見の発生頻度が全体的に増加（質的な類似性の有無を問わず）した場合には、明確な形態異常所見の増加がなくても、異常形態発生の懸念を示唆する可能性がある。
所見の重要性を判断する上では、親動物に対する毒性による影響を考慮すべきである。母動物の毒性が発現した状況下で認められた胚・胎児毒性は、ヒトへの外挿性の有無を慎重に判断すべきである。特に、同腹児ごとの所見とその母動物毒性の重篤度が一致しているかの評価が有用となりうる。発生毒性が母動物毒性による二次的な影響であると判断するには、それらの関連性を自ら実証するか、関連する公表文献から示す必要がある。また、報告された所見に関する試験間又は動物種間での一致性も有害作用の懸念を強める。試験間での一致の例としては、げっ歯類を用いたEFD試験で胎児致死の増加が認められ、かつPPND試験で生存児数の減少が認められる場合が挙げられる。動物種間での一致の例としては、ラットとウサギにおける着床後胚損失の増加の所見が挙げられる。各動物種の試験で特定された生殖発生への影響の機序に関する詳細な知見は、動物種間の反応性の違いを説明するのに役立ち、ヒトへの外挿性に関する情報となる（マウスにおけるコルチコステロイド誘発性の口蓋裂など）。
授乳に関して特に実施するリスク評価は、in vivo分娩試験（PPND又はePPND）により特定されたハザードに基づいて行う。これらのハザードには、乳汁中への薬物の分泌に起因する出生児の成長と発達に対する有害作用が含まれる。分娩試験で出生児の全身曝露データが得られた場合には、ヒト乳児で推定される授乳による曝露と比較することができる。乳汁の成分は動物種間で異なるため、動物の乳汁中薬物濃度をヒトの乳汁中薬物濃度と直接定量的に相関させることはできないが、動物の乳汁中に薬物が存在することは、一般に、ヒトの乳汁中にも薬物が存在することを示す。
最後に、利用可能なヒトでのデータは、ヒトの生殖発生リスクの総合的評価に影響を及ぼし得る。

9. PRINCIPLES OF RISK ASSESSMENT

As described in the preceding sections of this guideline, all available data garnered from the pharmaceutical, related compounds, human genetics, and knowledge of the role of target biology in human reproduction should be used to address potential reproductive risks in humans under the conditions of use, both during clinical trials and after marketing authorization. Any limitations (e.g., test system relevance, achieved exposure), uncertainties and data gaps in the available nonclinical DART data package should be addressed and their impact assessed. Generally, the results from definitive in vivo studies in an appropriate species with adequate exposures carry more weight than those from alternative assays or preliminary studies. Risk assessment is a continuous process through product development as more information becomes available.
Not all findings reported in DART studies are adverse. When a finding is deemed adverse, several factors should be considered in a weight-of-evidence evaluation for risk assessment. These can include exposure margins, biological plausibility, evidence of a dose-response relationship, potential for reversibility, the potential for confounding parental toxicity, and evidence for cross-species concordance. For rare malformations, the absence of increased frequency with dose does not always alleviate concern.
Comparison of pharmaceutical exposure at the NOAEL in the test species to the exposure at the MRHD is an important component of the risk assessment. This comparison should be based on the most relevant metric (e.g., AUC, Cmax, Cmin, body surface area-adjusted dose). In general, there is increased concern when the NOAEL occurs at exposures less than 10-fold the human exposure at the MRHD; above this threshold, concern is reduced. Effects that are limited to occurrence at more than 25-fold the human exposure at the MRHD are usually of minor concern for the clinical use of the pharmaceutical. The most relevant margin is generally the exposure metric in the most sensitive species, unless appropriately justified otherwise. Biological plausibility is assessed by comparison of pharmacologic mechanism of action with the known role of the target in reproduction or development. A finding that can be interpreted as a consequence of pharmacology suggests that it will be of concern for humans. This relationship is further strengthened by evidence that the finding is dose-related, whether characterized as increasing incidence or severity. Absence of biological plausibility does not preclude off-target toxicity, particularly if this is characterized by a dose-response relationship.
Understanding the potential for reversibility will alter the risk assessment. Effects on male and female fertility that are reversible after cessation of treatment are of less concern. Conversely, critical irreversible developmental endpoints, such as death or malformation, are of increased concern. Other forms of developmental toxicity (e.g., growth retardation, functional deficits), may or may not be reversible. Generally, transient findings (e.g., skeletal variations, such as wavy ribs in rodents) are of less concern when they occur in isolation. Similarly, variations that are indicative of growth retardation in the presence of reduced fetal weight are of less concern. However, an overall increase in the incidence of variations (qualitatively similar or not) can suggest increased concern for dysmorphogenesis in the presence of an equivocal increase in malformations.
The role of parental toxicity should be considered in determination of the relevance of findings. Embryo-fetal toxicity observed in the presence of maternal toxicity should be considered carefully to determine the likelihood that the finding is relevant for humans. Specifically, evaluation of the concordance between individual litter findings and the severity of maternal toxicity in the dam could be helpful in this assessment. It should not be assumed that developmental toxicity is secondary to maternal toxicity, unless such a relationship is demonstrated de novo, or relevant published literature can be cited.
Also, consistency of findings reported among studies, or between species can strengthen the concern for an adverse effect. Increased fetal lethality seen in a rodent EFD study that is consistent with decreased live litter sizes in the PPND study is an example of cross-study concordance. Observations of increased post implantation loss in rats and rabbits is an example of cross-species concordance. Further knowledge of the mechanism of reproductive or developmental effects identified in animal studies can help to explain differences in responses between species and provide information on the human relevance of the effect (e.g., corticosteroid-induced cleft palate in mice).
A specific risk assessment conducted for breastfeeding would be predicated on hazards identified by the in vivo littering study (PPND or ePPND). These hazards can include adverse effects on offspring growth and development that are attributed to excretion of the pharmaceutical in the milk. Systemic exposure data in the pups from the littering study, if available, can also be compared with projected lactational exposures in the human infant. While interspecies differences in milk composition preclude a direct quantitative correlation of animal milk levels to human milk levels of a pharmaceutical, the presence of pharmaceutical in animal milk generally indicates the presence of pharmaceutical in human milk.
Lastly, available human data can influence the overall assessment of human reproductive risk.

10. 注釈

注1：特に精巣と精巣上体は、精上皮の組織構造を維持できる方法を用いて採取及び処理すべきである。精子形成期間を考慮した詳細で定性的な病理組織学的検査は、精子形成に対する影響を検出する感度の高い方法である。通常必要とされないが、追加のエンドポイント（免疫組織化学検査、ホモジナイズ後の精子細胞数、フローサイトメトリー、ステージの定量的解析など）を試験デザインに組み入れ、認められた影響の特性をより明らかにすることができる。雌動物では、卵巣（卵胞、黄体、線維性間質細胞、間質腺細胞及び血管系を含む）、子宮、及び膣の詳細で定性的な病理組織学的検査を、生殖サイクル及び原始卵胞と一次卵胞の存在を考慮して実施すべきである。
注2：ヒトに対する催奇形性物質として既知あるいは推定される22の化合物を解析したところ、MEFLが認められたケースでは、少なくとも1種の動物種において、最小毒性量（LOAEL）での曝露量がMRHDでの曝露量の6倍未満であった（Andrews et al. (6)）。このことは、EFD試験での高用量選択の際に、25倍を超える曝露量比を用いればこれらすべての医薬品に対する催奇形性のハザードを十分検出できることを示している。本解析では、動物でMEFLが検出されたヒト催奇形性物質に関して、少なくとも1種の動物種におけるNOAELでの曝露量がMRHDでの曝露量の4倍未満であったことも示された。
さらにIQ DruSafeリーダーシップグループによりEFD試験に関する調査が行われた（Andrews et al. (7)）。この調査から、例えば、用量を制限するような母動物毒性が発現しない条件下において、ヒト（想定される治療用量での曝露量）に対して動物での未変化体の曝露量比が15倍以上に達していたEFD本試験は、ラットで153件、ウサギで128件であったことが明らかとなった。これらのデータによると、母動物毒性が認められない場合（認められれば高用量投与は制限される）、ヒト曝露量の25倍以上の曝露量を達成するよう動物へ投与しても、MEFLは稀にしか認められない。これらすべての場合において、MEFLは50倍を超える曝露量まで認められず、このような高曝露量での所見がヒトでのリスク評価に適しているとは考えられない。そのため、用量を制限するような母動物毒性が発現しない場合、EFD及びPPND試験の高用量として、MRHDでの総未変化体濃度におけるヒト血中曝露量に対する曝露量比が25倍を超える用量とすることは理にかなっており、ヒトのリスク評価に適した結果を検出するのに十分であると考えられる。

10. ENDNOTES

Note 1: In particular, the testes and epididymides should be sampled and processed using methods which preserve the tissue architecture of the seminiferous epithelium. A detailed qualitative microscopic evaluation with awareness of the spermatogenic cycle is a sensitive means to detect effects on spermatogenesis. While generally not warranted, additional experimental endpoints (e.g., immunohistochemistry, homogenization resistant spermatid counts, flow cytometry, quantitative analysis of staging) can be incorporated into the study design to further characterize any identified effects. In females, a detailed qualitative microscopic examination of the ovary (including follicles, corpora lutea, stroma, interstitium, and vasculature), uterus and vagina should be conducted with awareness of the reproductive cycle and the presence of primordial and primary follicles.
Note 2: An analysis of 22 known human or presumed human teratogens showed that if MEFL was observed, exposure at the lowest observed adverse effect level (LOAEL) in at least one species was < 6-fold the exposure at the MRHD (Andrews et al. (6)). This indicates that using a > 25-fold exposure ratio for high-dose selection in the EFD toxicity studies would have been sufficient to detect the teratogenic hazard for all these pharmaceuticals. The analysis also showed that for human teratogens that were detected in animal species, the exposure at the NOAEL in at least one species was < 4-fold the exposure at the MRHD.
In addition, a survey was conducted on EFD toxicity studies by the IQ DruSafe Leadership Group (Andrews et al. (7)). This survey identified 153 and 128 definitive rat and rabbit EFD studies, respectively, that achieved ≥ 15-fold animal to human parent drug exposure ratios (using human exposure at the intended therapeutic dose) in the absence of confounding (i.e., dose-limiting) maternal toxicity. These data show that dosing animals to achieve exposures ≥ 25-fold human exposures when there is no maternal toxicity (that would otherwise limit the high dose), only infrequently detects MEFL. In all these cases, MEFL findings were not observed until exposures exceeded 50-fold and findings at such high exposures are not believed to be relevant to human risk assessment. In the absence of confounding maternal toxicity, the selection of a high dose for EFD and PPND studies that represents a > 25-fold exposure ratio to human plasma exposure of total parent compound at the intended maximal therapeutic dose is therefore considered pragmatic and reasonably sufficient for detecting outcomes relevant for human risk assessment.

11. 用語

注意：本項に示す定義は本ガイドライン内で使用するためのものである。
代替法：形態異常や胚・胎児致死性（MEFL参照）を予測することを目的としたin vitro、ex vivo又は非哺乳類in vivo試験法。

適用領域：試験法で信頼して試験されうる物質の物理化学的特性及び生物学的作用機序の定義。

代替法の適格性確認（規制当局の受入れ目的）：in vivoで認められるMEFLを特定する上での代替法の予測性の確認。

構成成分：ワクチンで賦形剤、希釈剤、又はアジュバントとして使用されている化学物質又は生物学的物質。製品を投与しやすくするために別途供給される希釈剤を含む。

発生毒性：成人期に達する前に誘発される有害作用。受精から出生後までに誘発、あるいは顕在化する影響を含む。

GD 0：妊娠0日。交尾成立が確認（げっ歯類では膣スメアによる精子確認／膣栓、ウサギでは交尾の確認など）された日。

形態異常：一般的に正常な発生や生存に支障をきたす、あるいは著しく有害な永続的構造の逸脱。

予備的EFD（pEFD）毒性試験：器官形成期に曝露を行う胚・胎児発生毒性試験で、適切な用量段階を設定し、各群6匹以上の妊娠動物を用いて、胎児生存、胎児体重、外表・内臓の変化を評価する（ICH M3参照）。

サロゲート分子：医薬品がヒトで惹起するものと同様の薬理活性を試験動物種に引き起こす分子。

ワクチン：本ガイドラインでは、感染性疾患の予防と治療のためのワクチンを意図する。ワクチン（ワクチン製品という用語も含む）は完全な製剤として定義され、抗原（あるいは免疫原）及びアジュバント、賦形剤、保存剤などの添加剤が含まれる。当該ワクチンは免疫系を刺激しワクチン抗原に対する免疫反応を獲得することを目的としている。ワクチンの主な薬理作用は、感染あるいは感染性疾患の予防や治療である。

変異：生存性、発生、あるいは機能に影響を与えない構造変化（骨化遅延など）。可逆的なものもあり，生殖発生毒性試験の対照群で認められることもある。

11. GLOSSARY

Disclaimer: The definitions in this glossary are specific for their use within this guideline.
Alternative assay(s): In vitro, ex vivo or non-mammalian in vivo assay(s) intended to predict malformations or embryo-fetal lethality; see MEFL.
Applicability domain: refers to the definition of the physicochemical properties of the substances that can be reliably tested in the assay and the biological mechanisms of action covered by the assay.
Assay qualification (for regulatory use): Confirmation of the predictivity of an alternative assay(s) to identify MEFL, as observed in vivo.
Constitutive ingredients: Chemicals or biologic substances used as excipients, diluents, or adjuvants in a vaccine, including any diluent provided as an aid in the administration of the product and supplied separately.
Developmental toxicity: Any adverse effect induced prior to attainment of adult life. It includes effects induced or manifested from conception to postnatal life.
GD 0: The day on which positive evidence of mating is detected (e.g., sperm is found in the vaginal smear / vaginal plug in rodents, or observed mating in rabbits).
Malformation: Permanent structural deviation that generally is incompatible with or severely detrimental to normal development or survival.
Preliminary EFD (pEFD) toxicity study: An embryo-fetal developmental toxicity study that includes exposure over the period of organogenesis, has adequate dose levels, uses a minimum of 6 pregnant animals per group, and includes assessments of fetal survival, fetal weight, and external and soft tissue alterations (see ICH M3).
Surrogate molecule: A molecule showing similar pharmacologic activity in the test species as that shown by the human pharmaceutical in the human.
Vaccine: For the purpose of this guideline, this term refers to preventative or therapeutic vaccines for infectious diseases. Vaccine (inclusive of the term vaccine product) is defined as the complete formulation and includes antigen(s) (or immunogen(s)) and any additives such as adjuvants, excipients or preservatives. The vaccine is intended to stimulate the immune system and result in an immune response to the vaccine antigen(s). The primary pharmacological effect of the vaccine is the prevention and/or treatment of an infection or infectious disease.
Variation: Structural change that does not impact viability, development, or function (e.g., delays in ossification) which can be reversible, and are found in the normal population under investigation.