Comments about dispersion of light waves

Hervé C. Lefèvre

doi:10.1051/jeos/2022001

Abstract

Dispersion of light waves is well known, but the subject deserves some comments. Certain classical equations do not fully respect causality; as an example, group velocity v_g is usually given as the first derivative of the angular frequency ω with respect to the angular spatial frequency k_m (or wavenumber) in the medium, whereas it is k_m that depends on ω. This paper also emphasizes the use of phase index n and group index n_g, as inverse of their respective velocities, normalized to 1/c, the inverse of free-space light velocity. This clarifies the understanding of dispersion equations: group dispersion parameter D is related to the first derivative of n_g with respect to wavelength λ, whilst group velocity dispersion GVD is also related to the first derivative of n_g, but now with respect to angular frequency ω. One notices that the term second order dispersion does not have the same meaning with λ, or with ω. In addition, two original and amusing geometrical constructions are proposed; they simply derive group index n_g from phase index n with a tangent, which helps to visualize their relationship. This applies to bulk materials, as well as to optical fibers and waveguides, and this can be extended to birefringence and polarization mode dispersion in polarization-maintaining fibers or birefringent waveguides.Dispersion of light waves is well known, but the subject deserves some comments. Certain classical equations do not fully respect causality; as an example, group velocity v_g is usually given as the first derivative of the angular frequency ω with respect to the angular spatial frequency k_m (or wavenumber) in the medium, whereas it is k_m that depends on ω. This paper also emphasizes the use of phase index n and group index n_g, as inverse of their respective velocities, normalized to 1/c, the inverse of free-space light velocity. This clarifies the understanding of dispersion equations: group dispersion parameter D is related to the first derivative of n_g with respect to wavelength λ, whilst group velocity dispersion GVD is also related to the first derivative of n_g, but now with respect to angular frequency ω. One notices that the term second order dispersion does not have the same meaning with λ, or with ω. In addition, two original and amusing geometrical constructions are proposed; they simply derive group index n_g from phase index n with a tangent, which helps to visualize their relationship. This applies to bulk materials, as well as to optical fibers and waveguides, and this can be extended to birefringence and polarization mode dispersion in polarization-maintaining fibers or birefringent waveguides.

Keywords

Birefringence Chromatic dispersion Dispersion Effective index First-order dispersion Group birefringence Group index Group velocity dispersion Index of refraction Polarization mode dispersion Refractive index Second-order dispersion

1 Introduction

The theory of dispersion of light waves is well known and can be found in many textbooks, as for example [1–3], but the way it is usually presented deserves some comments. For example, causality can be seen as not being fully respected in the basic equations, within the meaning of causal link between their parameters.

This paper also emphasizes that the indexes are very convenient to understand more easily the equations describing dispersion. They should be viewed as inverses of velocity, normalized to 1/c, the inverse of light velocity c in a vacuum, knowing that there is no short word for these inverses that are involved in these equations. Furthermore, indexes are dimensionless values, which avoids dealing with units that are not always easily understandable, as we shall see.

In addition, two simple and amusing geometrical constructions are presented to derive group index n_g from refractive index n with a tangent. To my knowledge, they are original. They apply to material dispersion, but also to guidance dispersion in a fiber or a waveguide, as well as to birefringence and intrinsic polarization mode dispersion (I-PMD) in polarization-maintaining (PM) fibers or birefringent waveguides.

Obviously, this does not prevent the use of Sellmeier’s equation [4] and derivatives with a computer, to calculate precisely these indexes, but this brings a complementary view to visualize simply the question of dispersion, and the relationship between refractive index n and group index n_g.

2 Comments regarding causality

The first analysis of dispersion was done by Newton, in the mid-17th century, with a prism that can separate the various colors of white light, because the refractive index n (or index of refraction) of a glass is not constant; it is called chromatic dispersion (chroma is color, in ancient Greek). At the beginning of the 19th century, with the work of Fresnel on the mathematical theory of diffraction, it was finally accepted by the scientific community that light is a wave, as proposed by Huygens at the end of the 17th century, and that the index n is the ratio between light velocity c in a vacuum and its lower velocity in a medium. Remember that Newtonian corpuscular theory stated the opposite, with light going faster in a medium than in a vacuum, and that it required more than a century to get Huygens’ wave model accepted, because of the tremendous prestige of Newton, as discussed by Aspect in a recent historical review paper [5].

In acoustics, it was proposed by Hamilton in the first half of the 19th century that a modulated wave has in fact two velocities: the sinusoidal carrier does propagate at the velocity of a continuous wave, called the phase velocity v_φ, but the modulation term propagates at the so-called group velocity v_g. This concept of group velocity was later developed in full mathematical details by Rayleigh in his iconic book, The Theory of Sound [6], and group velocity was seen as the velocity of signal energy.

At the beginning of the 20th century, the application of this concept to optical waves raised many questions, because it could lead to a group velocity higher than c, in the case of high Anomalous dispersion, i.e., when (dn/dλ) ≫ 0, which was contradictory with the Theory of Relativity. This question was brilliantly solved by Sommerfeld and Brillouin in twin papers of 1914 [7, 8]. An English version of these papers can be found in the very interesting Brillouin’s book of 1960, Wave Propagation and Group Velocity [9]. The important result is that anomalous dispersion happens when there is absorption, as in the far ultra-violet for example, but with absorption, group velocity is not the velocity of signal energy anymore; then, it is not contradictory with Relativity.

Going back to the usual case of normal optical dispersion in transparent dielectric media, i.e., when (dn/dλ) < 0, phase velocity v_φ = c/n and group velocity v_g are classically given by [1–3]:(1) $v_{φ} = ω / k_{m} and v_{g} = d ω / d k_{m},$ where ω is the angular (temporal) frequency, and k_m is the angular spatial frequency in the medium. For k_m = 2π/λ_m = 2π · n/λ, with λ_m = λ/n being the wavelength in the medium and λ being the wavelength in a vacuum, the terms angular wave number, or wave number alone, are also used, but I prefer angular spatial frequency to outline the duality between the temporal domain and the spatial domain.

I do not like much the use of the derivative dω/dk_m for the definition of v_g, since it does not respect the causal link between k_m and ω, that I call, in short, causality in this paper: it is k_m that depends on ω, and not ω that depends on k_m. You may say that dω/dk_m and (dk_m/dω)⁻¹ are alike for Leibniz’s notation of derivatives, beauty of math, but with Lagrange’s notation, that I prefer here, it is less clear: ω′(k_m) suggests a causality that would not be fully respected, whereas [k′_m(ω)]⁻¹ does respect causality.

To be more specific, you must agree that it is the index n (involved in k_m = 2π · n/λ), that depends on the wavelength λ in a vacuum (involved in ω, with ω = 2π · c/λ), and not the opposite; it is n(λ) and not λ(n), even if λ, as a function of n, remains mathematically possible. Math does not care about causality and can inverse a function, but causality is fundamental in physics! Equation (1) should be:(2) $1 / v_{φ} = k_{m} / ω and 1 / v_{g} = d k_{m} / d ω = {k'}_{m} (ω) .$

You may think that it is nit-picking but, to me, it is important, and I wanted to share this view. I am not the only one to think that, and equation (2) is found in several textbooks [10–12], even if they do not outline the difference between (1) and (2). It must be obvious for the authors of these references, but I am not sure that it is obvious for every reader of this article.

Now, if a frequency is the inverse of a period, as we have all learned with music, there is no short word for the inverse of a velocity, that is involved in (2). The habit is to use temporal delay over a unit distance [3] or group delay per unit length [10], with t_g = 1/v_g, but it is not very concise. Slowness could have been possible, but it is not very positive wording. This can be overcome with the use of refractive index n, also called phase index since n = c/v_φ, and of group index n_g = c/v_g. They should be viewed as the inverse of a velocity, normalized to 1/c. Relativity physicists do normalize the time dependence c · t of their equations with c = 1, to get the same temporal dimension as that of the spatial coordinates; we can do the inverse! So, there are:(3) $n = (1 / v_{φ}) / (1 / c) and n_{g} = (1 / v_{g}) / (1 / c) .$

You may think that it is nit-picking again, or obvious math, but I prefer to consider that it is useful to understand dispersion better. Using group index n_g, as defined in (3), it is simple to derive its relationship with phase index n. Since k_m = n(ω)·ω/c, there is:(4) $n_{g} (ω) = (1 / v_{g}) / (1 / c) = c \cdot d k_{m} / d ω = d [n (ω) \cdot ω] / d ω .$

It is important to notice in (4) the derivative of the product [n(ω) · ω]. It explains the difference between dispersion seen as a function of wavelength λ, and seen as a function of angular frequency ω, as we shall see later. Now from (4):(5) $n_{g} (ω) = d [n (ω) \cdot ω] / d ω = n (ω) + [ω \cdot d n / d ω] .$

With Lagrange’s notation of derivative, instead of Leibniz’s notation, this yields:(6) $n_{g} (ω) = n (ω) + [ω \cdot n' (ω)] .$

This well-known equation is often written as a function of the wavelength λ in a vacuum. Since ω = 2π · c/λ, there is, by logarithmic differentiation, dω/ω = −dλ/λ, and then:(7) $n_{g} (λ) = n (λ) - [λ \cdot d n / d λ]$ or, with Lagrange’s notation:(8) $n_{g} (λ) = n (λ) - [λ \cdot n' (λ)] .$

It is often taught that there is no group velocity dispersion, when the second derivative n″(λ), or d²n/dλ², is equal to 0; for silica fibers, it is for λ = 1.3 μm. Geometrically, this corresponds to an inflexion point of the refractive index curve n, as a function of wavelength λ (Fig. 1). It is mathematically true but, again, it does not fully respect causality. The cause is that there is no group dispersion when group velocity v_g, as well as group index n_g, are constant, i.e., when its first derivative dn_g/dλ, or n′_g(λ), is equal to zero. It happens to be the case when the second derivative of the refractive (or phase) index, d²n/dλ², or n″(λ), equates zero, but it is only a consequence of the fact that dn_g/dλ = −λ · d²n/dλ²; it is not the original cause which is dn_g/dλ equates zero.

$Refractive (or phase) index n (solid line curve), and group index ng (dashed line curve) of silica, as a function of the wavelength λ in a vacuum. At 1.3 μm, there is no group velocity dispersion since ng is constant, i.e., dng/dλ is null. This is the basic cause; the fact that d2n/dλ2 = 0 at 1.3 μm is just a consequence of dng/dλ = −λ · d2n/dλ2.$

Figure 1.Refractive (or phase) index n (solid line curve), and group index n_g (dashed line curve) of silica, as a function of the wavelength λ in a vacuum. At 1.3 μm, there is no group velocity dispersion since n_g is constant, i.e., dn_g/dλ is null. This is the basic cause; the fact that d²n/dλ² = 0 at 1.3 μm is just a consequence of dn_g/dλ = −λ · d²n/dλ².

This result influenced a vocabulary that must be handled with care. Chromatic dispersion, also known as phase velocity dispersion, i.e., first derivative of phase index dn/dλ ≠ 0, is often called first-order dispersion, and group velocity dispersion, i.e., first derivative of group index dn_g/dλ ≠ 0, but also second derivative of phase index d²n/dλ² ≠ 0, is often called second-order dispersion.

So, I have another comment: if one considers frequencies (angular, ω, or regular temporal, f = ω/2π, or spatial, σ = 1/λ), this does not work anymore; this works only with period, i.e., wavelength λ or temporal period T = 1/f. No group velocity dispersion means that dn_g/dω (or dn_g/df, or dn_g/dσ) is equal to zero, since it is the basic cause, but from (5):(9) $d n_{g} / d ω = d [n (ω) + (ω \cdot d n / d ω)] = (2 \cdot d n / d ω) + (ω \cdot d^{2} n / d ω^{2})$

Therefore, when dn_g/dω = 0, the second derivative d²n/dω² of the phase index equates −(2·dn/dω)/ω, and it is not null. There are similar equations with f and σ, since df/f = dσ/σ = dω/ω, by logarithmic differentiation.

Nevertheless, it is possible to view second-order dispersion with frequency. It is not as visual as for wavelength, with the inflexion point of the refractive index dependence, but it remains simple mathematically. We saw in (2) that t_g = 1/v_g = dk_m/dω, and group dispersion is the first derivative of this group delay t_gover a unit distance, i.e., 1/v_g. With frequency, it is called group velocity dispersion (GVD), and:(10) $GVD = d t_{g} / d ω = d (1 / v_{g}) / d ω = d^{2} k_{m} / d ω^{2} .$

Since k_m = 2π · n/λ = [(n(ω)·ω]/c, there is no second order dispersion when the second derivative of the product n(ω) · ω is null, i.e., when d² k_m/dω² = (1/c)·d²[n(ω)·ω]/dω² = 0, whereas, with λ, it is when the second derivative of the refractive index n(λ) alone is null, i.e. d²n/dλ², as we already saw.

To promote, again, indexes as normalized inverse velocities, I have an additional comment with what is called group dispersion parameter, D, and what is called, we just saw, group velocity dispersion, GVD, as you find on refractiveindex.info web site, for example. D is classically expressed in ps/(nm km), and it is simply [3]:(11) $D = d t_{g} / d λ = d (1 / v_{g}) / d λ .$

Using the first derivative of n_g(λ), this yields:(12) $D = (1 / c) \cdot d n_{g} / d λ = (1 / c) \cdot {n'}_{g} (λ) .$

GVD is similar, but it is using the derivative with respect to ω, instead of λ:(13) $GVD = d t_{g} / d ω = d (1 / v_{g}) / d ω .$

Using the first derivative of n_g(ω), this yields:(14) $GVD = (1 / c) \cdot d n_{g} / d ω = (1 / c) \cdot {n'}_{g} (ω) .$

Its unit is square second per meter. This is concise but not easily understandable. It would be clearer to use s/[(rad/s) m], or s²/(rad m), a radian per second being the unit of ω, even if a radian is a dimensionless unit that can be omitted. This is a complicated question, by the way, to decide if a dimensionless unit can be omitted or not?

In silica, at 1550 nm, dn/dλ ≈ −0.012 μm⁻¹ and dn_g/dλ ≈ +0.007 μm⁻¹ (Fig. 2). I like these values that have a simple unit, and that you can check on index curves. Now, since 1/c is about one nanosecond per foot (a very useful value that I learned and still remember from my postdoc at Stanford University, in the early 1980s), group dispersion parameter D is about +23 ps/(nm km), and group velocity dispersion GVD is about −28 ps²/km (or fs²/mm).

$Refractive (or phase) index n (solid line curve), and group index ng (dashed line curve) of silica, as a function of the wavelength λ in a vacuum. At 1.55 μm, the slope of the tangent to the curve n(λ), i.e., dn/dλ or n′(λ), equates minus 0.012 μm−1, and gives the chromatic, or first-order, dispersion; the one to the curve ng(λ), i.e., dng/dλ or n′g(λ), equates plus 0.007 μm−1, and gives the group velocity dispersion, or second-order dispersion. Since 1/c = 1 ns/300 mm, the dispersion parameter D(1.55 μm) = (1/c) · n′g(1.55 μm) equates + 23 ps/(nm km).$

Figure 2.Refractive (or phase) index n (solid line curve), and group index n_g (dashed line curve) of silica, as a function of the wavelength λ in a vacuum. At 1.55 μm, the slope of the tangent to the curve n(λ), i.e., dn/dλ or n′(λ), equates minus 0.012 μm⁻¹, and gives the chromatic, or first-order, dispersion; the one to the curve n_g(λ), i.e., dn_g/dλ or n′_g(λ), equates plus 0.007 μm⁻¹, and gives the group velocity dispersion, or second-order dispersion. Since 1/c = 1 ns/300 mm, the dispersion parameter D(1.55 μm) = (1/c) · n′_g(1.55 μm) equates + 23 ps/(nm km).

To understand better ps²/km, the unit of GVD, and to compare it with ps/(nm km), the unit of D, one must see that in “ps²”, there are a first “ps” for the delay, as in ps/(nm km), and a second “ps” that is actually 1/(10⁺¹² rad/s), with the omission of dimensionless radian. At 1550 nm, where the temporal frequency f is 193.5 THz, the angular frequency, ω = 2π · f, is about 1.2 × 10⁺¹⁵ rad/s; then, a value of 10⁺¹² rad/s for the shift ∆ω, is around 0.1% of ω. The “nm” used in D for ∆λ is also a shift of about 0.1% of the wavelength, that is in the μm range, and then, the two numerical values, 23 and 28, are close. One femtosecond, for GVD, is about equivalent to one nanometer to the −1 power, for D.

In any case, ps²/km remains a strange unit to me. As a teaser, it would have been also possible to use a concise and strange unit for D: ps/mm², since 1 nm × 1 km = 1 mm × 1 mm, but with mm², it looks like an area!

Finally, remember that D and GVD have opposite signs, since ω = 2π · c/λ, and then, dω/dλ is negative, as well as dλ/dω. To talk about positive or negative group dispersion requires to specify if it is with respect to period, or to frequency. To avoid this problem, it is customary to use for group velocity dispersion the same vocabulary as the one used for phase velocity dispersion. As we saw, normal dispersion corresponds to a negative derivative of the index with respect to λ, and anomalous dispersion corresponds to a positive derivative. However, I am not very fond of this vocabulary for group dispersion, even if it is convenient. Anomalous has an understandable meaning with phase velocity dispersion, since it is when the medium is absorbing, and it is not the normal use in the transparency window. With group velocity dispersion, a positive group index derivative is not Anomalous, nor abnormal, strictly speaking. Taking the case of silica, it is simply above 1.3 μm, as seen in Figure 2!

3 Simple geometrical constructions to derive n_g from n

Let us consider the theoretical case of a material without any group dispersion. As we saw, dn_g/dλ = 0 implies that d²n/dλ² = 0. The curve of the refractive (or phase) index n as a function of wavelength λ is a simple affine function, with a constant slope, equal to dn/dλ, since d²n/dλ² = 0:(15) $n (λ) = n (0) + (λ \cdot d n / d λ) .$

Because the group index n_g(λ) is equal to n(λ) – (λ · dn/dλ), as seen in (7), there is:(16) $n_{g} (λ) = [n (0) + (λ \cdot d n / d λ)] - (λ \cdot d n / d λ) = n (0) .$

Group index n_g is constant and equal to the value n(0) of the phase index for a null wavelength (Fig. 3).

$Refractive (or phase) index n(λ) (solid line), and group index ng(λ) (dashed line), in the theoretical case of a material without any group dispersion. The straight line representing n(λ) crosses the ordinate axis corresponding to λ = 0, at the constant value of ng(λ) = n(0).$

Figure 3.Refractive (or phase) index n(λ) (solid line), and group index n_g(λ) (dashed line), in the theoretical case of a material without any group dispersion. The straight line representing n(λ) crosses the ordinate axis corresponding to λ = 0, at the constant value of n_g(λ) = n(0).

As it can be simply visualized with this Figure 3, normal dispersion, i.e., a negative slope dn/dλ, yields a group index n_g higher than the phase index n. Conversely, anomalous dispersion, i.e., a positive slope dn/dλ, yields a group index n_g lower than the phase index n, and one can easily understand that with a steep positive slope, group index can become lower than one, and even negative, which was obviously a problem with the theory of Relativity, but this was solved by Sommerfeld and Brillouin, as we already saw [7–9].

Now, with the practical case of a material with group dispersion, one must consider the tangent to the curve n(λ). Using Lagrange’s notation, which is clearer, here, than Leibniz’s notation, the equation T₀(λ) of this tangent, for a given wavelength λ₀, is:(17) $T_{0} (λ) = n (λ_{0}) + [n^{'} (λ_{0}) \cdot (λ - λ_{0})] .$

Then, it is simple to see with (7) again, that for a null wavelength:(18) $T_{0} (0) = n (λ_{0}) - [n' (λ_{0}) \cdot λ_{0}] = n_{g} (λ_{0}) .$

The tangent T₀(λ) to the curve n(λ), at λ₀, crosses the ordinate axis, i.e., when λ is zero, at the value n_g(λ₀) of the group index for λ₀ (Fig. 4). It is simple math, but it is amusing and, also, very useful to visualize simply the relationship between n and n_g.

$Refractive (or phase) index n(λ) (solid line curve), with group dispersion. The tangent T0(λ) to the curve n(λ) at λ0 crosses the ordinate axis corresponding to λ = 0, at the value ng(λ0) of the group index for λ0.$

Figure 4.Refractive (or phase) index n(λ) (solid line curve), with group dispersion. The tangent T₀(λ) to the curve n(λ) at λ₀ crosses the ordinate axis corresponding to λ = 0, at the value n_g(λ₀) of the group index for λ₀.

Knowing this, Figure 1 can be revisited, even if silica is not transparent anymore below 0.15 μm (Fig. 5). Mathematically, it remains possible to continue the tangent toward zero. Math does not care about causality, as we already saw, nor does it care about transparency!

$Refractive (or phase) index n(λ) (solid line curve), and group index ng(λ) (dashed line curve) of silica. The tangent to the curve n(λ), at λ0, crosses the extended ordinate axis, where λ = 0 μm, at the value ng(λ0) of the group index for λ0, as shown for λ0 equal to 0.85 μm, or to 1.3 μm.$

Figure 5.Refractive (or phase) index n(λ) (solid line curve), and group index n_g(λ) (dashed line curve) of silica. The tangent to the curve n(λ), at λ₀, crosses the extended ordinate axis, where λ = 0 μm, at the value n_g(λ₀) of the group index for λ₀, as shown for λ₀ equal to 0.85 μm, or to 1.3 μm.

4 The case of a single-mode optical fiber

As we saw, a CW light wave propagates at a phase velocity v_φ = ω/k_m(ω), in a bulk medium. In an optical fiber, there are discrete modes that propagate at different phase velocities v_φi = ω/β_i(ω), where β_i(ω) is called the propagation constant of mode i [2, 3, 10]. These propagation constants depend on the angular temporal frequency ω, and they have an intermediate value between the angular spatial frequency k_m2 in the cladding of refractive index n₂, and k_m1, the one in the core of refractive index n₁. In the single-mode regime, the high-order modes are above cut-off, and the fundamental mode is the only one that can propagate. Its propagation constant β(ω) follows:(19) $k_{m 2} = 2 π \cdot n_{2} / λ < β (ω) < k_{m 1} = 2 π \cdot n_{1} / λ .$

It is very convenient to use the so-called effective index n_eff of the mode defined with:(20) $β (ω) = 2 π \cdot n_{eff} (ω) / λ .$

Following (19), n_eff has an intermediate value between n₂ and n₁, and like β it depends on frequency:(21) $n_{2} < n_{eff} (ω) < n_{1} .$

The angular (temporal) frequency ω is very useful to shorten mathematical equations, but everybody is more familiar with the wavelength λ in a vacuum. You know what is 1 μm, whereas 1 rad/s is not that obvious, besides the fact that it corresponds to 0.16 Hz. Therefore, the frequency dependence of n_eff is classically presented with respect to λ_c/λ, where λ_c is the cut-off wavelength of the second-order mode. This ratio does correspond to a frequency: it is the spatial frequency 1/λ normalized to 1/λ_c.

This effective index n_eff is a phase index, and all the mathematical equations of Section 2 can be used to find the effective group index n_g-eff. It is also possible to use the geometrical construction of Section 3, that relates a group index to its corresponding phase index.

The way to proceed is to invert the classical curve n_eff(λ_c/λ), and to get the inverted curve n_eff(λ/λ_c), that depends on λ, and not on 1/λ anymore (Fig. 6).

$Effective index neff of the fundamental mode, n1 being the refractive index of the core, and n2 being the one of the cladding: (a) classical representation, as a function of the normalized spatial frequency λc/λ, in a vacuum; (b) inverted representation as a function of the normalized wavelength λ/λc, in a vacuum.$

Figure 6.Effective index n_eff of the fundamental mode, n₁ being the refractive index of the core, and n₂ being the one of the cladding: (a) classical representation, as a function of the normalized spatial frequency λ_c/λ, in a vacuum; (b) inverted representation as a function of the normalized wavelength λ/λ_c, in a vacuum.

It is interesting to notice that this inverted curve is easier to understand, for the fundamental mode, than the classical one: as the wavelength increases, the mode widens [2], and it expands more in the cladding, which decreases the effective index n_eff. However, the classical representation remains better with several modes, since their effective index curves are spread about evenly in frequency, which would not be the case with the inverted representation.

Now, once you have this inverted representation in wavelength, you must just apply the geometrical construction of Figure 4, to find the value of the effective group index n_g-eff (Fig. 7).

$Geometrical construction to derive the effective group index ng-eff(λ/λc) of the fundamental mode (solid line curve) from its effective phase index neff(λ/λc) (dashed line curve). The tangent to neff(λ/λc) crosses the ordinate axis at the value of the corresponding effective group index ng-eff(λ/λc), as shown for λ = 1.25 λc. One sees easily that ng-eff is about equal to the refractive index of the core n1, in the practical domain of use of a single-mode fiber, i.e., λc c; above 1.5 λc, the mode starts to widen a lot and the curvature loss increases drastically.$

Figure 7.Geometrical construction to derive the effective group index n_g-eff(λ/λ_c) of the fundamental mode (solid line curve) from its effective phase index n_eff(λ/λ_c) (dashed line curve). The tangent to n_eff(λ/λ_c) crosses the ordinate axis at the value of the corresponding effective group index n_g-eff(λ/λ_c), as shown for λ = 1.25 λ_c. One sees easily that n_g-eff is about equal to the refractive index of the core n₁, in the practical domain of use of a single-mode fiber, i.e., λ_c < λ < 1.5 λ_c; above 1.5 λ_c, the mode starts to widen a lot and the curvature loss increases drastically.

Note, however, that it is possible to find also a geometrical construction with the frequency dependence. It is not as simple as with the period dependence, but it has some interest. The equation of the tangent T₀(ω) to the curve n(ω), for a given angular frequency ω_0, is:(22) $T_{0} (ω) = n (ω_{0}) + [(ω - ω_{0}) \cdot n^{'} (ω_{0})],$ (23) $T_{0} (ω) = T_{0} (0) + [(ω \cdot n' (ω_{0})]$ with:(24) $T_{0} (0) = n (ω_{0}) - [ω_{0} \cdot n^{'} (ω_{0})] .$ The slope of this tangent is n′(ω₀).

Consider now the affine function DS₀(ω) starting from T₀(0), and having a double slope, i.e., a slope equal to twice that of the tangent T₀(ω):(25) ${DS}_{0} (ω) = T_{0} (0) + 2 [(ω \cdot n^{'} (ω_{0})] .$

Following (24), there is:(26) ${DS}_{0} (ω_{0}) = n (ω_{0}) + [ω_{0} \cdot n^{'} (ω_{0})] .$

As seen in (6), the group index follows n_g(ω) = n(ω) + [ω · n′(ω)], then:(27) ${DS}_{0} (ω_{0}) = n_{g} (ω_{0}) .$

With λ, we saw that the tangent to the curve n(λ), at λ₀, crosses the ordinate axis at n_g(λ₀). With ω, one draws a double-slope line DS₀(ω) (Fig. 8). In the case of the fundamental mode of a fiber, the double-slope construction can be used with the classical curve n_eff(λ_c/λ) seen in Figure 6a, as shown in Figure 9.

Figure 8.Two possible geometrical constructions for relating group index n_g to phase index n: (a) with the dependence in wavelength λ, i.e., the spatial period of the wave, the tangent T₀(λ) to the phase index curve crosses the ordinate axis at the value n_g(λ₀) of the group index; (b) with the dependence in angular frequency ω, a double-slope line DS₀(ω) is drawn from where the tangent T₀(ω) to the phase index curve crosses the ordinate axis, and the group index n_g(ω₀) equates DS₀(ω₀).

$Geometrical construction relating the effective group index ng-eff to the effective index neff of the fundamental mode of a fiber, with the dependence in normalized spatial frequency λc/λ. A double-slope line is drawn from where the tangent to the effective index curve crosses the ordinate axis, and the group index ng(λc/λ0) equates DS0(λc/λ0), as shown for λc/λ0 = 0.8, i.e., for λ0 = 1.25 λc. As in Figure 7, one easily sees that ng-eff is about equal to the refractive index of the core n1, in the practical domain of use of a single-mode fiber, i.e., λc c, or 0.67 c/λ < 1.$

Figure 9.Geometrical construction relating the effective group index n_g-eff to the effective index n_eff of the fundamental mode of a fiber, with the dependence in normalized spatial frequency λ_c/λ. A double-slope line is drawn from where the tangent to the effective index curve crosses the ordinate axis, and the group index n_g(λ_c/λ₀) equates DS₀(λ_c/λ₀), as shown for λ_c/λ₀ = 0.8, i.e., for λ₀ = 1.25 λ_c. As in Figure 7, one easily sees that n_g-eff is about equal to the refractive index of the core n₁, in the practical domain of use of a single-mode fiber, i.e., λ_c < λ < 1.5 λ_c, or 0.67 < λ_c/λ < 1.

Obviously, this geometrical construction can also be used for high-order modes, as well as for integrated-optic waveguides.

5 The case of polarization mode dispersion in a PM fiber

The use of phase and group indexes as normalized inverses of velocity, as well as the geometrical constructions, that were presented, are also very useful for birefringence and intrinsic polarization mode dispersion (intrinsic PMD) [3] of high-birefringence polarization-maintaining (PM) fibers.

Phase birefringence B, or modal birefringence, or simply birefringence, is the difference between the phase effective index n_slow of the slow polarization mode, and n_fast, that of the fast mode. In addition, it is very convenient to use the concept of group birefringence B_g, instead of intrinsic PMD; this simplifies equations. B_g is the difference between the group indexes n_g-slow and n_g-fast:(28) $B = n_{slow} - n_{fast} and B_{g} = n_{g - slow} - n_{g - fast} .$

We saw in (7), that n_g(λ) = n(λ) – (λ · dn/dλ), then:(29) $B_{g} (λ) = [n_{slow} (λ) - (λ \cdot {d n}_{slow} / d λ)] - [n_{fast} (λ) - (λ \cdot {d n}_{fast} / d λ)],$ (30) $B_{g} (λ) = [n_{slow} (λ) - n_{fast} (λ)] - [λ \cdot (d n_{slow} / d λ - d n_{fast} / d λ)],$ (31) $B_{g} (λ) = B (λ) - [λ \cdot d B / d λ] .$

This last equation (31) is similar to (7), replacing the indexes by the birefringences, therefore the geometrical construction, that we saw in Figure 4, can be used (Fig. 10).

Figure 10.Geometrical construction relating group birefringence B_g to phase birefringence B(λ) (solid line curve); the tangent T₀(λ) to the curve B(λ) at λ₀, crosses the ordinate axis corresponding to λ = 0, at the value B_g(λ₀) of the group birefringence for λ₀. This figure is obviously derived from Figure 4.

The intrinsic PMD_i of high-birefringence PM fibers is the difference between the group delays per unit length of both modes [3]:(32) ${PMD}_{i} = t_{g - slow} - t_{g - fast} = (1 / v_{g - slow}) - (1 / v_{g - fast}) = (n_{g - slow} / c) - (n_{g - fast} / c),$ which yields a very simple equation:(33) ${PMD}_{i} = B_{g} \cdot 1 / c = [B - (λ \cdot d B / d λ)] \cdot 1 / c$ knowing that 1/c is about one nanosecond per foot, as we already saw. Group birefringence is actually what is called intrinsic PMD, but normalized to 1/c. Typical group birefringence B_g of PM fibers is around 5 × 10⁻⁴, and again it is a dimensionless value, which yields an intrinsic PMD on the order of 1.5–2 ps/m.

Intrinsic PMD is related to phase birefringence dispersion: when dB/dλ ≠ 0, group birefringence B_g is different from phase birefringence B; it is a first-order dispersion. There is in addition group birefringence dispersion, when the first derivative dB_g/dλ ≠ 0. As with group index, it is also when the second derivative d²B/dλ² ≠ 0, since dB_g/dλ = −λ · d²B/dλ². It is a second-order dispersion that must be considered in certain cases as, for example, with optical coherence-domain polarimetry (OCDP) also called distributed polarization crosstalk analysis (DPXA) [13, 14].

6 Conclusion

This paper presents comments and geometrical constructions that should ease the understanding of dispersion, which is not always explained simply in textbooks. The points to remember are:

Group velocity is classically given with v_g = dω/dk_m, i.e., ω′(k_m), but this equation does not fully respect causality, within the meaning of causal link between its parameters.

It is better to use 1/v_g = dk_m/dω, i.e., k′_m(ω). To be more specific, it is n(λ) yielding dn/dλ, and not λ(n) yielding dλ/dn.

Phase and group indexes should be viewed as the inverse of their respective velocity, normalized to 1/c. This yields clearer equations for group dispersion parameter, D = (1/c) · dn_g/dλ, and for group velocity dispersion, GVD = (1/c) · dn_g/dω. In addition, indexes are dimensionless units, which avoids dealing with units that are not always easily understandable.

The term second-order dispersion, for group velocity dispersion, should be used carefully. There is group velocity dispersion when the group index n_g is not constant, which is the basic cause, causality again. It is the case when the second derivative of the phase index with respect to λ, d²n/dλ², is not null, but it is only a consequence of dn_g/dλ = −λ · d²n/dλ². With ω, it does not work anymore; it is when d²(n · ω)/dω² is not null, and not d²n/dω², that there is group dispersion.

The unit of GVD, ps²/km or fs²/mm, remains strange to me, and I should not be the only one.

The two simple geometrical constructions that relate group index n_g to phase index n, with the tangent, are very useful to visualize simply their relationship, and they are new, to my knowledge. You must have noticed that I prefer clear geometrical figures using simple math to complicated equations.

These comments and these geometrical constructions also apply to guidance dispersion in optical fiber and integrated-optic waveguide.

These comments and these geometrical constructions can be used for birefringence and intrinsic polarization mode dispersion in PM fibers or birefringent integrated-optic waveguides, when the very convenient concept of group birefringence is used.

I hope that this paper will be useful and help to clarify the subject. It is just based on what I had not fully understood about dispersion, over the forty-five years of my career, as you can check in [15, 16], where I classically used (1). I understand it better today, and I wanted to share it. And as you did notice, I do like the simple and amusing geometrical constructions with the tangent, which is the reason for this paper, even if I did not resist adding some comments that might look slightly provocative, but nevertheless important, I think.

References

[1] M. Born, E. Wolf. Principles of optics(1999).

[2] L.B. Jeunhomme. Single-mode fiber optics(1990).

[3] J.A. Buck. Fundamentals of optical fibers(2004).

[4] W. Sellmeier. Ueber die durch die Aetherschwingungen erregten Mitschwingungen der Körpertheilchen und deren Rückwirkung auf die ersteren, besonders zur Erklärung der Dispersion und ihrer Anomalien. Annalen der Physik und Chemie, 223, 386-403(1872).

[5] A. Aspect. From Huygens’ waves to Einstein’s Photons: Weird light. C. R. Acad. Sci., 18, 498-503(2017).

[6] J.W. Rayleigh. The theory of sound(1877).

[7] A. Sommerfeld. Über die Fortpflanzung des Lichtes in dispergierenden Medien. Ann. Phys., 44, 177(1914).

[8] L. Brillouin. Über die Fortpflanzung des Lichtes in dispergierenden Medien. Ann. Phys., 44, 203(1914).

[9] L. Brillouin. Wave propagation and group velocity(1960).

[10] G.P. Agraval. Fiber-optic communication systems(1992).

[11] A. Méndez, T.F. Morse. Specialty optical fibers handbook(2007).

[12] A. Kumar, A. Ghatak. Polarization of light with applications in optical fibers, TT90(2011).

[13] X.S. Yao. Techniques to ensure high-quality fiber optic gyro coil production. E. Udd, M. Digonnet (eds.), Design and development of fiber optic gyroscopes, 217-261(2019).

[14] H.C. Lefèvre. The fiber-optic gyroscope(2022).

[15] H. Lefèvre. The fiber-optic gyroscope(1993).

[16] H.C. Lefèvre. The fiber-optic gyroscope(2014).

微信扫一扫：分享

微信扫一扫：分享