
- Journal of the European Optical Society-Rapid Publications
- Vol. 18, Issue 1, 2022001 (2022)
Abstract
Keywords
1 Introduction
The theory of dispersion of light waves is well known and can be found in many textbooks, as for example [
This paper also emphasizes that the indexes are very convenient to understand more easily the equations describing dispersion. They should be viewed as inverses of velocity, normalized to 1/c, the inverse of light velocity c in a vacuum, knowing that there is no short word for these inverses that are involved in these equations. Furthermore, indexes are dimensionless values, which avoids dealing with units that are not always easily understandable, as we shall see.
In addition, two simple and amusing geometrical constructions are presented to derive group index ng from refractive index n with a tangent. To my knowledge, they are original. They apply to material dispersion, but also to guidance dispersion in a fiber or a waveguide, as well as to birefringence and intrinsic polarization mode dispersion (I-PMD) in polarization-maintaining (PM) fibers or birefringent waveguides.
Obviously, this does not prevent the use of Sellmeier’s equation [
2 Comments regarding causality
The first analysis of dispersion was done by Newton, in the mid-17th century, with a prism that can separate the various colors of white light, because the refractive index n (or index of refraction) of a glass is not constant; it is called chromatic dispersion (chroma is color, in ancient Greek). At the beginning of the 19th century, with the work of Fresnel on the mathematical theory of diffraction, it was finally accepted by the scientific community that light is a wave, as proposed by Huygens at the end of the 17th century, and that the index n is the ratio between light velocity c in a vacuum and its lower velocity in a medium. Remember that Newtonian corpuscular theory stated the opposite, with light going faster in a medium than in a vacuum, and that it required more than a century to get Huygens’ wave model accepted, because of the tremendous prestige of Newton, as discussed by Aspect in a recent historical review paper [
In acoustics, it was proposed by Hamilton in the first half of the 19th century that a modulated wave has in fact two velocities: the sinusoidal carrier does propagate at the velocity of a continuous wave, called the phase velocity vφ, but the modulation term propagates at the so-called group velocity vg. This concept of group velocity was later developed in full mathematical details by Rayleigh in his iconic book, The Theory of Sound [
At the beginning of the 20th century, the application of this concept to optical waves raised many questions, because it could lead to a group velocity higher than c, in the case of high Anomalous dispersion, i.e., when (dn/dλ) ≫ 0, which was contradictory with the Theory of Relativity. This question was brilliantly solved by Sommerfeld and Brillouin in twin papers of 1914 [
Going back to the usual case of normal optical dispersion in transparent dielectric media, i.e., when (dn/dλ) < 0, phase velocity vφ = c/n and group velocity vg are classically given by [
I do not like much the use of the derivative dω/dkm for the definition of vg, since it does not respect the causal link between km and ω, that I call, in short, causality in this paper: it is km that depends on ω, and not ω that depends on km. You may say that dω/dkm and (dkm/dω)−1 are alike for Leibniz’s notation of derivatives, beauty of math, but with Lagrange’s notation, that I prefer here, it is less clear: ω′(km) suggests a causality that would not be fully respected, whereas [k′m(ω)]−1 does respect causality.
To be more specific, you must agree that it is the index n (involved in km = 2π · n/λ), that depends on the wavelength λ in a vacuum (involved in ω, with ω = 2π · c/λ), and not the opposite; it is n(λ) and not λ(n), even if λ, as a function of n, remains mathematically possible. Math does not care about causality and can inverse a function, but causality is fundamental in physics! Equation
You may think that it is nit-picking but, to me, it is important, and I wanted to share this view. I am not the only one to think that, and equation
Now, if a frequency is the inverse of a period, as we have all learned with music, there is no short word for the inverse of a velocity, that is involved in
You may think that it is nit-picking again, or obvious math, but I prefer to consider that it is useful to understand dispersion better. Using group index ng, as defined in
It is important to notice in
With Lagrange’s notation of derivative, instead of Leibniz’s notation, this yields:
This well-known equation is often written as a function of the wavelength λ in a vacuum. Since ω = 2π · c/λ, there is, by logarithmic differentiation, dω/ω = −dλ/λ, and then:
It is often taught that there is no group velocity dispersion, when the second derivative n″(λ), or d2n/dλ2, is equal to 0; for silica fibers, it is for λ = 1.3 μm. Geometrically, this corresponds to an inflexion point of the refractive index curve n, as a function of wavelength λ (
Figure 1.Refractive (or phase) index n (solid line curve), and group index ng (dashed line curve) of silica, as a function of the wavelength λ in a vacuum. At 1.3 μm, there is no group velocity dispersion since ng is constant, i.e., dng/dλ is null. This is the basic cause; the fact that d2n/dλ2 = 0 at 1.3 μm is just a consequence of dng/dλ = −λ · d2n/dλ2.
This result influenced a vocabulary that must be handled with care. Chromatic dispersion, also known as phase velocity dispersion, i.e., first derivative of phase index dn/dλ ≠ 0, is often called first-order dispersion, and group velocity dispersion, i.e., first derivative of group index dng/dλ ≠ 0, but also second derivative of phase index d2n/dλ2 ≠ 0, is often called second-order dispersion.
So, I have another comment: if one considers frequencies (angular, ω, or regular temporal, f = ω/2π, or spatial, σ = 1/λ), this does not work anymore; this works only with period, i.e., wavelength λ or temporal period T = 1/f. No group velocity dispersion means that dng/dω (or dng/df, or dng/dσ) is equal to zero, since it is the basic cause, but from
Therefore, when dng/dω = 0, the second derivative d2n/dω2 of the phase index equates −(2·dn/dω)/ω, and it is not null. There are similar equations with f and σ, since df/f = dσ/σ = dω/ω, by logarithmic differentiation.
Nevertheless, it is possible to view second-order dispersion with frequency. It is not as visual as for wavelength, with the inflexion point of the refractive index dependence, but it remains simple mathematically. We saw in
Since km = 2π · n/λ = [(n(ω)·ω]/c, there is no second order dispersion when the second derivative of the product n(ω) · ω is null, i.e., when d2 km/dω2 = (1/c)·d2[n(ω)·ω]/dω2 = 0, whereas, with λ, it is when the second derivative of the refractive index n(λ) alone is null, i.e. d2n/dλ2, as we already saw.
To promote, again, indexes as normalized inverse velocities, I have an additional comment with what is called group dispersion parameter, D, and what is called, we just saw, group velocity dispersion, GVD, as you find on refractiveindex.info web site, for example. D is classically expressed in ps/(nm km), and it is simply [
Using the first derivative of ng(λ), this yields:
GVD is similar, but it is using the derivative with respect to ω, instead of λ:
Using the first derivative of ng(ω), this yields:
Its unit is square second per meter. This is concise but not easily understandable. It would be clearer to use s/[(rad/s) m], or s2/(rad m), a radian per second being the unit of ω, even if a radian is a dimensionless unit that can be omitted. This is a complicated question, by the way, to decide if a dimensionless unit can be omitted or not?
In silica, at 1550 nm, dn/dλ ≈ −0.012 μm−1 and dng/dλ ≈ +0.007 μm−1 (
Figure 2.Refractive (or phase) index n (solid line curve), and group index ng (dashed line curve) of silica, as a function of the wavelength λ in a vacuum. At 1.55 μm, the slope of the tangent to the curve n(λ), i.e., dn/dλ or n′(λ), equates minus 0.012 μm−1, and gives the chromatic, or first-order, dispersion; the one to the curve ng(λ), i.e., dng/dλ or n′g(λ), equates plus 0.007 μm−1, and gives the group velocity dispersion, or second-order dispersion. Since 1/c = 1 ns/300 mm, the dispersion parameter D(1.55 μm) = (1/c) · n′g(1.55 μm) equates + 23 ps/(nm km).
To understand better ps2/km, the unit of GVD, and to compare it with ps/(nm km), the unit of D, one must see that in “ps2”, there are a first “ps” for the delay, as in ps/(nm km), and a second “ps” that is actually 1/(10+12 rad/s), with the omission of dimensionless radian. At 1550 nm, where the temporal frequency f is 193.5 THz, the angular frequency, ω = 2π · f, is about 1.2 × 10+15 rad/s; then, a value of 10+12 rad/s for the shift ∆ω, is around 0.1% of ω. The “nm” used in D for ∆λ is also a shift of about 0.1% of the wavelength, that is in the μm range, and then, the two numerical values, 23 and 28, are close. One femtosecond, for GVD, is about equivalent to one nanometer to the −1 power, for D.
In any case, ps2/km remains a strange unit to me. As a teaser, it would have been also possible to use a concise and strange unit for D: ps/mm2, since 1 nm × 1 km = 1 mm × 1 mm, but with mm2, it looks like an area!
Finally, remember that D and GVD have opposite signs, since ω = 2π · c/λ, and then, dω/dλ is negative, as well as dλ/dω. To talk about positive or negative group dispersion requires to specify if it is with respect to period, or to frequency. To avoid this problem, it is customary to use for group velocity dispersion the same vocabulary as the one used for phase velocity dispersion. As we saw, normal dispersion corresponds to a negative derivative of the index with respect to λ, and anomalous dispersion corresponds to a positive derivative. However, I am not very fond of this vocabulary for group dispersion, even if it is convenient. Anomalous has an understandable meaning with phase velocity dispersion, since it is when the medium is absorbing, and it is not the normal use in the transparency window. With group velocity dispersion, a positive group index derivative is not Anomalous, nor abnormal, strictly speaking. Taking the case of silica, it is simply above 1.3 μm, as seen in
3 Simple geometrical constructions to derive ng from n
Let us consider the theoretical case of a material without any group dispersion. As we saw, dng/dλ = 0 implies that d2n/dλ2 = 0. The curve of the refractive (or phase) index n as a function of wavelength λ is a simple affine function, with a constant slope, equal to dn/dλ, since d2n/dλ2 = 0:
Because the group index ng(λ) is equal to n(λ) – (λ · dn/dλ), as seen in
Group index ng is constant and equal to the value n(0) of the phase index for a null wavelength (
Figure 3.Refractive (or phase) index n(λ) (solid line), and group index ng(λ) (dashed line), in the theoretical case of a material without any group dispersion. The straight line representing n(λ) crosses the ordinate axis corresponding to λ = 0, at the constant value of ng(λ) = n(0).
As it can be simply visualized with this
Now, with the practical case of a material with group dispersion, one must consider the tangent to the curve n(λ). Using Lagrange’s notation, which is clearer, here, than Leibniz’s notation, the equation T0(λ) of this tangent, for a given wavelength λ0, is:
Then, it is simple to see with
The tangent T0(λ) to the curve n(λ), at λ0, crosses the ordinate axis, i.e., when λ is zero, at the value ng(λ0) of the group index for λ0 (
Figure 4.Refractive (or phase) index n(λ) (solid line curve), with group dispersion. The tangent T0(λ) to the curve n(λ) at λ0 crosses the ordinate axis corresponding to λ = 0, at the value ng(λ0) of the group index for λ0.
Knowing this,
Figure 5.Refractive (or phase) index n(λ) (solid line curve), and group index ng(λ) (dashed line curve) of silica. The tangent to the curve n(λ), at λ0, crosses the extended ordinate axis, where λ = 0 μm, at the value ng(λ0) of the group index for λ0, as shown for λ0 equal to 0.85 μm, or to 1.3 μm.
4 The case of a single-mode optical fiber
As we saw, a CW light wave propagates at a phase velocity vφ = ω/km(ω), in a bulk medium. In an optical fiber, there are discrete modes that propagate at different phase velocities vφi = ω/βi(ω), where βi(ω) is called the propagation constant of mode i [
It is very convenient to use the so-called effective index neff of the mode defined with:
Following
The angular (temporal) frequency ω is very useful to shorten mathematical equations, but everybody is more familiar with the wavelength λ in a vacuum. You know what is 1 μm, whereas 1 rad/s is not that obvious, besides the fact that it corresponds to 0.16 Hz. Therefore, the frequency dependence of neff is classically presented with respect to λc/λ, where λc is the cut-off wavelength of the second-order mode. This ratio does correspond to a frequency: it is the spatial frequency 1/λ normalized to 1/λc.
This effective index neff is a phase index, and all the mathematical equations of
The way to proceed is to invert the classical curve neff(λc/λ), and to get the inverted curve neff(λ/λc), that depends on λ, and not on 1/λ anymore (
Figure 6.Effective index neff of the fundamental mode, n1 being the refractive index of the core, and n2 being the one of the cladding: (a) classical representation, as a function of the normalized spatial frequency λc/λ, in a vacuum; (b) inverted representation as a function of the normalized wavelength λ/λc, in a vacuum.
It is interesting to notice that this inverted curve is easier to understand, for the fundamental mode, than the classical one: as the wavelength increases, the mode widens [
Now, once you have this inverted representation in wavelength, you must just apply the geometrical construction of
Figure 7.Geometrical construction to derive the effective group index ng-eff(λ/λc) of the fundamental mode (solid line curve) from its effective phase index neff(λ/λc) (dashed line curve). The tangent to neff(λ/λc) crosses the ordinate axis at the value of the corresponding effective group index ng-eff(λ/λc), as shown for λ = 1.25 λc. One sees easily that ng-eff is about equal to the refractive index of the core n1, in the practical domain of use of a single-mode fiber, i.e., λc < λ < 1.5 λc; above 1.5 λc, the mode starts to widen a lot and the curvature loss increases drastically.
Note, however, that it is possible to find also a geometrical construction with the frequency dependence. It is not as simple as with the period dependence, but it has some interest. The equation of the tangent T0(ω) to the curve n(ω), for a given angular frequency ω0, is:
Consider now the affine function DS0(ω) starting from T0(0), and having a double slope, i.e., a slope equal to twice that of the tangent T0(ω):
Following
As seen in
With λ, we saw that the tangent to the curve n(λ), at λ0, crosses the ordinate axis at ng(λ0). With ω, one draws a double-slope line DS0(ω) (
Figure 8.Two possible geometrical constructions for relating group index ng to phase index n: (a) with the dependence in wavelength λ, i.e., the spatial period of the wave, the tangent T0(λ) to the phase index curve crosses the ordinate axis at the value ng(λ0) of the group index; (b) with the dependence in angular frequency ω, a double-slope line DS0(ω) is drawn from where the tangent T0(ω) to the phase index curve crosses the ordinate axis, and the group index ng(ω0) equates DS0(ω0).
Figure 9.Geometrical construction relating the effective group index ng-eff to the effective index neff of the fundamental mode of a fiber, with the dependence in normalized spatial frequency λc/λ. A double-slope line is drawn from where the tangent to the effective index curve crosses the ordinate axis, and the group index ng(λc/λ0) equates DS0(λc/λ0), as shown for λc/λ0 = 0.8, i.e., for λ0 = 1.25 λc. As in
Obviously, this geometrical construction can also be used for high-order modes, as well as for integrated-optic waveguides.
5 The case of polarization mode dispersion in a PM fiber
The use of phase and group indexes as normalized inverses of velocity, as well as the geometrical constructions, that were presented, are also very useful for birefringence and intrinsic polarization mode dispersion (intrinsic PMD) [
Phase birefringence B, or modal birefringence, or simply birefringence, is the difference between the phase effective index nslow of the slow polarization mode, and nfast, that of the fast mode. In addition, it is very convenient to use the concept of group birefringence Bg, instead of intrinsic PMD; this simplifies equations. Bg is the difference between the group indexes ng-slow and ng-fast:
We saw in
This last equation
Figure 10.Geometrical construction relating group birefringence Bg to phase birefringence B(λ) (solid line curve); the tangent T0(λ) to the curve B(λ) at λ0, crosses the ordinate axis corresponding to λ = 0, at the value Bg(λ0) of the group birefringence for λ0. This figure is obviously derived from
The intrinsic PMDi of high-birefringence PM fibers is the difference between the group delays per unit length of both modes [
Intrinsic PMD is related to phase birefringence dispersion: when dB/dλ ≠ 0, group birefringence Bg is different from phase birefringence B; it is a first-order dispersion. There is in addition group birefringence dispersion, when the first derivative dBg/dλ ≠ 0. As with group index, it is also when the second derivative d2B/dλ2 ≠ 0, since dBg/dλ = −λ · d2B/dλ2. It is a second-order dispersion that must be considered in certain cases as, for example, with optical coherence-domain polarimetry (OCDP) also called distributed polarization crosstalk analysis (DPXA) [
6 Conclusion
This paper presents comments and geometrical constructions that should ease the understanding of dispersion, which is not always explained simply in textbooks. The points to remember are:
Group velocity is classically given with vg = dω/dkm, i.e., ω′(km), but this equation does not fully respect causality, within the meaning of causal link between its parameters.
It is better to use 1/vg = dkm/dω, i.e., k′m(ω). To be more specific, it is n(λ) yielding dn/dλ, and not λ(n) yielding dλ/dn.
Phase and group indexes should be viewed as the inverse of their respective velocity, normalized to 1/c. This yields clearer equations for group dispersion parameter, D = (1/c) · dng/dλ, and for group velocity dispersion, GVD = (1/c) · dng/dω. In addition, indexes are dimensionless units, which avoids dealing with units that are not always easily understandable.
The term second-order dispersion, for group velocity dispersion, should be used carefully. There is group velocity dispersion when the group index ng is not constant, which is the basic cause, causality again. It is the case when the second derivative of the phase index with respect to λ, d2n/dλ2, is not null, but it is only a consequence of dng/dλ = −λ · d2n/dλ2. With ω, it does not work anymore; it is when d2(n · ω)/dω2 is not null, and not d2n/dω2, that there is group dispersion.
The unit of GVD, ps2/km or fs2/mm, remains strange to me, and I should not be the only one.
The two simple geometrical constructions that relate group index ng to phase index n, with the tangent, are very useful to visualize simply their relationship, and they are new, to my knowledge. You must have noticed that I prefer clear geometrical figures using simple math to complicated equations.
These comments and these geometrical constructions also apply to guidance dispersion in optical fiber and integrated-optic waveguide.
These comments and these geometrical constructions can be used for birefringence and intrinsic polarization mode dispersion in PM fibers or birefringent integrated-optic waveguides, when the very convenient concept of group birefringence is used.
I hope that this paper will be useful and help to clarify the subject. It is just based on what I had not fully understood about dispersion, over the forty-five years of my career, as you can check in [
References
[1] M. Born, E. Wolf.
[2] L.B. Jeunhomme.
[3] J.A. Buck.
[4] W. Sellmeier. Ueber die durch die Aetherschwingungen erregten Mitschwingungen der Körpertheilchen und deren Rückwirkung auf die ersteren, besonders zur Erklärung der Dispersion und ihrer Anomalien.
[5] A. Aspect. From Huygens’ waves to Einstein’s Photons: Weird light.
[6] J.W. Rayleigh.
[7] A. Sommerfeld. Über die Fortpflanzung des Lichtes in dispergierenden Medien.
[8] L. Brillouin. Über die Fortpflanzung des Lichtes in dispergierenden Medien.
[9] L. Brillouin.
[10] G.P. Agraval.
[11] A. Méndez, T.F. Morse.
[12] A. Kumar, A. Ghatak.
[13] X.S. Yao. Techniques to ensure high-quality fiber optic gyro coil production. E. Udd, M. Digonnet (eds.),
[14] H.C. Lefèvre.
[15] H. Lefèvre.
[16] H.C. Lefèvre.

Set citation alerts for the article
Please enter your email address