Lectures in physics Part 1: Mechanics Przemysław Borys 7.11.2013

by user

on 15 сентября 2016

Category: Documents

>> Downloads: 5

views

Report

Comments

Description

Download Lectures in physics Part 1: Mechanics Przemysław Borys 7.11.2013

Transcript

Lectures in physics Part 1: Mechanics Przemysław Borys 7.11.2013

Lectures in physics
Part 1: Mechanics
Przemysław Borys
7.11.2013
1. Scalars and vectors.
In physics we use different quantities to describe the observed phenomena. Among them the
most important are scalars and vectors. Scalars are simply the numbers, which describe for
example the temperature, mass, density, energy. Other physical phenomena cannot be described in
terms of simple numbers. An example of such quantity is the velocity. It is not sufficient to say that
a car moves at 50km/h, we need to supplement this idea by the direction he moves to. A quantity
which consists of a magnitude and the direction information is called a vector.
A vector can be of two types. One type of vector is a bound vector, and the other is a free
vector. The first type of vector has prescribed coordinate of the initial and terminal point, while the
other doesn't. An example of the second case could be a translation vector of a coordinate system.
An example of the first case can be a force which acts on a specific point of a body.
In general it is possible to summarize that a vector consists of a magnitude, direction, and
sense, which determines whether the direction pointed by a vector is taken forward, or backward.
The sense is denoted by the arrow position on the line segment in the graphical vector
representation.
1.1. Basic notation, vector addition, vector multiplication by a scalar
In the Eulcidean space we can represent vectors in two basic ways. One is to use component
notation, another is to use the unit vectors. Let us first consider the former case.
In the figure above we can see a vector (a thick arrow) which can be enclosed by a cuboid in the
coordinate system. The x component equals „2”, the y component equals „3”, the z component
equals „4”. Therefore we can denote the vector by
[]
2
3
4
, or [2,3,4]T for short, where the „T”
symbol denotes a transposition, which transforms a row into a column, and vice versa.
To understand the second way of denoting vectors, we shall first learn the rules for vector
addition and subtraction. Before we do this, I emphasise a general remmark, which should be
apparent at this point. Vectors and scalars are different mathematical objects! It is impossible to put
an equality between a scalar and a vector, as is often done by students, who write equations like
 =ma , where the left hand side cannot be evaluated by the right hand side (which has no
F
arrow!) because it is impossible to set the direction of F, i.e. the information on the right hand side
is insufficient! This becomes even more evident in the component notation, where on the left hand
side the vector is represented by three numbers in an Euclidean space, while on the right hand side
we have only one number! .
The vector addition in terms of components can be easily understood. It simply refers to the
summation of the corresponding components. For example:
[][][]
1
2 3
21=3
3
4 7
Such an addition rule can also be represented by an useful geometrical construction, known
as the parallelogram rule:
In the above case we sum

 =[ 6,5]T .
A
B=[1,3]T [5,2]T =C
Another „easy” operation, which can be done on vectors is the vector multiplication by a
scalar number. This is related to the enlargement of the vector length by a specific amount. To do
this, we simply multiply all the vector components by the specified scalar, for example:
[][]
1
3
3 2=6
3
9
A careful reader may ask now: how do we define the vector length, that was multiplied
above? For two dimensional vectors, the components x, and y determine sides of a right triangle,
and therefore we can use the Pythagorean theorem: ∣A∣= A2x  A2y . In three dimensions, we can
first apply the Pythagorean theorem to the XY plane, and then to the resulting vector and the Z axis,
which results in ∣A∣= A2x  A2y A2z like below:
Having the rule for multiplication by a scalar, the vector addition procedure can easily
become generalized to vector subtraction because

  , i.e. it can be done based
A−
B= 
A−B
upon the vector addition formula for a negative vector B (i..e a vector with inverse sense, or
components multiplied by (-1)). So, to calculate

A−
B in the above case, we would write

A−
B=[1,3]T −[5,2]T =[1,3]T [−5,−2]T =[−4,1] T . A geometrical construction can be the
following:
As mentioned before, the vector representation can be put in terms of such called unit vectors.
What are the unit vectors? They are vectors of length „1”, which point along the x, y, and z axes.
Their names are i, j, and k, respectively:
As can be seen from the above figure, the unit vector, when multiplied, can resemble any
possible vector along the related axis. Then, making use of the vector addition formula, we can
„compose” any possible vector from the vectors based upon the unit vectors. For example,
[][][][][] [] []
1 1
0 0
1
0
0
=


=
2
3
2 0
2 0
0
1
0 = i 2 j3 k
3 0
0 3
0
0
1
This gives the second representation for vectors, which is quite useful for writing.
1.2. Dot product
The dot product between vectors

A and

B can be written as

 =∣A∣∣B∣cos 
A⋅B
where α is the angle between vectors 
A and 
B . It can be easily seen that when the vectors are
colinear (aligned along a single line, α=0), the dot product is just the multiplication of their length,
i.e.

A⋅
B=∣A∣∣B∣ . In case when the vectors are orthogonal (the angle between them is straight,
α=90), the dot product results in 0. This can be useful for designing a very useful dot product
representation:

 A x i A y j Az k ⋅ B x i B y jB z k =.
A⋅B=
 k =.
A x B x i⋅j A y B y j⋅j Az B z k⋅
Ax B x A y B y  Az B z
which is just because the product of orthogonal unit vectors, i.e. i⋅j , i⋅k , j⋅k , all reduce to zero.
Of course such simple relations hold only in the Euclidean space and if some of you would try to
study the physics of general relativity and curved space-time, he would encounter a more general
formula for the dot product, which can be written as:
ij

A⋅
B=∑i , j g Ai B j
which makes use of the, such called, metric tensor g, which takes a diagonal representation for the
Euclidean space, where gij=1 for i=j and gij=0 for i≠j (in fact it also differs in that it also considers
the fourth coordinate—the time, but we leave this aside for now).
The simple interpretation of the dot product is by means of the vector projection on another vector.
Consider the following figure:
Here we can see that to calculate the projection of vector A to vector B, one needs to multiply the
length of A by the cosine of the angle between the two vectors. It is immediately seen that if we
multiply the projection by the length of vector B, we end up in the dot product formula:
proj  A , 
B ∣B∣=∣A∣∣B∣ cos = 
A⋅
B
1.3. The cross product
The previous section gave us the information about a type of multiplication between two
vectors, which gives a scalar result. There is also a type of multiplication between two vectors,
=
which gives a vector as a result. This is called the cross product C
A× 
B . It is defined as an
operation that returns
a) a vector that is perpendicular to the two operands, and its direction is determined by the right
hand rule (see figure below)
b) a vector with magnitude equal to ∣C∣=∣A∣∣B∣sin 
The b) requirement means that the magnitude of the resulting vector equals to the area of the
parallelogram that is determined by these vectors (see fig. Below), where
S=∣A∣ h=∣A∣∣B∣sin  .
As we did in case of the dot product, also in case of the cross product we can consider the
behaviour of the product acting on the unit vectors. The angles between unit vectors are eighter 0, π/
2, π, 3π/2. The sine of such angle can be easily evaluated and gives either 0, 1, or -1. The
magnitude of the cross product between components equals simply to the product of their
magnitudes, because they form a rectangle, not a parallelogram. The cross product between two unit
vectors (e.g. i , j ) should give a perpendicular vector, which is the remaining unit vector (in our
example k ). Taking this together, we can say:

A× 
B= A x i A y j Az k × B x iB y jB z k =.
 A y B z− Az B y  i Az B x − Ax B z  j A x B y −A y B x  k
If you know what is a determinant of a 2x2 matrix, you immediately recognize that
∣
∣∣
A Az  A z

A×
B= y
i
By Bz
Bz
∣ ∣
∣
Ax  Ax Ay 
j
k
Bx
Bx B y
and further, considering the Laplace expansion, one finally recognizes that
∣
∣
i
j k

A×
B= A x Ay A z
Bx B y Bz
which gives the most compact and easy to memorize formula for calculating the cross product
provided, that you know the way to evaluate it (the Sarrus method, see fig. Below)
As you can see, to evaluate a determinant of the 3x3 matrix, one needs to rewrite the first
two rows under the determinant, and then draw the diagonal lines from upper left corner to lower
right and from upper right corner to lower left. The symbols encountered along the lines should be
multiplied and added together with symbols multiplied along the other lines. The lines pointing
from left to right are taken with a positive sign, and those pointing from right to left are taken
negative..
Questions
1. Describe vector addition and subtraction and define the dot product, and cross product
2. Discuss the derivation of the formulas for dot and cross product from the formulas
containing sine and cosine. Give interpretation of these operations.
Reference
E. Karaśkiewicz, Zarys teorii wektorów i tensorów. PWN, 1974.
2. Introduction to the differential calculus
If you remember the school definition of the velocity, you probably recall something like
v =
 s
t
where  s is the displacement vector, and  t is the associated time interval. However, this is
not a strict definition. Consider an accelerating car that between 0 and 10s accelerates from 0 to
50km/h. If you would evaluate velocity at time equal to 1s with  t=1s , you would obtain
approximately v =2.50km /h (calculating the distance by s=
at 2
). If you would take
2
 t=5s , you would obtain v =0.50km/h (because of the zero contribution from the past). For
 t=0.1s you would obtain
v =4.5km /h , quite close to what would be predicted by the
formula v =at .
How it is possible to have such different velocities at single time instant? It is not possible.
What we have calculated is NOT the instantenous velocity. It is just an average velocity. And the
average can differ depending upon the period, that we take into account for averaging. To obtain the
„real” instantenous velocity, we need to take the time interval to be as narrow as possible, to not
include any „historical” contributions:
v =lim t  0
 s d s
=
= s '
 t dt
2.1. Differentiation
Such a limit difference quotient is called a derivative and can be denoted using the
d
dt
or ... ' symbol. It is possible to calculate such derivatives directly. For example,
d t 2 
t t 2−t 2
t 22t  t− t 2−t 2
=limt  0
=limt 0
=2t
dt
t
t
You will probably learn more such examples durring your math classes. For the purposes of
physics we don't need however to digest this in details. You only need to know the key idea. Based
on such considerations, it is possible to build a basic table of derivatives:
Function
Derivative
t
n
nt
n−1
cos t
− sint 
sin t
cos t 
e kt
ke kt
ln ∣t∣
1
t
tg t
1
cos 2 t
ctg t 
−
1
2
sin t
The construction of the difference quotient gives a geometrical interpretation (it describes
the tangent of the angle of a tangent line to the curve, from basic trigonometry for the right triangle
below we have tg =
dy
):
dx
The difference quotient allows to have certain properties of the differentiation. For example:
af t ' =lim t  0
af t  t −af t 
f t  t − f t 
=a lim t  0
=a  f t '
t
t
Other properties can be found as well which results in the following table:
af t'=a  f t ' (derivative of a function multiplied by constant)
 f tg t'= f t' g t' (derivative of a sum of two functions)
 f t g t' = f t ' g t f t  g t' (derivative of a product)
'
 
f t  f t ' g t − f t  g t'
=
(derivative of a fraction)
g t
g 2 t 
 f  g t '=
df  g  dg t 
dg
dt
(chain rule)
Examples:
2t 25sin t '=2t 2' 5sin t' =2t 2 ' 5sint ' =4t5cos t
t 2 sin t '=t 2 ' sin tt 2 sin t'=2t sint t 2 cos t 
'
 
t 2 ' sin t−t 2 sin t' 2tsin t −t 2 cos t
t2
=
=
2
2
sin t
sin t
sin t 
sint 2 ' =cos t 2t 2 ' =2t cos t 2
2.2. Integration
Integration (indefinite integration) is an inverse procedure to differentiation. We denote
the integral using the following notation:
∫ f t dt= F t 
where a condition must be met that:
 F t '= f t  .
Looking at the table of derivatives it is possible to construct a table of integrals that fullfills the
above condition. For example t n '=nt n−1 ; substituting k =n−1 ; n=k 1 we obtain
'
t
k1
'=k 1t
k
, so
 
t k1
=t k , which gives the first relation in the table below:
k 1
Function
Integral
∫ t n dt
t n1
C
n1
∫ cost dt
sin t 
C

∫ sin t dt
−
cos t 
C

∫ e kt dt
e kt
C
k
∫ t dt
1
ln ∣t∣C
∫ cos 2 t dt
1
tg tC
1
−ctg tC
∫ sin2 t dt
Where the „+C” component originates from the idea that the derivative of a constant euqls zero.
Similarly to the differentiation, integration also obeys certain laws:
∫ af t dt=a ∫ f t dt (integral of a function multiplied by constant)
∫ f t g t dt=∫ f t dt∫ g t dt (integral of a sum of two functions)
∫  f t' g t dt = f t  g t−∫ f t  g t ' dt (integration by parts)
∫ f  g t g t' dt=∫ f  g  dg (integration by substitution)
Examples:
∫ t sin t dt=
[
]
f =t
g ' =sin t =−tcost  cos t dt=−tcos tsin tC
∫
f '=1 g =−cos t 
∫ 2t sint 2 dt=∫ sin t 2 2t dt=.
[
]
dg
=2t dg =2tdt =.
dt
∫ sin  g  dg=−cos  g C=−cos t 2C
g=t 2,
Another type of integral is the, such called, definite integral. This type of integral is defined
by the relation
b
∫a
where:
f t dt=F b−F a 
F t=∫ f t dt
is the indefinite integral of f(t).1 In the course of the math classes, you will learn an important
identity between the definite integral and a Riemann sum, which in most of the cases is the basic
tool to construct integrals in physical problems:
b
∫a
n
f t dt=lim min  t  0 ∑i =1 f i  t i
i
i
where  i is the point within the interval  t i , and the intervals  t i cover the domain from ta
to tb. Due to this relation to Riemann sum, it is possible to give a geometrical interpretation to the
definite integral: it gives the area below the integrated curve (see fig. below).
Reference:
W. Żakowski, Matematyka dla kandydatów na wyższe uczelnie
1 We can prove this by considering a difference quotient (differentiating the definite integral):
t  t
∫a
t
t t
f t  dt−∫a f t dt =∫t
f t dt= f t c  t , where the last equality results from the mean
value theorem (the value of the integral cannot be larger than f max  t and cannot be lower than f min  t ,
but there must be some point t c for which the equality holds). If  t  0 , the result becomes equal to f(t)dt,
which gives the integrated function. Thus the definite integral is strictly related to indefinite, because the
differentiation gives the same results, i.e. integrand is the same function.
3. Kinematics
3.1. Displacement versus path
To understand the difference between displacement and path, consider the following figure:
The path length is equal to the sum of all traversed segments of this zig-zag trajectory. It is a scalar
quantity that has no direction. On the other hand, the displacement  s is defined as the distance
between the ends of trajectory and has a direction, and sense (which depends upon the choice of the
initial and terminal points—the vector should point to the terminal position).
The discrimination between path length and displacement is essential do discriminate
between two different physical objects: the average speed and average velocity. Speed is a scalar
quantity related to the path length l, while velocity is a vector quantity and relates to the
displacement  s :
〈 v 〉=
〈
v 〉=
l
(average speed)
t
 s
(average velocity)
t
3.2. Instantenous velocity
As we have mentioned in the mathemathical introduction, the average speed and velocity is
of limited use in practical applications. It is more convenient to consider instantenous speed and
velocity, which are defined by:
v=
dl
(speed)
dt
d s
v =
(velocity)
dt
3.3. Acceleration
In cases where the velocity is not constant, we also would like to measure its change. This can be
done by means of the acceleration, which gives the rate of change of the velocity:
dv
a= 

dt
3.4. Relations for the distance in an accelerated motion on a straight line
It is important to obtain some results from the definitions of instantenous velocities and
accelerations on a straight line. When the motion takes place on a straight line it is not necessery to
use vectors, and we can use scalar equations, which we can easily differentiate and integrate. Now,
let us see, what can be learned from the definition of acceleration, proposed above:
dv
∣∣⋅dt
dt
adt=dv
∫ adt=∫ dv
a=
v=a ∫ dt
v=atv 0
The v0 term above is just an integration constant set to provide initial velocity when t=0. We can
now put the last formula for the velocity into the definition of the velocity:
ds
dt
ds
atv 0= ∣∣⋅dt
dt
atdt v 0dt =ds
∫ at dt∫ v 0 dt=s
v=
s=a ∫ t dtv 0∫ dt
at 2
s= v 0 ts0
2
where we have again an integration constant s0, which serves to have the initial displacement equal
to the s0 rather than to 0. This way we have obtained a „famous” result for the distance travelled in
an accelerated motion, which you probably know from the earlier education. However, this time
you don't have to memorize it! It is sufficient to know what is the velocity, and what is acceleration
to derive the formula without any problems. Moreover, you are now able to calculate the distance
travelled by an object that follows an acceleration which is not constant! Something you couldn't
handle earlier at school.
3.5. Angular velocity and acceleration
When an object does not move forward, but instead rotates, it is convenient to measure its angular
position φ instead of its position s.The angular position can undergo changes, i.e. a body can pass
some angular distance Δφ in a specific time Δt, giving rise to the average angular velocity ω
〈 〉=

t
As previously, it is possible to define the instantenous angular velocity and angular acceleration ε,
which measures the rate of change of the velocity. The formulas are dual to those from transitional
motion:
d
dt
d
=
dt
=
Also, exactly as we did for the translational kinematics, we can calculate the distance φ travelled in
an accelerated motion:
d
∣∣⋅dt
dt
d =dt
∫ d =∫ dt
= t0
d
=
∣∣⋅dt
dt
d =dt
d = t dt 0 dt
∫ d =∫ t dt0∫ dt
 t2
=
0 t 0
2
=
3.6. Relation between the angular velocity and acceleration, and tangential velocity and
acceleration
The angular displacement  covered by a body on radius r corresponds to a path length
of l= r . This gives the possibility to calculate an associated tangential velocity and
acceleration of a rotating body related to the angular quantities:
l= r (path length travelled)
dl
d
=r
dt
dt (tangential velocity)
v=r 
dv
d
=r
dt
dt (tangential acceleration)
a=r 
3.7. Centripetal acceleration
While performing rotations at constant angular velocity one could say that there is no
acceleration at all. However, velocity is a vectorial quantity. Since the direction of the velocity
changes, there must be some corresponding acceleration. This is called the centripetal acceleration.
Consider the following figure:
As the body rotates, the velocity vector rotates (it also moves its initial point, but this is not
relevant, and is actually not included in the definition of the velocity). Superposing the two initial
v vector which points to the center of the
points (on the right on fig. above), we can see a net  
rotation circle. What is important, the angle between subsequent velocity vectors is exactly the same
as between the subsequent radii that touch to the moving body. Therefore, we can estimate the
length of  v by using the formula for the arc length of an arc with radius equal to v. This would
give
 v= v
To obtain the acceleration, we divide  v by  t which results in:
a=

v= v
t
... or 'VW', if you find it easier to memorize. Utilizing the fact that v =r , we can represent the
above formula in more common representations:
a=
v2
= 2 r
r
This can be also represented in a vectorial manner:
a =×
 ×
 r

 r is perpendicular to the plane of those two vectors, and points in the
where the term ×
 ×
 r  points to the center of the
direction of the rotation, and then the cross product ×
rotation, and since the angles between the vectors undergoing cross producta are zero, the
magnitude of the resulting vector is  2 r .
3.8. Motion in noninertial reference frames
Until now, we have considered our formulas for cases of the, such called, inertial reference
frames. An inertial reference frame is such a frame that does not accelerate, i.e., as we will learn
soon, it is a reference frame that is not subject to the action of any net force. In case of the
noninertial reference frames, the reference frame itself can move with acceleration a, which
creates certain additional effects, which are not normally seen in inertial reference frames.
To start, let us take the position vector of a moving body with respect to an inertial
coordinate system (see fig. below):
r = r0 r1
e r= e0 r 0 e1 r 1
In the above, the symbol e denotes an unit vector aligned with the vector r . To obtain the
formula for the velocity, we need to calculate the first derivative of the above formula. We already
know how to differentiate functions of time, but how to differentiate the unit vectors? The unit
vectors cannot change their magnitude, so they can only rotate. The angular velocity vector gives
the speed of rotation, and is perpendicular to the plane of rotation. The cross product ×
 e is
perpendicular to 
 and e , so it is aligned into the direction to which e changes. The
magnitude of the change equals  r e dt=dt , because the radius of rotation re=1. Taking this all
together,
d
e
=×
 e
dt
Now we are ready to calculate the velocity and acceleration in a noninertial reference frame
by utilizing the above formula and the formula for a derivative of a product:
r =e0 r 0e1 r 1
dr d e
dr
d r d e0
v =
=
r 0e0 0  1 r 1 e1 1
dt
dt
dt
dt
dt
v =

 0×e0 r 0e0 v 01×e1 r 1 e1 v 1
This can be rewritten in a compact notation by taking into account that r =e r :
v =0× r0 v0
 1×r1v1
0 ×r0 is the circumferential velocity of the noninertial reference frame around the
Here v 
cir0 = 
 1×r1 is the
origin of coordinates in the inertial reference frame, and respectively vcir1 =
circumferential velocity of the considered body around the origin of coordinates in the noninertial
reference frame.
Now we can differentiate the velocity to calculate the accelerations.
d
d e
d r d e
dv
0
×e0 r 0 
0× 0 r 0 
0 ×e0 0  0 v 0 e0 0
dt
dt
dt
dt
dt
d
1
d e1
d r 1 d e1
dv 1

×e1 r 1
 1×
r 
1×e1

v e
dt
dt 1
dt
dt 1 1 dt
a=

this gives:
a =0× r0 

0 ×
0 ×r0
 0×v 0 
0 ×v0 a0
1× r1
 1× 
1× r1 
1× v1
1× v1a1
In practical considerations we usually consider noninertial frames which either separate from the
inertial reference frame at constant acceleration without rotations, or we consider a frame that
rotates while its origin of coordinates is immobilized inside of the inertial reference frame. In such
cases we can simplify the above by rejecting the terms dependent on  0 (multiplied by things
that depend upon r 0 ):
a = a01×r1

1× 
1× r12 1× v1 a1
The last formula can be written in a convenient form as
a
 = a0acir1  ac  acor a1
Where: a0 is the linear acceleration of the noninertial reference frame with respect to the inertial
reference frame; a cir =1×r1 is the circumferential acceleration of the body in the noninertial
2
reference frame ; a c =
 1× 
1 ×
r 1=− 1 r1 is the centripetal acceleration of the body in the
 1×v 1 is the Coriolis acceleration; and a1 is the linear
noninertial reference frame; a cor =2 
acceleration of the body with respect to the noninertial refernce frame.
3.9. Coriolis acceleration
The reader may ask now. What is the Coriolis acceleration which we have derived in
previous section? Why does it depend on the velocity? Consider the situation of a black body that
moves along the radius of the rotating plate below:
The body angular position remains fixed in the inertial refrence frame, i.e. it doesnt rotate
with the plate, it only moves along the radius (the solid line). However, when observed from the
noninertial rotating reference frame, one can see a curved trajectory (the dashed line), and not a
straight line! The body, as it moves otside of the center gains larger and larger circumferential
velocity. To enlarge velocity, one needs acceleration, and this is where the Coriolis effect occurs.
This effect is responsible for bending of the plane of oscillations of the Foucault pendulum,
as can be seen in the following figure:
It is also extremly important for the weather science since wind near the Earth surface is curved
according to the Coriolis force which—in cas of air masses moving from opposite directions, can be
the reason for a cyclone formation.
The origin of Coriolis acceleration can be thought of as being resulting from two effects (fig.
below): first of all, as the body moves along the radius, it gains circumferential velocity in a rate of
dv =dR
dv
dR
= = v
dt
dt
Second, (the figure on the right below), the velocity vector is changing direction, exactly in the
same manner as it was in the consideration of the centripetal acceleration. This gives another term
v , hence the factor „2” in front of the Coriolis acceleration.
3.10. The projectile motion
How does a cannon ball move after it is shot from the cannon? The motion can be decomposed into
two perpendicular components that can be considered separately. One component is the vertical
motion subject to the gravitational acceleration g, while the second does not receive any
acceleration (we assume the air resistance is negligible, and the cannon ball, or the projectile,
moves at constant horizontal velocity).
The components of the initial velocity can be specified as:
v 0x =v 0 cos 
v 0y=v 0 sin 
The maximum height can be calculated by:
{
{
gt2
H max=v 0y t−
2
v 0y
g=
t
H max=v 0y t−
H max =
t=
gt2
2
v 0y
g
v 20y v 20 sin 2 
=
2g
2g
The time to reach this height follows from:
v 0y
t
v 0y
t=
g
g=
Therefore, the range R, travelled in twice this time (the time of falling down is identical to the time
of rising up), we have:
2v0x v 0y 2 v 20 cos sin 
R=
=
g
g
4. Relativistic kinematics
4.1. The origin of the whole confusion: the Michelson-Morley experiment
The Michelson Morley experiment is one of the best recognized experiments that demonstrate the
independence of the speed of light value on the speed of the inertial reference frame where it is
measured. Michelson was born in Strzelno (currently Poland, previously Poland did not exist and
was these times partitioned among Russia, Prussia and Austria), while Morley was an American.
In this figure the incident light beam splits on the dichroic mirror and proceeds to two normal,
reflecting mirrors, that are located in the same distance from the beam splitting point. Therefore, in
case when the reference frame velocity equals zero, the light beams follow the same distance and
leave the interferometer in phase, and no interference can be detected.
In case when the reference frame moves with velocity v, the vertical ray should travel at a
velocity equal to c=  v 2v 2up  v up= c 2−v 2 (pythagorean theorem), while the horizontal ray
should move at a velocity equal to v horiz =c±v . The time of travel for the upper beam is equal to
(consider only the vertical component of the velocity and vertical distance traveled)
t up
L
= 2 2
2  c −v
2L
t up = 2 2
 c −v
In case of the horizontal beam, the situation is different,
t horiz=
L
L
2Lc

= 2 2
c−v cv c −v
The formulas differ and the beams need different time to pass their distance which means that they
will meet out of phase! They should interefere!
The details of the ray trajectories: they escape the dichroic mirror in different position when they
move thowards the detector and it results in a „double slit-like” behaviour.
But nothing like that was confirmed in the experiment. The researchers aligned their interferometer
along the Earth's orbit to have a contribution of 30km/s of the Earth's orbital speed to the device. In
such case, the waves should travell the interferometer at L=1m in t horiz−t up =3.3⋅10−17 s . This
corresponds to a displacement of 10nm which should cause a detectable change in the interference
pattern for visible light (it is about 0.02λ, and becomes twice as large after rotating the
interferometer by 90o). However, the investigators couldn't detect any interference patterns.
They repeated this experiment with multiple mirrors to enlarge the interferometer arms tenfold,
to11m. Still no interference was detected. This could either mean that the aether in which the light
was supposed to spread is centered at Earth and rotates with the eard around the Sun, or that the
light speed is independent of the velocity of the intertial coordinate frame. This was very awkward,
and it was not easy to accept these facts and the relativity theory which explains them.
4.2. Developing the Lorentz transformations
If the speed of light is constant in any refrence frame, then the light wave must form a sphere in
each inertial coordinate system. Therefore, the equations for the wave front in a moving and resting
coordinate frame must have the same form. Let us look if this really occurs in a classical
formulation, where we say that
x '=x−vt and t'=t (the galilean transformation), where x and t is
the position and time in the stationary inertial refrence frame (IRF) and x' and t' is the position and
time in the moving IRF:
{
x 2 y 2 z 2 =c 2 t 2
 x −vt 2  y 2z 2=c 2 t ' 2
Expanding the bracked on the left hand side of the bottom expression, and substituting t'=t we
obtain
{
x 2 y 2z 2=c 2 t 2
x 2−2xvtv 2 t 2  y 2z 2=c 2 t 2
These two expressions describe different wave fronts in terms of the unprimed coordinates! The
classical approach does not work! We must modify something. A reasoning could be made that if
the speed of light remains constant, and the path changes in the Michelson-Morley experiment, then
maybe something happens to the time so that c=
l
t
remains unchanged. Almost any function
can be expanded into a power series, and the simplest form of a power series, for small
corrections, can be represented by a linear dependency. So we assume t ' =tlx . This has also
the great consequence that the IRF still moves on a straight line with constant velocity—
something that could be disrupted by a more cumbersome time transformation. Now the equations
in primed and original coordinates read:
{
x 2 y 2z 2=c 2 t 2
x 2−2xvtv 2 t 2  y 2z 2=c 2 t 22c 2 lxtc 2 l 2 x 2
Rearranging terms, we obtain:
{
2
2
2
2 2
x  y z =c t
 
x 2  1−c 2 l 2 −2 x t v c 2 l y 2 z 2=c 2 t 2 1−
2
v
2
c
The term 2xt(v+c2l) on the left hand side does not fit to the sphere equation. We can eliminate it by
assuming l=−
v
2 . After this modification, we end up in the equations:
c
{
x 2 y 2 z 2=c2 t 2
x 2 1−

 
v2
v2
2
2
2 2

y

z
=c
t
1−
c2
c2
We can immediately see that to obtain the original equation we only need to divide the time and
position transformation by the factor

1−
v2
. Then the terms
2
c
 
2
1−
v
2
c
in the spherical wave
equation will not occur and it will be a sphere in deed, and not an ellipsoid. In general, the resulting
transformations read:
x '=
x−vt

1−
t−
t '=

v2
c2
v
x
c2
1−
v2
c2
and they are called the Lorentz transformations of the space-time. These are the transformations
that we must use when considering motion of bodies at speeds close to the speed of light. It can be
immediatelly seen that these transformations reduce to the classical galilean transformations for
small velocities, when the ratio
v
≪1 , resulting in x'=x-vt and t'=t.
c
To transform from the primed coordinate system to the non-primed coordinate system, we repeat
the above reasoning, but now the direction of the velocity becomes negative, so:
x=
x '−−v t '

=
x 'vt '

−v 2
v2
1−
c2
c2
−v
v
t '− 2 x ' t ' 2 x '
c
c
t=
=
2
−v 
v2
1− 2
1− 2
c
c
1−


i.e. this reduces to a change in the sign of the operation in the numerator of the transformation.
4.3. Relativistic summation of velocities
We already know about differentiation. Thus, we can ask about the representation of a velocity
under transformation of the coordinates. Consider two IRF-s: IRF1 that is stationary and IRF2 that
moves with velocity u. What would be the velocity w in IRF1 of a body that moves at velocity v in
IRF2?
w=
dx
dt
By means of the total derivative for the Lorentz transformation from the primed coordinate system
to the nonprimed (the velocity has then a negative sign):
dx=
dt=
dx ' vdt '

2
1−
v
c2
dt ' 
v
dx '
c2

v2
c2
1−
So, you probably ask now what is a total derivative? It is a way to express the increment of a
multiargument function, eg.
 f  x ,t =
f
f
df
df
 x
 t≈  x t
x
t
dx
dt
Ok, having the formula for dx and for dt, we can divide them to obtain the result:
dx dx ' vdt '
=
dt
v
dt '  2 dx '
c
dx '
v
dt ' dt '
w=
dt '
v dx '
1 2
c dt '
uv
w=
vu
1 2
c
w=
This formula, as you can easily check, does not allow to cross the speed of light. If you move in an
IRF at speed almost equal to c, and then you try to accelerate and increase the velocity by Δc, you
end in:
w=
c c
c c
=
=c
c  c c c
1 2
c
c
4.4. Space-time interval
Consider a body that moves in the x direction with velocity v. Let us calculate some quantity:
2
2
2
 s = x ' −c  t '
2
2
v

x

2
c2
2  x−v  t 
2
s =
−c
v2
v2
1− 2
1− 2
c
c
v
v2
2

t
−2

x

t
 x2
2
2
2
2
4
 x −2v  x  tv  t
c
c
 s 2=
−c2
2
2
v
v
1− 2
1− 2
c
c
2
v
 s 2= 1− 2   x 2−c 2  t = x 2−c  t 2
c
 t−
 
The calculations shows that the value of Δs is the same no matter the coordinate system we use for
calculations. Therefore it is called an invariant of the Lorentz transformations. It is interesting to
notice that the interval can be positive or negative which corresponds to time-like or space-like
events. The negative intervals are time-like (i.e. they can have a causal relations and one must
preceed the other in any reference frame) while the positive space-like events are separated by a
distance that is too large even for the light to pass in the corresponding time interval and therefore
they are not related causually.
4.5. Time dilation
Consider now how does a clock of a body behave as it moves. Let us observe an astronaut that
travels at velocity v. We observe him at time t=0 in position x=0, and thenafter, at time t=Δt we see
him in the position x=vΔt (because he moves at speed v). Let us transform these quantities to the
moving coordinate frame to calculate the time increase of the astronaut:
 t ' =t ' 2−t ' 1=
1

2
v
1− 2
c
[

 t−
 t ' = 1−

v
v
v  t − 0− 2 0
2
c
c
]
v2
t
c2
So, the time increments in the moving coordinate frame are smaller than in the stationary
coordinate frame. This leads to certain „apparent” paradoxes due to our limited imagination, an
example being the twins paradox, but we will discuss that later.
4.6. Lorentz contraction
We have found that the time runs slower for the moving astronaut. Consider now an astronaut that
travels from Earth to the Alpha Centauri. He has 4 light years to go. But if his clock runs slower
than ours, then the velocity that he would preceive after the travel, i.e. s/t' would be larger than the
velocity that we see on Earth! This would become even more awkward when he would travel at a
speed close to the speed of light, because then, evidently, the speed at which the Alpha Centauri
approaches him, would exceed the speed of light!
Such problems however do not occur because the moving bodies become contracted. So for
example when we travel to the Alpha Centauri and our clocks run slower, then at the same time we
observe a contracted distance to the target star and we need less time for the travel.
Let us again consider some body, for example the rocket of the astronaut. In his primed coordinates
the rocket can take coordinates x1'=0, t1'=0 and x2'=L, t2'=0 (t1'=t2', because the length makes sense
as a difference in end positions only at the same moment of time). Transforming this to the
stationary reference frame, we obtain:
x 1=0
t 1=0
Lv⋅0
x 2=
=
v2
1− 2
c
v
0 2 L
c
t2 =
=
v2
1− 2
c
L


1−


1−
v2
c2
v
L
c2
v2
c2
which means that the x2 coordinate is obtained too late compared to x1. To correct this, we need to
consider where was the second end of the rocket t2-t1 before. This corresponds to a correction of the
distance by v(t2-t1), i.e.:
x 2=
L

x2 =
v2
1− 2
c
−vt 2
 
v2
L 1− 2
c


v2
c2
v2
x 2=L 1− 2
c
1−
Which means that the length of the rocket becomes contracted for the stationary observer with
respect to who it moves with velocity v. This holds in general to any object of length L0, whose
length contracts to L as it starts to move:
L=L0

v2
1− 2
c
This effect is known under the name of the Lorentz contraction.
4.7. The twins paradox
There is a well known paradox in the special relativity. We consider two twin astronauts. One
remains on Earth while the second makes a distant travel at speed of light and comes back to his
brother. After the travel, the brother got old, his hair is gray, but the other twin, due to the time
dilation did not become older, and is still very young.
How can this be if the same reasoning could be undertaken from the point of view of the astronaut
in the rocket, who can see his brother leaving him at speed close to speed of light, and aging slower
than the astronaut!
But the brother on Earth does not change the inertial reference frames. The astronaut does. There
is no reason to assume that something wrong happens to the clocks of the brother on Earth, but the
astronaut clock's may behave strange because they flip from one inertial reference frame to another
and we know there is a lot of unusual things at high velocities, like for example broken simultanity
of events. I.e. being back on earth and seeing the other brother young (two events) does not have to
be simultaneous after moving back to the brother's coordinate system!!!
To understand what happens, consider a simplified problem of two twins, where one of them travels
to some distant star system (say, 50ly away). The first brother looks on the second brother, who
moves at speed 0.9999c (i.e. indistinguishable from c for the observer), and reaches the star after 50
years. However, the astronaut has a different perspective. In his (moving) coordinate system, the
distant star approaches him at 0.9999c, and he remains at rest. The distance between the distant star
L=L0  1−0.9999 2 . So the astronaut is
and the Earth contracts due to the Lorentz contraction!
L L
close to the distant star after t= = 0 1−0.99992=t earth  1−0.99992 . So the escapade took
v v
him much less time than could be seen by his brother. After he got to the desired position, he stops
immediatelly, to remain in the point of interest. We Loretnz-transform the event
 L ,t earth  1−0.99992  to the stationary brother's inertial reference frame to obtain the time
increment euqal to:
t earth 1−0.99992 
t final−t initial =
v
L
2
c
1−0.99992
t=t earth
v
L
2
c
−
1−0.99992
0
(please note that we need to consider the difference between the initial and the terminal time
calculated with respect to the moving frame coordinates).
Questions
1. Tell the difference between instanteneous velocity, average velocity, and speed.
2. Define the acceleration and derive the expression for distance travelled in a motion with
constant acceleration.
3. Define angular velocity and angular acceleration, derive the formula for the travelled angle
if angular acceleration is constant.
4. Derive the formula for a centripetal acceleration.
5. *Derive the general formula for accelerations in noninertial reference frames (by
differentiation of the radius vector, including the differentiations of unit vectors).
6. Interpret the Coriolis acceleration.
7. Derive the reach and maximum height in a projectile motion.
8. Describe the Michelson-Morley experiment.
9. Derive* or at least explain the origin of the Lorentz transformations.
10. Derive the relativistic summation of velocities.
11. Describe the idea of a space time interval.
12. Derive the time dilation formula.
13. Derive the Lorentz contraction formula.
14. *Describe the twin's paradox.
Refrences:
1. A. A. Michelson, E. W. Morley, On the relative motion of the Earth and the lumuniferous
Ether, Am. J. Sci. 203, 22,1887.
2. R. Feynamann, The Feynamann lectures in physics, Addison-Wesley, 1977.
3. J. K. Goyal, K. P. Gupta, Theory of relativity, Krishna Prakashan Media, 1975.
4. S. Kończak, Podstawy fizyki, Wyd. Pol. Śl., 1998.
5. Dynamics
5.1. Newton's first law of dynamics
The first law of Newton states that a body remains at rest (or moves with a constant velocity along
a straight line) when there is no force acting on the body or all of the acting forces cancel each
other.
This formulation actually defines the idea of a force: it is a phenomenon that can start the motion.
So if we push a body, we act with a force; if the body slides down from a mountain, it receives a
gravitational force; if the body becomes attracted to a magnet, it receives a magnetic force, and so
on.
5.2. Newton's second law of dynamics
The most widely used law of dynamics is certainly the second law of dynamics. This law relates the
body acceleration to the forces that act on the body:
n
m
a =∑i=1 F i
Which means that the vectorial acceleration of the body, multiplied by the mass m, equals to the
vectorial sum of all forces
F i which act on the body. The first law of dynamics is a direct
consequence of the second law when the right hand side reduces to zero. In such case the body
acceleration equals zero, and it moves along a straight line with constant velocity (i.e. it cannot
change the velocity due to a zero acceleration).
It is interesting to note that the heavier is the body (the larger mass m), the lower the acceleration
subject to a given force. Therefore, mass can be considered a measure of inertia (or resistance) to
the changes in motion.
5.3. Newton's third law of dynamics
If a given body acts on another body with force
 , then the second body reacts to the first
F
body with a force of the same magnitude, but opposite sign, i.e. − 
F . An example of this
situation is the gravitational force that is pushing us downwards to the center of the Earth where we
actually don't move because of the resistance due to the normal force of the surface that supports
our weight.This also holds in noninertial reference frames (figure out an example!).
It is important to stress that the reaction force is applied to a different body than the original
force! It cannot be introduced into the summation of the second law of dynamics, because it is
related to some different mass m!
5.4. The momentum
An important quantity in the mechanics is the momentum,
p =m v
It is a measure of the „amount of motion”. For example a body of mass m, that moves at speed v,
has the same amount of motion as the body of mass 2m, that moves at speed v/2. Due to the
definition, one can immediatelly realize that the idea of momentum allows to express the second
law of dynamics in a compact way by:
n
d p
=∑i =1 F i
dt
This can be easily checked after substituting
p =m v to the above equation. This equation has an
important consequence. In case when the system is not subject to any external net force, the right
hand side of the equation remains equal to zero. This means that the changes in the momentum are
equal to zero. As a consequence, we can formulate the momentum conservation principle: the net
momentum of a system remains constant if there are no external forces acting on the system.
The momentum conservation principle is very useful in calculating the results of the interactions
between bodies. For example if we have a boy that weights m=70kg and jumps at velocity v=3m/s
into a resting boat of mass m=30kg, we can calculate the resulting speed of the boat with the boy
inside:
70kg⋅3m/ s30kg⋅0m/ s=70kg30kg⋅v
v=2.1m/ s
5.5. Work
Work can be defined, as the force exerted on a body throughout a specified path. In differential
terms we say:
⋅d l
dW = F
This can be expressed in terms of the force and displacement components by expanding the dot
product:
dW =F x dxF y dyF z dz
Such formulation is not suitable for the integration because it contains too many differentials.
However, having the trajectory parametried by functions
we can say that dx=
x= x t ,
y= y t  , z =z t  ,
dx t 
dy t
dz t 
dt = x ' t  dt , dy=
dt= y ' t dt , dz =
dt=z ' t dt . In
dt
dt
dt
such case it is possible to integrate the work equation with respect to the parameter t (which can be
the time for example):
t1
W =∫t F x  x t  , y t , z t x ' tF y  x t  , y t , z t  y ' t F z  x t , y t  , z t  z ' t dt
0
This formulation is known in mathematics as the line integral. It takes a particularlly simple form
in two-dimensional cases, when the trajectory is prescribed by
y= f  x :
x1
W =∫x F x  x , f  xF y  x , f  x  f '  x dx .
0
To interpret work in simple terms, we could say that it measures the effort, that we need to put into
some task. For example if we want to push a heavy block over 1m then we need a smaller effort
than to push it over 10m. Similarly, when we need to drop coal to the basement and we have 5 tons,
we need to make more effort than to drop 2 tons... How can we relate the later to the work? Don't
you wave with the spade on a nonzero distance when performing the task?
It is also important to comment the obvious observation that when you support a heavy object on
your arms, you become tierd, however you dont move it on any distance! How it is possible to spent
work on such task??? The answer is in the microscopic behaviour of the muscles, where the muscle
fibres constantly shake moving there and forth. They actuall perform work, and burn the energetic
chemicals like ATP. We just cannot observe it on a macroscopic level.
5.6. Mechanical energy
Energy can be considered as the ability to perform work.For example, if you climb up on a
mountain with your car, then the car obtains the ability to perform work without burning fuel. It can
just roll down. This kind of energy is called the potential gravitational energy. To obtain a relation
for its magnitude, consider the work done against the gravitational force (Q=mg) when climbing up:
h1
h1
0
0
W =∫h Qdh=∫h mg dh=mg h 1−h 0
If we would take the reference height h0 equal to 0, then the relation for the work would read
W=mgh. This is the expression for the potential energy:
E p =mgh
It goes without saying that when rolling down, the potential energy decreases, and the car
accelerates. The potential energy converts into a kinetic energy. Just take a look. The gravitational
force F sets now the acceleration of the rolling car:
2
2
l
l
l
t
v
 ∫ m
 ∫ m d v ⋅dl=
 ∫ m d v ⋅v dt=∫ mv⋅d v = mv 2 − mv1
⋅dl=
W =∫l F
a⋅dl=
l
l
t
v
dt
dt
2
2
2
2
2
2
2
1
1
1
1
1
Again, if we set the reference (initial) velocity equal to zero, we end up in a formula for the kinetic
energy which reads:
E K=
mv 2
2
5.7. Power
It is often important to consider how much work is done in a specific amount of time. This is
measured by the power:
p=
dW
dt
The power can be easily calculated for case when an object moves against some constant force (for
example a car driving against the air resistance).:
p=
 F

⋅ds
 ⋅ds
dW F
 ⋅v
=
=
=F
dt
dt
dt
This formula can be used for example to calculate the power required by a car to drive at certain
speed in presence of the air resistance, which can be described by the formula:
F =C d  A
v2
2
where C d is the aerodynamic coefficient, for cars equal to 0.2-0.3 (assume 0.3); =1.225
kg
m3
is the air density, A is the cross section area of the car surface that attacks the stream of air (car
width equals 1.8m, height 1.7m, so A=1.8·1.7m2) and v=200km/h is the velocity of the object.
Subject to this data, F=1735N. This requires a power p=Fv which equals to 96kW (or 131hp,
horsepower). The requirement could be smaller if the aerodynamics of the car was better.
To put the concept of power in a right context, few examples of power consumption. You already
have an example of the car. A truck can consume up to 500hp (370kW). A train locomotive
consumes 2.5MW (3400hp). A nuclear power plant can produce up to 8GW.
5.9. The second law of dynamics in special relativity
To understand the second law of dynamics in the special relativity, one needs to consider the
formulation:
 = d p
F
dt
together with the specification of the momentum itself in the special relativity. To understand the
latter, consider a relativistic collision of two pool balls:
Let the ball on the left be called „A” and the ball on the right „B”. Because the collision is
symmetrical, in the table reference system we would expect the same velocity component of the
balls in the y direction after collision.
But the momentum should be conserved in any inertial coordinate system! If the balls move at
relativistic speeds, there occurs the time dilation. So, for example, after collision the observer of
„A” can see the dashed line to move away at velocity v (e.g. 10 meters in each second) and the
same should be found by the observer „B”. However, from the point of view of „A”, observer „B”
does not move that fast. If the y component of the velocity is insignificant, one can omit the Lorentz
contraction in the y direction. But the time dilation applies. The observer „B” from point of view of
1s
„A” will travell the 10m in a dilated second, i.e. in
1−v 2 / c 2
. So the momentum, of „B” would
be equal to:
.
sB
p B =mv B=m =m
tB
sA
tA

2
1−

=mv A 1−
v BA
c
2
v BA
c
2
2
and would be different from the momentum of the „A” ball:
p A=m v A
So the momentum would not be conserved! The rescue could be to postulate that the mass
transforms with velocity as:
m=
m0

2
1−
v BA
c2
where m0 is the rest mass, and m is the relativistic mass but because such relativistic mass is an
apparent construct (for example it is not a source of gravitation—you cannot expect for example
that an accelerated electron, which mass increases and length contracts will change into a black
hole), we prefer to say that the relativistic formula for the momentum is in general not p=mv but
p=
mv

1−
v2
c2
where m stands for the normal rest mass. The second law of dynamics in such case reads:
d
F=
[ ]
mv
v2
1− 2
c
dt
Reference: M. Fowler, UVa Physics: Relativistic Dynamics, 2008.
5.10. Conservative and non-conservative forces
The work done by the forces,
W =∫ F  r ⋅d r
Can depend on the trajectory (for example the work done against the frictional forces, which we
will meet later in the course),
or it can be independent of the trajectory, and then the above integral can be evaluated as a definite
integral using „potential”, where the potential at the end point subtracted by the potential at the
start point gives the work done in the movement:
In this example the work done to lift the ball is the same. However, on the right hand side you have
to push harder, but this is not consumed by gravitation! This „extra” energy is conserved into a
nonzero kinetic energy—in case B the velocity is nonzero!
6. Friction forces
The first person in history who investigated the phenomena of friction was Leonardo daVinci. He
has observerd that the friction force is proportional to weight and that it is independent of the
contact surface. He didn't distinguish however between the static and kinetic types of friction.
Currently, we make frequently use of the three laws of friction, two of which were formulated by
Amontons, and one which was formulated by Coulomb (a very talented sciencist, mostly known for
his work on electrc charges):
1. The magnitude of the friction force is proportional to the normal force by which the body
acts on the surface (Amontons). This can be formalized by defining the friction coefficient
as the ratio of the friction force F to the normal force N:
=
F
N
2. The friction force does not depend upon the size of the contact surface (Amontons),
3. The kinetic friction does not depend upon the speed of sliding (Coulomb).
6.1. Static and kinetic friction in sliding
The earliest models to explain the phenomenon of friction considered the climbing of the asperities
of one body on the asperities of the other body. This is the typical explanation that you frequently
learn in the high school. For example, Euler in 1748 considered the following picture:
In this case, the component mg sin  of Q must be equal to
coefficient  :
F cos  which results in a friction
F cos =mg sin 
F cos =Q sin 
sin 
F =Q
cos 
F =Q
=tg 
Earlier, in 1737 Belidor considered climbing of spherical asperities
A figure from the Belidor work from 1737 in Architecture Hydraulique
which resulted in a friction coefficient equal to 0,35 which he thought to be the universal friction
coefficient (i.e. equal for any considered body).
The theories based upon asperity deformation can give some insight into the friction phenomena but
do not explain all of them. For example, there exists a material—mica—which is smooth on a
molecular level and still displays friction! To overcome such difficulties, Bowden and Tabor have
proposed an adhesive theory of friction, where they assume that bodies—when connected—adhere
to each other due to molecular interactions which need to be broken in order to start the motion
which results in friction.
Within the framework of this theory it is very easy to understand the apparent independence of
friction on the contact area of the bodies. This is because the bodies do not contact on their whole
length but rather this occurs on spiky asperities:
After being loaded by the force, the asperities begin to deform by melting plastically. They do so
until the load divided by the contact area is larger than the melting pressure:
F
P melt
A
As the area grows and grows due to the plastic flow, the limit of the melting pressure is nearer and
nearer, and finally, the area growth is stopped. So, no matter the apparent contact area due to
geometrical dimensions of an object—the real contact area limits at
A=
F
.
P melt
This theory also explains the relation between the kinetic and static coefficient of friction (the
coefficient of friction K that is measured in motion is typically lower than the coefficient S
measured in static position). It is thought that the formation of adhesive contacts between bodies
requires a diffusive crossing of a layer of contaminants and lubricants, which requires time. This
time is lacking in motion and the contacts occur to a smaller degree than they occur in the static
position.
In fact, Coulomb did a series of measurements of the friction in wood, where he actually could
measure the growth of the friction coefficient with time!
6.2. Rolling friction
To understand the phenomena involved in the rolling friction, consider the figure below:
As the body rolls, it deforms on an asperity and receives a braking force (also energy is consumed
to provide the deformation). When the body passes the asperity, it starts to act on the body in
different direction, and the elastic energy is converted to the kinetic energy. Such process is never
ideal, and there occur losses of energy (for example the rubber may not expand fast enough to
release all of its elastic energy; some energy is lost for the internal rearangements of the rubber
molecules in the process of internal friction, etc.). In effect, the body is loosing energy durring the
roll and it slows down. This is the rolling friction.
To understand this phenomenon to a better degree, consider a pure deformation of the rolling body
without any obstacles:
It can be seen that due to the deformation, the support point of the body moves by  from the
center and the normal force N gives now a nonzero torque = N which opposes the motion (
 is the projection of R on the perpendicular direction to N)! The torque that must be applied to
the body to overcome this braking torque equals
F t R=

F t= N
R
F t =t N

t =
R
which gives a great conclusion that the rolling friction force should decrease with the radius of the
rolling body (provided the deformation size  remains fixed).
Questions
1. Define and explain the three laws of dynamics.
2. Define and interpret the idea of the momentum and specify the momentum conservation
principle. When is the momentum not conserved and how does it change?
3. Derive the formula for work on a curvilinear trajectory in a 3D force field.
4. Derive the potential and kinetic mechanical energy
5. Define the power and give the limit for a force exerted by an object of specified power at
given constant velocity.
6. Derive* or specify and comment on the 2nd law of dynamics in Special Relativity.
7. What is the difference between conservative and non-conservative forces.
8. Define the three laws of friction.
9. Explain why does the friction not depend upon the contact area (Bowden-Tabor theory).
10. Explain the origin of the rolling friction.
Reference: P. Borys, Foton 106, 4-26,2009, F.P. Bowden, D. Tabor, An introduction to tribology,
Anchor press, 1973; C. M. Mate, Tribology on the small scale, Oxford Univ. Press, 2008.
7. The rigid body dynamics
7.1. Torque
Did you ever try to remove a nut from the screw?
It can become quite easy when you have a long arm of the key and is extremly difficult with a
small key.
It seems that the driving force for rotations is not just force as was in the transitional dynamics but
some other quantity, that depends also on the distance from the rotation axis. This quantity is called
the torque and is defined by


R× F
= 
where R is the radius from the rotation axis where the force F is applied.
7.2. The moment of inertia and 2nd law of dynamics
Think about rotating a stick:
In the upper case it is much easier to start the rotations than in the bottom case. Why is that the
case? It seems that not only mass is what opposes rotations, but also the radius of the mass from the
rotation axis. The further the mass from the rotation axis, the larger linear velocity it must have to
move and the larger acceleration is required to establish this velocity, and consequently, a larger
force is required for establishing the acceleration.
Consider mass m on radius R from the rotation axis. To provide an angular acceleration =
a
,
R
the second law of dynamics states:
m R= F
2
 m R =RF=
I =
which can be also written in a more general vectorial manner as the second law of dynamics for
the rotational motion of a rigid body:
I =

where I is the moment of inertia, that was equal to mR2 for a point mass and for a more general
rigid body it is the sum of the moments of inertia of all mass points:
N
I =∑i=1 m i R2i
where mi is the mass of the i-th point and Ri is its distance from the rotation axis. This summation
formula is often written in a continous limit of infinitely small masses mi.
2
I =∫M r dm
where the integration takes part over the whole considered mass m (typically parametrized by the
position parameter in relation to density). For example, a moment of inertia for a ring (or a hollow
cylinder) can be calculated by the following:
An infinitely small mass point can be identified by dm=Mdα/2π. Therefore:
R2 M
d
2
2
I =MR
2
I =∫0
Another example is to calculate the moment of inertia of a stick with respect to one of its ends:
L
I =∫0 M x 2
3 L
∣
dx
L
Mx
ML
I=
=
3L 0
3
2
Having the formula for a ring, we can calculate the moment of inertia for a solid disc (or a solid
cylinder):
Rmax
I =∫0
2
MR
2 R dr
 R2max
2M R
3
R dr
2 ∫0
 R max
4
2M R max
1
I= 2
I = M R 2max
2
Rmax 4
I=
max
Having the formula for a solid cylinder, we can derive a formula for a hollow cylinder (outer radius
r2 , inner radius r1, mass of the removed part—m, mass of the remaining part—M):
 mM  r 22 m1 r 21
−
2
2
r 21
m= M m 2
r2
2
2
2
Mr 1mr 2mr1 mr12 M  r 21r 22 
I=
−
=
2
2
2
I=
Where the middle relation follows from the proportion between mass and cross-sectional area of the
inner cylinder and of the full cylinder with radius r2.
Other important formulas for the moment of inertia include (Dietłaf, Jaworski: Reference in
Physics):
Body
Moment of inertia
Stick of length L, mass M along its end
ML2/3
Stick of length L, mass M along its center
ML2/12
Hollow cylinder, radius R, mass M, along its rotation axis
MR2
Solid cylinder, radius R, mass M, along its rotation axis
MR2/2
Ball, mass M, radius R along its center
2/5MR2
Sphere, mass M, radius R along its center
2/3MR2
7.3 Steiner's theorem
Once we have a moment of inertia given for the rotation axis that is passing the center of the mass
(ICM), we can calculate the moment of inertia of a body along any other parallel axis separated by
radius r by:
I =I CM mr 2
To prove this relation, consider a general form of the ICM:
N
I CM =∑i=0 mi  x 2i  y i2
Now let us shift the rotation axis to r in x direction:
N
N
N
N
I =∑i=0 mi [ x ir 2 y 2i ]=∑ i=0 mi  x 2i  y 2i ∑i=0 m i r 2 ∑ i=0 mi r xi =I CM mr 2
because the last smmation vanishes since it is calculated with respect to the mass center (i.e. it has
the same amount of positive mixi components as of the negative components).
Reference: Burton P., Kinematics and dynamics of planar machinery, Prentice Hall 1979.
7.4. Angular momentum
Consider the second law of dynamics in absence of any forces:
I
 =0
d

I
=0
dt
This can be integrated with respect to time to give a certain conserved quantity L:
L=I =const
This quantity is called the angular momentum. It is a dual quantity to the normal momentum of
translational motion, where the mass is replaced by the moment of inertai and velocity is replaced
by the angular velocity.
The conservation of angular momentum is important for many phenomena. The most widely
recognized is the spinning ice skater, who spins faster after contracting her arms and legs towards
the rotation axis. Another example of the conservation of angular momentum is the rotation of stars.
As the interstellar matter contracts to form a star it is almost immobilized but there are always some
assymetries and a nonzero rotation. As the moment of inertia greatly decreases (the radius shrinks
after the star is formed), the star starts to rotate. It rotates extremely fast after a terminal contraction
to a neutron star which can have a diameter of only 10km instead of the original hunderths of
astronomical units of the planetary gas (1AU=150mln km). In fact, neutron stars can perform
hunderths of rotations per second. This causes their shape to become elliptic.
The angular momentum is also used for the construction of a giroscope, a device that is (for
example) being used to show the horizon in the airplanes. It consists of an extremly fast rotating
disc and it has thus a large angular momentum. The torques experienced during the flight cause
changes in the angular momentum:
d L

=
= 
R× F
dt
But if the L by itself is very large, then small changes dL cannot change the direction of the artificial
horizon:
Here, ΔL corresponds for example to the change in the angular momentum after 10 evolutions of
spinning around the long axis of the airplane. Assuming that the speed of rotations does not change
(i.e. the magnitude of the vector L=Iω remains constant), one can derive an equation for the
precession:
=

d
L d I
 d I  e dL e dL
 p× L
=
=
=
= e  L  p×e =0 p × 
L =
dt
dt
dt
dt
dt
(recall the derivative of the unit vector that we have considered when deriving accelerations in
noninertial reference frames).
7.5. Energy in the rotational motion
As the parts of the body rotate, they posess certain speeds around the rotation axis. Such speeds
give contributions to the kinetic energy of the movememnt:
n
E k =∑i=1
n
mi v 2i
mi r i2  2  2 n
I 2
=∑i=1
= ∑i=1 mi r 2i =
2
2
2
2
7.6. Work in the rotational motion
When the mass points are displaced under a torque around the rotation axis, a work can be
calculated by:

⋅dl=Fdl
dW = F
cos 
For a force that acts in the direction of the rotation (the component of torque), we say α=0, and
dl=dφR. Therefore:
dW =FRd = d 
In such case
2
W =∫  d 
1
7.7. Summary of the similarities between translational and rotational motion
translational
rotational
Position x
Angle φ
Velocity v=dx/dt
Angular velocity ω=dφ/dt
Acceleration a=dv/dt
Angular acceleration ε=dω/dt
Mass m
Moment of inertia I
Second law of dynamics: F=ma
Second law of dynamics: τ=Iε
Momentum: p=mv
Angular momentum: L=Iω
Work: dW=F∙ds
Work: dW=τdφ
Power: P=Fv
Power: P=τω
2
Kinetic energy: E=mv /2
7.8. Collisions in 2D
Consider two rigid bodies that collide in two dimensions.
Kinetic energy: E=Iω2/2
n and exert a force in point p. The interaction force
The objects collide in the collision plane 
obeys the 2nd law of dynamics by:
d p 
=F
dt
Now taking into account the velocity changes under the influence of a force
m
a= 
F

d
v F
=
dt m
we can say
d
v d p
=
dt mdt
cancelling the time infinitesimal and integrating we obtain:
p
m
p
v 2=v 1
m
 v =
Similar reasoning can be performed for the rotational motion:
d

=

dt
d


I
=r × F
dt

d
 r × dp
I
=
dt
dt

I d =
 r × dp
r × p
=
I
I
so:

 2=
1 
r ×  p
I
Now we can consider two bodies that approach each other at velocities v 1A , v1B , 1A , 1B . The
resulting equations are:
 p
v2A= vA1 
m
 p
v2B= vB1 −
m
AP
 p
r ×
A2 =1A
IA
BP
 p
r ×
B2 =1B−
IB
because this is a 2D problem, we can simplify the cross product (and neglect the z components of
the vectors—angular velocity will become a scalar in further reasoning):
∣
∣
i
j
k
rx
ry 0
 p x  p y 0 =k r x  p y −r y  p x = k r⊥⋅ p
i j k
rx ry 0
where r ⊥ stands for the r vector with interchanged x and y coordinates, and the new x coordinate
is taken negative. This makes then:
p
m
p
v 2B=v 1B−
m
AP
 p
r ⋅
 A2 =1A ⊥
IA
BP
 p
r ⋅
 B2 =1B ⊥
IB
v 2A=v 1A
This makes a system of 4 equations and 5 unknown variables. To deal with this, we introduce after
Newton the coefficient of restitution e, which describes the degree of elasticity of the collision. A
completely elastic collision sets e=1, while a plastic collision sets e=0. There is a lot of
experimental work available with coefficients of restitution of real bodies (e.g. cars, where
e=
1
). The coefficient is defined by:
v [ m/ s ]
2

v AP
v 2BP ⋅n =−e  
v 1AP −v BP
n
2 −
1 ⋅
where the velocities are not the velocities of the center of the mass, but the velocities of points of
the body that touch the collision site p:
AP
v 2 =v 2 
 2×r
AP
However, we don't treat angular velocity as a vector. But we know that it should point up from the
considered XY plane, in the direction of the k unit vector. So:
 AP 
 k ×r AP =  r AP
r AP
x j−r y i =
⊥
AP
AP
v 2 = v2r ⊥
with similar relation for body „B” and for pre-collisional velocities (index „1”). (Notice: the angular
velocity in the above is taken without a vector, just as a magnitude! The rotation direction is
v AP
v 2BP  n
perpendicular to rAP and is determined correctly by r⊥AP . Now we can subtract  
2 −
to obtain:
 v 2AP−v 2BP  n =.
v A2⋅n2A r AP
 −v B2⋅n− 2B r BP
 =.
⊥ n
⊥ n
AP
AP
BP p n
 p n
r ⊥ r ⊥ ⋅
r AP
 p⋅n
 p⋅n
A
A AP
B
B AP
⊥  r ⊥ ⋅ 
v 1⋅n 
1 r ⊥ n 
−v 1⋅n 
−1 r ⊥ n 
=.
mA
IA
mB
IB
AP
2
BP
2
 p  p r ⊥ ⋅n   p  pr ⊥ ⋅n 
 v 1AP−v 2AP  n 



mA
IA
mB
IB
 p= p n . Evaluating the transmitted magnitude of the
where we made use of the identity 
momentum, we obtain:
AP
BP
−1ev 1 −v 1 ⋅n
 p=
AP
2
BP
2
1
1  r ⊥ ⋅n   r ⊥ ⋅n 
 

ma mb
IA
IB
Having  p in terms of the pre-collisional velocities, it is straightforward to determine the postcollisional velocities and angular velocities.
Questions
1. What object replaces forces in rotational dynamics? Could you give some reasons?
2. Derive the second law of rotational dynamics for mass m rotating at radius r, subject to a
force F. Define the moment of inertia.
3. Derive the moment of inertia for a solid cylinder.
4. Prove the Steiner's theorem.
5. From second law of dynamics derive the principle of angular momentum conservation.
Derive the equation of precession.
6. Derive the formula for work and energy in rotational motion.
Reference: Chris Hecker, Game Developer, 11, 1997; J. Wierciński, A. Reza, Wypadki drogowe.
Vademecum biegłego sądowego. IES, 2010.
8. Analytical mechanics
8.1. Lagrange equations
Consider a special form of an energy function, that is called Lagrangian:
L=E K − E P
Which is made of the difference between the kinetic energy and the potential energy. For example if
we consider a translational motion in the x direction, the Lagrangian can have the form:
2
L=
where
m ẋ
− E p  x
2
ẋ stands for the time derivative of x, i.e.
respect to x or
ẋ=v . In such case, the derivatives of L with
ẋ take a special form:
∂ L −∂ E p  x 
=
= F  x
∂x
∂x
∂L
=m ẋ= p
∂ ẋ
Where the first equality resembles the formulation of a potential force. I.e. when you slide down by
dx to change the potential by dEp, a work has been done dW=-dEp, so the force must have been
involved by -dEp=Fdx, i.e.
F=
−dE p
.
dx
Comparing this to the second law of dynamics, we can state that:
∂L d ∂ L
=
∂ x dt ∂ ẋ
Which is called the Lagrange equation of mechanics. It can be proved by the variational calculus
that this form of equation would hold for any pair of the conjugated coordinates that occur in L. The
generalized momenta are typically denoted by letters p, while the generalized coordinates are
typically denoted with letters q. We will see this notation in the chapter devoted to the Hamilton
mechanics. For now, as an example you could consider angular motion with:
2
L=
2
I
I ̇
−E P =
−E P  , so that the equation would read:
2
2
∂L d ∂ L
=
∂ dt ∂ ̇
8.1.1. The Euler-Lagrange formula for variational problem
Consider a motion along a trajectory
y  x  that is perturbed to Y  x= y  x x .
Let us find a trajectory that minimizes the following integral:
b
∫a
f  x , y , y '  dx  min
Substituting the perturbed trajectory, we say:
b
∫a
b
∫a
f  x , Y ,Y '  dx  min
f  x , y  ,Y '  '  dx  min
To find the trajectory that minimizes the integral, we differentiate it with respect to  and
compare to zero:
∫a ∂∂ fy  ∂∂ yf ' ' dx=0
b
The second term can be integrated by parts taking into account that the perturbed trajectory must
start and finish in the unperturbed points (i.e. a =b=0 ):
b
∫a 


∂f
d ∂f
−
dx=0
∂ y dx ∂ y '
Since the perturbation can be any possible function, the interior of the brace must vanish, which
gives rise to the Euler-Lagrange equation:
∂f d ∂f
=
∂ y dx ∂ y '
Which is the same equation that we considered for the mechanics, just represented in the domain of
x, and not t... So the Lagrange dynamics describes a motion along a trajectory of minimum kinetic
energy and maximum potential energy.
8.1.2. Example of the Lagrange formalism application
The greatest utility of the Lagrangian formulation of the mechanics is to describe a motion subject
to a system of constraints. Example application for a bead fixed on a rotating circle made from wire:
The only degree of freedom of this bead is the angle  . The kinetic energy and the potential
energy can be described in terms of this angle:
mv2tangent mv2circumferential

2
2
2
2 2
˙
 m  R sin2 
T =mR 2 
2
2
E p =mgR 1−cos 
T=
Using that, we can construct Lagrangian:
L=mR 2
˙2 m 2 R2 sin2 

−mgR 1−cos 
2
2
and then put it into the equations for:
∂L d ∂ L
=
∂ dt ∂ ̇
which reduces to:
mR 2 2 sin cos −mg R sin =mR2 ̈
̈= 2 cos −g / Rsin 
that can be integrated.
8.2. Hamilton equations
It is possible to construct another function, related to the Lagrangian. It is called the Hamiltonian:
H = p q̇−L=2 E k − E k − E p =E k E p
(for example: if
p=m ẋ and q̇= ẋ ,
p q̇=m ẍ 2 )2. So Hamiltonian is the sum of the kinetic
and potential energy. It is interesting to calculate the derivatives of H with respect to p and q.
∂ H −∂ L
d ∂L
d
=
=−
=− p=− ṗ
∂q
∂q
dt ∂ q̇
dt
∂H
∂ q̇ ∂ L ∂ q̇
∂ q̇
∂ q̇
=q̇ p
−
= q̇ p
−p
=q̇
∂p
∂ p ∂ q̇ ∂ p
∂p
∂p
So, the Hamilton mechanics, while being based upon the Lagrange mechanics, offers very useful
equations for calculating the evolution of the generalized coordinates:
∂H
∂q
∂H
q̇=
∂p
ṗ=−
Questions
1. Explain the Lagrange formulation of mechanics.
2. Explain the Hamilton formulation of mechanics.
Refrence: J. R. Taylor, Classical Mechanics, Univ. Sci. Books, 2005.
9. Gravitation
Between every two mass bodies in the space there exists a force of gravitation. Newton postulated
a simple form of this force with
2 Remember: p are the generalized momentum coordinates, and q are the generalized position coordinates.
 =G Mm r
F
r2
This force is always attractive and depends upon the mass M of one body, mass m of the second
body, and upon the distance between these bodies r. The symbol r denotes the unit vector along
the line connecting the two bodies, and G=6.67∙10-11Nm2/kg2.
This force is different from any other forces because it acts on each mass point causing an uniform
acceleration of bodies in space:
Mm
r
2
r
GM
a = 2 r
r
m
a =G
It works independent of the mass m of a body. If you could shut down into a freely falling container,
everything inside would fall with the same acceleration and you would not notice any inertial
effects. You could conclude that you are in the inertial reference frame. This was the conclusion of
Albert Einstein: he said that the freely falling coordinate system is an inertial refrence frame.
Because it is an inertial reference frame, even the light inside of such container will „fall” in the
direction of the attracting mass. This is the basis of a process called „gravitational lensing” of light.
However, to calculate a proper deflection angle it is not sufficient to calculate (as did Henry
Cavendish in 1784):
[
z=b tan t
GM
b
GM
b
t=atan z /b
 v=∫−∞ a dt=∫−∞ 2 2
dt=∫−∞ 2 2
dz=
2
2
2
2
dz /b
bdz
b z  b z
b z c  b z
dt=
= 2 2
2
1 z /b b z
∞
∞
∞
]
/ 2
bGM
bc
∫− / 2 dt
GM
 b b tan t  bc
2
2
2
=
dt
/ 2
∫−/ 2

cos2 tsin 2 t
cos 2 t
=
GM
bc
∫−/ 2 cos t dt= 2GM
bc
/ 2
where z is the position of the photon along its straight line trajectory and b is the collision
parameter (minimum distance to mass M) and then to divide =
v
, because we would obtain
c
an angle that is twice too small (in such calculations we assume the ray moves almost along a
straight line and the conditions for the force equation depend only on z, not on b, but this is not the
source of the problem). Because the gravitational time dilation plays a role (it slows down the light
for the distant observer, exactly the same happens in a glass lens), Einstein derived identical
angular correction due to the time dilation by itself) as well as the space curvature, one needs to use
the formalism of the general relativity to obtain the final result:
=
4GM
2
c b
To give some hint against this correction, consider the following figure:
In the next chapter we derive a formula for gravitational time dilation,

 = t 1−
2GM
rc 2
Which means that the time runs slower in presence of gravitation, and the light (apparently) moves
faster (in fact the distance also undergoes relativistic modification in this coordinate system, and
velocity is constant). However, the light can pass from point A to point B in shorter time than when
it moved on a straight line, just as it will be shown by us in the future in the derivation of Snell's law
by the Fermat principle. So, we could try to use the Snell's law for refraction:
n 1 sin 1=n 2 sin  2
together with a formula for the „gravitational refraction index”,
n=
v0
=
vn
c

c / 1−
2GM
rc 2

= 1−
2GM
rc 2
Now consider the point in the figure, where the angles are shown. For that point we can write the
Snell's law as:
n sin =n
dn
dz sin d 
dz
Making use of the formula for the sine of a sum of the angles, we have:
n sin =nn ' dz sin cos d cos  sin d 
n sin ≈nn ' dz sin cos  d 
The derivative of n can be calculated easily after its simplification:

n= 1−
2GM
GM
GM
≈1− 2 =1− 2 2 2
2
rc
rc
c  b z
−GM
n ' = 2 2 2 3/2
c  z b 
Now substituting such n', and assuming that n is close to 1 (the major contribution originates from
n'), we have:
−GMz
sin  dzcos d 
c  z 2 b2 3/ 2
−GMz
bdz
zd 
0= 2 2 2 3/ 2 2 2  2 2
c  z b   z b  z b
GMb
d = 2 2 2 3/ 2 dz
c  z b 
0=
2
which gives identical contribution as we had in the corpuscular model! (the only difference is the
extra c factor in the denominator which results from the fact that we calculate the angle directly, not
the change of the corpuscular velocity, as previously) Assuming the equivalence principle, both
effects should act additively, thus giving an angle twice larger than predicted by the classical theory.
9.1. Gravitational time dilation
Consider a body that falls down from infinity onto a mass object M. It gains the energy:
E=
GMm
r
which converts into kinetic energy:
Ek =
mv 2
2
Equating these two energies we obtain the velocity of a freely falling body:
v=

2GM
r
Now, we can use this velocity in the time dilation formula to obtain the gravitational time dilation!

 = t 1−
v2
2GM
= t 1−
2
c
rc 2

i.e. the proper time in gravitational field (   ) runs slower than for a distant observer (  t ).
9.2. Orbital speed
Consider a body of mass m that enters an orbit around a body of mass M. To do this, the body must
attain an equilibrium between the centrifugal force and the gravitational force:
mv 2 GMm
= 2
r
r
GM
v=
=v I
r

So, for example, if Earth's mass is 6⋅10 24 kg , radius equals 6400km, one obtains v=7.9km/s.
9.3. Escape velocity
A higher velocity is required to leave the orbit of a stellar body. To calculate this velocity one needs
to consider the equality of the kinetic energy and of the potential energy of the body:
mv 2 GMm
=
2
r
2GM
v=
=v II = 2 v I
r

For the escape from the Earth this makes vII=11.2km/s.
To escape from the Solar system, one needs ( M sun =2⋅1030 kg ,
R Earth=1Au=150mln km )
vII=42.2km/s, but much of this velocity is given by the orbital speed of Earth around the Sun
(vI=29.8km/s), so the actual speed to leave the Solar System from the Earth surface is lower:
mv 2 GM Em m 42.2km/ s−29.8km/ s 2
=

2
RE
2
2GM E
v 2=
12.4km/ s 2
RE
v=

2GMe
12.4km /s 2
RE
v III =16.7km / s
9.4 Gravity assist and Hohmann transfer maneuver
Questions
1. Define the Newton's law of gravitation and tell why is this force so different than the other
forces? What was the Einstein's conclusion?
2. Define the gravitational lensing formula. What is the contribution of relativistic effects?
3. Derive the gravitational time dilation.
4. Derive the expression for an orbital speed.
5. Derive the expression for an escape velocity (from gravitational field).
10. Chaotic dynamics
Chaos refers to a set of phenomena which are characterized by deterministic mechanics but their
evolution is unpredictable due to extreme sensitivity to the initial conditions. Because we can't
assure infinite precision of the initial conditions (we always have some measurement error), we
have only limited possibilities to predict the evolution of the system. An example is the trajectory of
a mass point between circular scatterers:
The solid line and the dashed line start from a single point, but the dashed line has slightly different
initial angle. After three collisions the trajectories become unrelated! This is what we mean by
unpredictability. If the angular difference would be smaller, the error would require more collisions
to grow to the magnitude that decorrelates the two movements. But this would happen, and it would
not require too many collisions, because the error grows exponentially like
= 0 exp   t 
where  is called the Lyapunov exponent. Such an exponent can be easily visualised for some
simple biophysical systems. For example, the logistic equation, which describes the reproduction
of a species (with number of individuals equal to nk in k-th generation) in the limited avialability of
resources is written as:
n k1=ank 1−nk 
For certain values of a this equation is chaotic, i.e. oversensitive to the initial conditions. Consider
a=3.95. Take n0=0.2500 and then consider the case with n'0=0.2501. The trajectories are
summarized in the following table:
k
n
n'
|n-n'|
0
0.2500
0.2501
0.0001
1
0.7406
0.7408
0.0002
2
0.7588
0.7584
0.0004
3
0.7230
0.7237
0.0008
4
0.7912
0.7898
0.0014
5
0.6527
0.6558
0.0031
6
0.8955
0.8917
0.0038
7
0.3698
0.3816
0.0118
8
0.9205
0.9321
0.0116
9
0.2889
0.2500
0.0389
10
0.8115
0.7407
0.0709
11
0.6041
0.7587
0.1546
12
0.9447
0.7232
0.2215
13
0.2064
0.7908
0.5844
It is interesting to plot the differences between n and n' for subsequent k generations to observe the
exponential departure of one trajectory from another:
=∣n k −n ' k∣
k
In fact, such chaotic systems are used in the computer science as random number generators.
Questions:
1. Explain the meaning of Lyapunov exponent.
11. Fluid dynamics
11.1. What is a fluid?
The term fluids refers to all the substances that are able to flow and do not maintain a fixed shape.
The most common examples of fluids are gases and liquids. The fluid dynamics addressed to gases
reveals its importance for example in the studies of aerodynamics, while addressed to liquids
describes the flow of liquids in pipes, rivers, or in the human vienes.
Reference: D. Giancoli, Physics.
11.2. Fluid statics: Pascal principle and the pressure
The pressure p of a fluid is defined as the force F that is exerted by a surface element  s of the
fluid:
p=
F
s
The Pascal principle states that the pressure applied to the fluid changes increases the pressure
inside of the fluid everywhere by the same amount. This is very useful for the construction of a
hydraulic lift:
In this figure, a small weight exerts a small force on a piston with small surface. On the other hand,
the piston on the right has a large surface, k-times larger than the surface on the left. The force
exerted on this piston is:
p 1= p2
F1 F2
=
S 1 S2
F1 F2
=
S1 k S1
F 2=kF 1
and it may become sufficient to overcome the gravitational force of a car's weight.
11.3. Fluid statics: the Archimedes law
Before discussing the Archimedes law, let us consider the pressure distribution in a vertical tube of
fluid. The pressure results in general from the weight of the fluid that remains above the considered
level:
It is clearly seen that on height h, the weght equals to mg while m=ρV, where V=hS. As a result, the
pressure varies with depth of the fluid as:
p= g h
Knowing that the pressure varies with depth of the fluid it is easy to imagine that the pressure acting
on the bottom of any object immersed in the fluid will be larger than the pressure acting on its top.
The resulting force will push the body upwards and it is called the buoyant force. Take look at the
following figure:
The net force originating from the pressure equals to:
F =F 2−F 1= g h 2 S − g h1 S = g  h S = V g
which in other words means, that the buoyant force equals to the weight of the water pushed out
by the immersed object. This is the law of Archimedes.
11.4. Fluid dynamics: the streamline and laminar flow
In case when the fluid is in motion, and does not move too fast, it can be described in terms of a
such called laminar flow. In laminar flow the flow of the fluid can be decomposed into
streamlines, i.e. if you consider a cross-section of a fluid, then each element of the cross-section
follows an unique trajectory (a streamline) surrounded by the neighbouring streamlines and does
not cross the surrounding streamlines. The word „laminar” means „in layers”.
The opposite of the laminar flow is the turbulent flow where eddies occur, which absorb a lot of
energy due to internal friction (viscosity). This can be seen on the wind streamlines approaching a
wing profile (the upper streamlines are going in a laminar flow, while the lower become turbulent):
The origin of the vorticies in the lower case is the „vacuum” region after the wing profile which
sucks down the streamlines and causes their rotation.
The condition for a change from laminar flow to a turbulent one is that the Reynolds number, which
characterizes the system, should be sufficiently large, typically larger than 4000. This number
expresses the ratio of the inertial force to the viscous force ( F = A
v
, where  is the
l
viscosity coefficient, A is the contact area between a body and a fluid, v is the velocity of the fluid, l
is the thickness of the nonzero fluid velocity profile—you will learn about this in details in the
following paragraphs). Consequently, the Reynolds number may be evaluated as:
L
v
3 L
m 2

L
F
ma
t
t
t
ℜ= inertial =
=
=
=
2
F viscous
v
L
L
L
A
 L2
 L2
L
Lt
Lt
m
Now, expressing time by the velocity definition as t=
ℜ=
L
, we obtain:
v
 Lv

Where L is the characteristic length of the system, v is the speed of flow, and ρ is the density of the
fluid. How does it work? Obviously, to sustain a laminar flow, the damping in the system should be
large to attenuate any vortices. This requires a viscous force that is sufficiently large as compared to
the inertial force and results in a small Reynolds number. Otherwise, the vortices are not attenuated
and the Reynolds number is large.
11.5. The continuity equation
Consider an ideal fluid (nonviscous, incompressible fluid) that flows in a pipe which contracts its
diameter as shown in the figure:
Now imagine that we push some amount of fluid on the wide entry of the pipe. Let the volume be
equalt to V =A1  x1 . The same amount of fluid must exit on the right hand side: V =A2  x 2 .
In this situation:
A1  x 1=A2  x 2
We can divide the above by the corresponding time increment to obtain:
A1
 x1
 x2
= A2
t
t
A1 v 1= A 2 v 2
which is called the continuity equation. This is one of the reasons why rivers are fast in the
mountain, where they are not so wide, while they slow down a lot when they pass the lowlands.
The continuity equation can also be written in 3D. To start, let us rewrite the above in form:
A2 v 2 −A1 v 1=0
In 3D a similar expression can be written for all perpendicular axes. Consider a cubic element of
side lengths  x ,  y ,  z . Then:
1
 y  z v x  x x−v x  x  x  z v y  y y−v y  y x  y v z  z z −v z  z=0∣⋅
 x yz
results in:
v x  x x− v x  x v y  y y− v y  y v z  z z −v z  z


=0
x
y
z
which for an infinitely small volume element with  x  y   z 0 reduces to:
∂v x ∂v y ∂ v z


=0
∂x ∂y ∂z
which can be also written using the nabla operator
∇= i
∂  ∂  ∂
j
k
as:
∂x
∂y
∂z
∇⋅v =0
or
div v =0
because div (the divergence) is the name for a dot product between nabla operator and a vector.
11.6. Bernoulli equation
Consider now a flow that changes the altitude, pressure and velocity:
The action of pressure p1 on the left hand side overcomes the pressure p2 on the right hand side
resulting in a net pressure difference that creates a force which drives the motion of the fluid. The
fluid also changes the energetical state: after the fluid moves by the filled part from the left, a filled
part on the right occurs. Instead of a fluid at velocity v1 and height h1 we obtain the same mass of
the fluid in the streamline at velocity v2 on height h2. In such case, the work done by the forces on
both ends of the streamline serves to allow the change of the energetical state of the streamline (the
work-energy theorem), i.e.:
 mv 22  m v 21
−
 m g h 2− m g h1
2
2
 x 2 A2  v 22  x 1 A1  v 21
p 1 A1  x 1 − p2 A 2  x 2=
−
 x 2 A2  g h 2− x 1 A1  g h 1
2
2
V  v 22  V  v12
p1 V − p2  V =
−
V  g h2− V  g h1
2
2
p1 A1  x1− p 2 A2  x 2 =
Dividing by  V and rearranging the terms results in the Bernoulli equation for the
incompressible, nonviscous fluid:
p 1
 v 21
 v2
 g h1 = p 2 2  g h 2=const
2
2
11.7. Viscosity
Consider two surfaces lubricated with a syrpup between them:
It seems reasonable that the larger the upper body, the more difficult it is to maintain its movement.
Similarly, the higher velocity, the more difficult to move. All these effects are due to the internal
friction of the syrup (the fluid in general). The internal friction relates to breaking of the cohesive
bonds of the medium, i.e. the bonds that keep the molecules of the medium together. Newton
proposed a following formula to describe the viscous drag:
F = A
v
l
where F is the force of the viscous drag, A is the surface of the sliding body, v is its velocity and l is
the thickness of the fluid layer.  is a proportionality coefficient, called viscosity.
Why does the Newtonian viscous drag depend upon the surface? Well, the larger the surface, the
more cohesive bonds we need to break. Why does it depend upon the velocity? To explain this we
need to understand the velocity profile of the fluid under the moving surface:
The velocity of a fluid adjacent to the moving surface has the velocity of the moving surface. The
velocity of a fluid layer adjacent to the resting surface has zero velocity. In between we have
intermediate velocities. As the body moves, the adjacent layers of fluid slide over each other. The
higher the velocity of the upper plate, the faster the layers move with respect to each other and the
more cohesive bonds are broken per unit time, and the larger is the momentum loss of the fast layer
( F=
dp
). The force is thus proportional to the velocity.
dt
If the thickness of the fluid increases, the velocity changes decrease and the fluid layer adjacent to
the upper body receives less breaking of the cohesive bonds, and a smaller momentum loss. The
force is thus inversely proportional to the fluid thickness.
11.8 The Poiseuille law
Knowing the Newton's formula for the viscosity, we can derive a formula that describes a flow of a
viscous fluid in a pipe. Consider the following figure:
The situation is not exactly newtonian, because the subsequent layers of the fluid grow with radius.
But if we consider only a small thickness of the fluid between R and R+dR, the change in the layer
area is not significant, and we can use the differential formulation:
F = A
dv
= Av '
dl
Then we can build an equation:
dF =F  R− F  RdR
dF = A R v '  R− A RdR v '  RdR
dF =− d [ A Rv '  R]
Because the force is driven by the pressure (which is constant at the whole pipe cross-section), we
can write:
 p 2 R dR=− d [ A R v '  R]
Integration of the both sides gives:
 p  R2=− A R v '  R
The integration constant is zero, because the velocity has a maximum for R=0 (i.e. v'(0)=0).
Because
A R=2  R L , we can write
 p  R2=− 2  R L v '  R
− p R
v '  R=
2L 
− p R 2
v  R=
C
4L 
2
2
 p l −R 
v  R=
4L 
Where the integration constant was taken such to guarantee v(l)=0. The flow in a pipe is expressed
as the volume passing it per unit time. It can be calculated by:
l
Q=∫0 v  R2  R dR
 p l 2
Q=
l R−R 3 dR
∫
0
2 L
 p l 4 l 4 l 4  p
Q=
− =
2 L 2 4
8L
[
]
which is the Poiseuille equation.
11.9. The Navier Stokes equation for incompressible fluid
If we consider a fluid element in motion, we can try to describe the motion in presence of a pressure
gradient. We start with the second law of dynamics:
m
dv
=F  x− F  x x =− F
dt
dv
V
=− F
dt
dv
F
p
dp
 =−
=−
=−
dt
x A
x
dx
because we can say—in one dimensional case—that the volume subject to movement V =A  x ,
where A is the cross-sectional area of the moving fluid. A three dimensional generalization of the
spatial derivative is the gradient:

dv
=−grad p
dt
The derivative on the left hand side depends upon the change of the velocity with time, but also it
varies with position, and since the fluid is in movement, the different velocity from different
position will enter the current position after moving with velocity v. This gives in 1D:
dv ∂v
∂v
= v
dt ∂t
∂x
.
and in 3D, using the divergence:
d
v ∂
v
= v ∇⋅v
dt ∂t
Expanding the time derivative in the second law of Newton, we obtain the Euler equation3:
∂ v
1
v ∇⋅
v =− grad p

∂t
In case, when we would also like to consider other forces than the pressure gradient, we must put
them on the right hand side of the second law of dynamics. A particularly interesting force is the
viscosity, defined by the Newton's formula. Let us try to calculate the  F per unit volume acting
on the fluid layer in the direction x if the velocity gradient is in direction y:
 F = A[v '  y y−v '  y]
v '  y y −v '  y
F
=
V
y
F
= v ' '  y 
V
i.e. it is the second derivative of the velocity times the viscosity. The same can be said for the
velocity profile in the z direction. A bit more complicated is to investigate the effect of a velocity
change in the direction of flow, x. I will try to motivate it with help of some hand waving. Take a
look on the following figure:
3 Ref: Landau, Lifszyc, Hydrodynamics, p. 16
Please note that due to continuity equation (or conservation of the fluid's volume), as the fluid slows
down, it must expand. If in a time interval  t the streamline propagates in the x direction to a
smaller distance, then having a fixed width of the streamline, it must expand in the y direction by
the same amount, i.e. the change in the velocity in y direction equals to the change of the velocity in
the x dierection:
dv y −dv x
. Parametrizing time t=v x dx , we can modify this relation to
=
dt
dt
dv y −dv x
which drives the viscosity on the plane perpendicular to the motion,
=
dx
dx
F = A
dv y
.
dx
The gradient is measured „to the left”, so against the direction of the x axis, and therefore we can
get rid of the minus in the relation between
dv y −dv x
.
=
dx
dx
Generalizing this to three dimensions requires the use of a laplacian
2
∇=
∂2
∂2
∂2


2
2
2 :
∂ x ∂ y ∂z

F
2
= ∇ v
V
which, after adding another term to the Euler's equation gives the Navier-Stokes equation for an
incompressible fluid4:
∂ v
1
2

v ∇⋅
v = [ grad p ∇ v ]

∂t
11.10. Stokes law
4 Ref: Landau, Lifszyc, Hydrodynamics, p. 67 (Eq. 15.7)
The Stokes law is one of the most frequently used formulas of the fluid dynamics. However, its
derivation is very difficult. In this section I will sketch the steps of the derivation without going into
the mathematical details, which are too difficult for an introductory course in physics.
In case when the movement is laminar, for low Reynolds numbers, the velocity profile along the
flow is stationary and the left hand side of the Navier Stokes equation disappears:
 ∇ 2 v − grad p=0
Taking a curl of this equation results in a condition for the velocity:
∇ 2 curl v =0
(the curl of a gradient disappears, and laplacian can be interchanged in order with curl due to the
linearity). This must be supplemented by a continuity equation, which in the 3D form reads:
div v =0
The above two equations were solved by Stokes to give a velocity profile around a sphere of radius
r falling in a viscous incompressible fluid (actually, he considered a dual problem of a sphere which
remains at rest in a fluid that moves at velocity v), which reads:


v r =v cos  1−
3
3R R

2r 2r 3
v  =−v sin  1−


3R R 3
−
4r 4r 3
where R is the distance from the sphere's center, and  is the angle with respect to the direction
of falling. The derivation is extremly difficult, but if you want to check the correctness of these
formulas, just check the form of the vector operators for spherical coordinates and see how are the
two initial equations satisfied and put R=r, and
R=∞ to check the boundary conditions of the
flow around the sphere.
Plugging this equation into the Navier-Stokes equation gives the pressure distribution, which
contributes to the drag force acting on the sphere. Another contribution comes from the Newton's
viscosity of the flow near the surface, e.g.:
∂v r
∂r
∂ v
F 2= A
∂r
F 1=A 
Taking into account all the forces, and integrating them along the surface gives finally the Stokes
law:
 =6  r v
F
11.11. A general expression for the drag at higher velocities
If the velocities are sufficiently large (which happens for almost any velocity in a less dense
environment, like in aerodynamics), the linear relationship between the drag force and velocity
changes to a quadratic dependency:
1
2
F = C d  Av
2
Where CD is the drag coefficient (for example for cars in the air it is equal to 0.2-0.3),  is the
density of the fluid, A is the projection area of the object that moves in the fluid in the direction of
the velocity, v is the velocity of the object.
To understand this force, consider a deflection of air on the surface of the car. It follows a circular
path, so a centripetal force must have been supplied ( F =
mv 2
). This gives an idea of the origin
R
of quadratic dependency. Another observation is that the Bernoulli equation also gives the pressure
difference in terms of the kinetic energy difference of the fluid layers. This is also quadratic in
character.
The drag force is one of the limitation on the speed of the cars which have a certain amount of the
horsepower. Consider a car with a power of p=100hp, density of the air equal 1.27kg m-3, width
equal to 1.7m, height equal to 1.5m, aerodynamic coefficient CD=0.3.
p=Fv
1
p= C d  A v 3
2
2p
2⋅74.57kW
v =3
=
=53.5 m/s=192.7km /h
C d  w h 1.27kg/ m3⋅1.7m⋅1.5m

Questions
1. Define the pressure and Pascal principle. Explain the operation of hydraulic lift.
2. Derive the Archimedes buoyant force acting on volume V.
3. Discuss the idea of laminar and turbulent flow. Derive an equation for the Reynolds number.
4. Derive the continuity equation in 1D or in 3D*.
5. Derive the Bernoulli equation from work-energy theorem.
6. Define the newtonian viscosity and comment on the contributions from all coefficients.
Write down the Stokes law and the formula for the drag at higher velocities.
7. Derive* or at least write down and comment on the Poiseuille equation.
8. Derive the Euler equation from the second law of dynamics. What term one should add to
the Euler equation to account for viscosity and obtain the Navier-Stokes equation?
9. *Comment on the origin of the viscous term in the Navier-Stokes equation.
Reference: Landau, Lifszyc, Hydrodynamics
12. Harmonic oscillations
12.1. Hooke's law
Hooke's law is used to describe the behaviour of elastic bodies, for example of a spring. It says that
the elasticity force is proportional to the deformation of the elastic body, and the direction of the
elastic force is opposite to the deformation:
F =−kx
where k is the elasticity coefficient.
12.2. Oscillations of a spring
Consider a spring put horizontally on a table, with one end attached to the wall, and with mass m
attached to its other end.
The equation of motion for the mass m is the following:
ma=−kx
m ẍ=−kx
ẍ=−20 x
k
0=
m

The last two equations define the, such called, harmonic oscillator differential equation. Why?
Just take a look on the time derivative of periodic sin and cos functions (A and B are constants):
 A sin 0 t ' ' = 0 Acos 0 t ' =−20 A sin 0 t
 B cos  0 t ' '=− 0 B sin  0 t' =− 20 B cos  0 t
These functions evidently satisfy the harmonic oscillator equation! They are the solution of this
equation. Thus, the motion of a body that hangs on a spring can be described by:
x= Asin  0 tB cos  0 t
12.3. Damped oscillations
Now let us consider the same spring as above but doing oscillations in a viscous medium. We know
that there will appear a viscous drag force
F =−6  r v . The minus sign implies that the
direction of the drag force is the opposite to the direction of the velocity. In general, to consider the
non-spherical objects, we can assume
F =−bv . The second law of dynamics reads then:
ma=−kx −bv
m ẍ=−k x−b ẋ
This is typically rearranged into:
ẍ2  ẋ 20 x=0

k
m
b
2 =
m
 0=
The equations of type
form
ẍa ẋbx=0 are typically solved by searching for the solution of the
x= A e rx . We will do so. Substituting such a function into differential equation yields:
r 2 e rx2  r e rx 02 e rx =0
r 22  r 20 =0
The last line gives a quadratic equation for r:
=4 2−4 02
r 1,2=−±  2− 20=−±i   20−2 =−±i  damped
In such case the formula for x reads:
x=e−t  Ae i 
damped
t
Be−i 
damped
t

Because of the Euler formula:
e i =cos −i sin 
we can express the inner of the bracket as a combination of trigonometric functions:
x=e−t  Asin  damped tB cos  damped t 
which limits the solution to the domain of real (non-complex, non-imaginary) functions and is the
typical expression to describe the damped harmonic motion.
12.4. Forced oscillations
If we supply the dynamics equation of the damped harmonic motion with external forcing driven at
frequency  , (e.g. F =F 0 sin  t we can rewrite the law of dynamics as:
m ẍ=−kx−b ẋ F 0 cos  t
or:
ẍ=− 20 x−2  ẋ
F0
cos  t
m
The resulting oscillations run in the rythm of  so we only need to find the amplitude of the
function
x= A cos t :
−A2 cos t=− 20 A cos t2  Asin  t
A 20−2 cos t−2  A sin  t =
F0
cos t
m
F0
cos  t
m
Now, for t=0, it turns out that
A 20−2 cos −2  A sin =
F0
m
The above equation can be related in geometrical terms to a following straight triangle:
where from te Pythagorean theorem we have:
2
2
0
2 2
2
2
2
A [ − ] 4A   =
F 20
m2
F0
m
A=
2
[0 −2 ]4 2 2
12.5. The mathematical pendulum
Consider a following pendulum. It is composed of mass m, which hangs on a weightless string of
length L. The component of force that drives the rotation is mg sin  . This results in a torque that
acts on the arm length L. The equation for motion reads thus:
I =−mg sin  L
d2
m L2 2 =−mg sin  L
dt
d2
g
=− sin 
2
L
dt
2
d 
g
≈− 
2
L
dt
The last of these equations is the harmonic oscillator equation with frequency of oscillations equal
to  0=

g
.
L
12.6. An oscillating disc suspended on three strings
Another example of a pendulum, useful for the determination of the moment of inertia of a body is
a disc plate suspended on three strings. The second law of dynamics for the rotational motion reads:
I =R mg sin 
I ≈ R mg 
d 2  Rmg
=

I
dt 2
It is possible to relate the angle  to  because:
L = R
R
= 
L
so:
d2
m g
=R 2

2
I L
dt
so:
 0=R

m g
I L
12.6. The wave equation
To derive the wave equation we will consider a simple case of tranverse waves (i.e. oscillating at
right angle to the propagation direction, as opposed to longitudinal waves which oscillate in the
direction of propagation) generated on a rubber garden hose. Take look on the following figure:
In the vertical directions we can write the second law of dynamics as:
m
 A dx
∂2 y
= F y  x dx −F y  x
∂ t2
∂2 y
= Asin [ xdx]− Asin [ x ]
∂ t2
(  is the tension within the hose). If the angles are small, the sine function can be approximated
by a tangent, while tangent is simply the derivative tg =
∂y
:
∂x
∂2 y
∂ y  xdx
∂ yx
 dx 2 =
−
∂x
∂x
∂t
∂ y  xdx  ∂ y  x 
−
∂2 y 
∂x
∂x
=
2

dx
∂t
2
∂ y  ∂2 y
=
∂ t 2  ∂ x2
which is the mechanical wave equation, with speed of propagation v =
∂x

=
, so the

∂t

general form of a wave equation reads:
2
∂2 y
2∂ y
=v
2
2
∂t
∂x
Questions
1. Derive the harmonic oscillations of a spring from the Hooke's law. (formulate and show the
solution of the differential equation).
2. Solve the damped oscillator equation.
3. Derive* or write down and comment the formula for the amplitude of forced oscillations.
Do you know about applications of this phenomenon to the radio?
4. Derive the harmonic oscillator equation for a mathematical pendulum.
5. Derive the harmonic oscillator equation for a disc suspended on three strings.
6. Derive the wave equation for a vibrating string.
13. Diffusion
13.1. Fick's first law
Consider two compartments of particles, divided by a permeable wall, as in the figure below:
In these two compartments, there is a different amount of particles. Half of particles in each
compartment should move to the right, while the other half moves to the left. Taking a time step
which is sufficiently large for the particles to pass the width of the compartment, we can say, that
the flow through the barrier is proportional to:
N  x N  x  x 
N  x x
N  x
−
−
2
N
2
2
− x  S  x
 S  x − x 2 c  x x −c  x 
J=
=
=
=
t S
t  S
2t
x
2t
x
Taking a limit of this expression results in the First Fick's Law for one dimensional case:
J =−D
where,
D=lim t 0
dc
dx
 x2
.
2t
13.2. Second Fick's law
Consider a single cell of width  x which is subject to a flux entry from both sides with surface
area equal to  S :
The increase in the number of particles in this compartment depends upon the net flux entering
inside:
N
= S [ J  x −J  x x ]
t
N
=− S [ J  x x− J  x  ]
t
N
=−[ J  x x −J  x  ]
t  S
N
J  x x− J  x
=−
t S  x
x
∂ c −∂ J
=
∂t
∂x
∂c
∂2 c
=D 2
∂t
∂x
[
]
The last equation is called the Second Fick's Law and it describes the propagation of the
concentration distribution subject to diffusional driving force.
Reference: Feynmann's lectures on physics.
13. Scaling of the variance in a diffusion process
statistics, expected value (on an axample series where its easy to see repeating symbols and
introduce probability) variance, scaling ov variance in diffusion
In statistics, we can characterize a random sample by its mean, and variance (or standard deviation).
The mean value above is denoted by x , and can be calculated by means of simple arithmetical
mean:
x =
1
N
N
∑i −1 x i
But the mean value does not completely characterize the series of data. It is also worthty to know
how much are the samples departing from the mean value (the average length of the blue line).
Because the blue lines can have a length that could be either positive or negative (summing plus and
minus in an average gives zero!), we rather calculate an average of squares of such distances, i.e.
the mean square distance to the mean:
2
 =
N
1
2
 xi −x 
∑
i=1
N
Where actually after detailed analysis it turns out that we should write this as:
2 =
N
1
 x i− x 2
∑
i=1
N −1
because we don't have an exact value of the mean x but only its approximation (so the
uncertainty of x increases  2 ).
To proceed, we need a probabilistic version of the equations for averages. We can derive the
suitable expressions most easily after developing some intuition on an example. Consider a data set:
{1,2,4,3,1,2,3,5,3,2,3,2,1,4,5,2,1,2,3,4}
We can notice only five symbols
si , that repeat multiple times, e.g. N(1)=4, N(2)=5, N(3)=5,
N(4)=3, N(5)=2. The total number of samples equals 19, so we can calculate an average of any
possible function of the sample value:
f  x = 1
N
5
N
∑i=1 f  x i 
f  x =∑ f  s i N
i=1
 si 
N
5
f  x ≈ ∑ f  s i P  si 
i=1
The last formula can be generalized to a continuus distribution of sample values
si . In such case,
because a probability to obtain an exact value is extremely small, we make use of probability
density function p(s) such, that P  s∈ s0, s 0ds= p  s ds . Under such assumptions, we change
summation to integration:
∞
f  x =∑ f  s i  p  s i  si  f  x=∫ f  x  p  x  dx
i
−∞
Notice, that it is not necessery to distinguish symbols with a separate letter s—the integration goes
over the whole domain of the measured data.
Let us now come back to the diffusion equation:
2
∂p
∂ p
=D
2
∂t
∂x
Let us multiply it by x2 and integrate over the whole space:
2
∞
∂ ∞
2
2 ∂ p
∂
x
p
dx
=D
x
dx
∫
∫
−∞
∂t −∞
∂ x2
Now, on the left hand side, we have a time derivative of variance, while the right hand side needs
to be integrated twice by parts to obtain meaningful results. Durring the integration by parts keep in
mind that the probability to find anything at plus/minus infinity is nonexistent, e.g. the term
∞
uv∣−∞ vanishes:
∂ 2
∂p
= Dx 2
∂t
∂x
∞
∣
∞
−∞
∞
−2Dxp∣−∞ 2D ∫−∞ p dx
∂ 2
=2D
∂t
Integrating the last equation we obtain:
 2 =2Dt
Which means that during diffusion, the particles move randomly in such way, that they occuppy
more and more space, and the squared deviation from the initial position grows linearly in time. (the
standard deviation, which is a square root of variance, grows like a square root of time,
=  2Dt ).
13.4 Diffusion as a result of a random walk process
Consider a process of randomly wandering particles which jump at a probability p to the right on
the x-axis and at a probability q to the left. If we disallow trapping of particles at fixed position, the
situation looks like in the following picture:
In such case we can say that all particles that were found in position x after a time step escape to
adjacent positions (the condition that p+q=1), while for the particle probability density at x we can
write a following master equation:
p  x ,t t = p p  x− x ,t q p x x ,t 
This can be expanded into the Taylor series. The left hand side to the accuracy of the first term of
expansion, the right hand side to the second term of expansions, since the first term vanishes for
p=q:
p  x ,t 
∂p
∂p
∂2 p
∂p
∂2 p
2
 t= p p  x , t − p
 x p

x
q
p
x
,
tq

xq
 x2
∂t
∂x
∂x
2 ∂ x2
2 ∂ x2
Because p+q=1, we can simplify the above to:
∂p
∂ p  x ∂2 p  x 2
=− p−q

∂t
∂ x  t ∂ x 2 2 t
or to:
∂p
∂p
∂2 p
=−v d
D
∂t
∂x
∂ x2
x
v d =lim  t 0  p−q
,
t
 x2
D=lim  t 0
2 t
Refernces:
E. Zauderer, Partial differential equations of applied mathematics. Wiley, 2006.
J. Klafter, R. Metzler: A random walk's guide to anomalous diffusion.Phys. Rep. 339, 2000.
13.5. Diffusion as a consequence of a stochastic integral of the Langevin equation
The diffusion results from a microscopic brownian motion of the diffusing particles. A brownian
motion describes an irregular, apparently random trajectory which results from the thermal
collisions with vibrating particles of the environment. The changes in the trajectory can be thought
of as random, and therefore they are described by a Langevin equation, which is the second law of
dynamics in presence of a gaussian random force  :
m ẍ=− ẋF  x  t
in the limit of large damping, the inertial term m ẍ becomes small compared to the other terms of
the equation, and thus we write:
1

ẋ= F  x  t


This can be rewritten as:
1

dx= F dt t dt


In the stochastic calculus we introduce a process that is an integrated gaussian process,
t
t=∫0  s ds which is called the Wiener process. If t tends to 0, then d =t dt . This
allows to write:
1

dx= F dt d 


The Wiener process is delta-correlated, i.e.:
d  i t d  j t=ij dt
which means that the correlation is nonzero only for the correlation between the process and itself.
Now, we can consider a possible function f(x). We can expand it into a Taylor series to say:
df  x = f '  x dx
1
2
f ' '  x  dx
2
We can substitute dx by the Langevin equation:
df  x =
f ' x
[ F dt d  ]  1 2 f ' '  x  [ F 2 dt 22F  d dt 2 d 2 ]

2
Due to the correlation, d  2=dt , so leaving aside in the above equations the terms that are
proportional to greater powers of infinitesimals (e.g. Because dt 2 ≪dt , d  dt≪dt ), we can
write:
df  x =
2
f 'x
[ Fdt  d  ]   2 f ' '  x dt

2
Taking the average of this equation it turns out that the average of Wiener increments is zero, so:
∂〈 f  x 〉
f ' xF
2
=〈
〉〈
f ' '  x〉

∂t
2
Because the average of f is calculated with respect to the probability distribution p(x) as
∞
〈 f 〉=∫−∞ f  x  p x  dx , we can state (note, on the left hand side f(x) does not depend upon t):
 x ,t 
F
dx=∫−∞
∫−∞ f  x ∂ p∂t

∞
∞
p  x ,t 
2
2
∞  p  x ,t  ∂ f
∂f
dx∫−∞
dx
∂x
2 2
∂x2
Applying integration by parts, noticing that the probability density vanishes at plus minus infinity,
we have:
∞
∫−∞
f  x
∞
∞
∂ p  x ,t 
f  x  ∂ F p x , t
f  x  2 ∂2 p x , t
dx=−∫−∞
dx∫−∞
dx

∂t
∂x
2 2
∂ x2
Since this equation is valid for any possible f(x) (we have stated nothing about its identity), the
following condition must be satisfied:
∂ p x , t
1 ∂ F  x  p x , t  2 ∂ 2 p  x ,t 
dx=−
 2

∂t
∂x
2
∂ x2
Which is the drift-diffusion equation (the Smoluchowski equation). An extremly useful equation in
the field of statistical physics. The diffusion coefficient relates to the damping constant  and to
the noise strength  by
2
D= 2 .
2
Questions
1. Derive the first Fick's law
2. Derive the second Fick's law
3. Derive the formula for the variance scaling with time in a diffusion process.
4. *Derive the diffusion equation from the random walk.
5. *Derive the diffusion equation from the Langevin equation.
Reference: Lectures in Theoretical Biophysics . K. Schulten and I. Kosztin, Department of Physics
and Beckman Institute University of Illinois
P. Borys, PhD thesis.
14. Optics
14.1. Huygens principle
The front of a light beam in terms of the Huygens principle can be viewed as a collection of
spherical waves:
The elementary waves form a flat wave front that propagates in the direction of a beam. This
principle can be used to explain the law of reflection and of refraction.
14.2. The law of reflection
The law of reflection can be derived by considering the following picture of huygens waves that fall
on a plane:
In this figure5 I have presented the adjacent beams separated by a wavelength λ.If we consider the
advances of the wavefront at intervals when it moves by the value of λ, then after reflection, a
spherical wave is formed that spreads in multiple directions as a circle which radius grows by λ
each time step, but the points that belonged to a single wavefront before reflection converge to a
new wavefront after reflection (a line that is tangent to their „Huygens circles”). This newly created
wavefront is at the same angle with respect to the surface's normal, as the original wavefront.
14.3. The law of refraction
To derive the law of refraction we shall again consider a wavefront that falls on a surface, but this
time the surface is permeable to the light. However, the light velocity inside of the material is
different (for example slower, v=nc, where n is the refraction coefficient) that outside which results
in a difference in the wavelengths of the waves:
5 Note! To make such drawing on the blackboard (in the notebook)—draw the wavefront lines first (incident and
reflected), since otherwise it is impossible to draw circles with correct, repeating radii!
It is immediatelly seen6 that to maintain a consistent wavefront (e.g. to assure that the wavefront
elements before refraction rejoin after refraction), the light beam must bend. Consider the triangle
with one edge denoted by δ. We can write:
1
=sin  1

2
=sin  2

1 sin  1
=
 2 sin  2
v 1  t sin  1
=
v 2  t sin  2
v 1 sin 1
=
v 2 sin 2
c /n1 sin  1
=
c /n2 sin  2
n 2 sin 1
=
n1 sin 2
The last equality defines the refraction law, first established experimentally by Willebrod Snell.
14.4. The laws of relection and refraction by the Fermat principle
6 Note! To make such drawing on the blackboard (in the notebook)—draw the wavefront lines first (incident and
reflected), since otherwise it is impossible to draw circles with correct, repeating radii!
The laws of refraction and reflection can also be derived by the Fermat principle which states that
the light follows a trajectory between points A and B which consumes an extremal (e.g. Minimum)
amount of time. Consider the following images of reflection and of refraction:
To find a path that minimizes the time in case of reflection, we must find a point x, that minimizes:
t=v l 1l 2=
c

x 2 a 2 d −x 2 b2 

n
Differentiating this with respect to x and comparing to zero, we obtain a condition:
0=
2 d − x
2x
−
2
2
2  x a 2  d − x2b2
x
d−x
=
2
2
 x a  d − x2b2
sin 1=sin 2
Doing the same for the refractive case, keeping in mind that the velocity differs, we minimize a
condition:
t=
l1 l2 1
 =  n  x 2 a 2n2   d −x 2b 2 
v1 v2 c 1
which gives similar results but the sines remain multiplied by the refraction index (which does not
cancel on both sides):
n1 x
 x 2a
=
2
n2  d −x 
 d −x 2b 2
n 1 sin  1=n 2 sin 2
14.5. Diffraction on a diffraction grating
When a beam of light enters a pinhole surrounded by an impermeable material, then the light
exhibits behaviour that is known as diffraction, or bending of light on a scattering object:
Applying the Huygens principle to the pinholes we can observe some patterns of wave
amplification. There are directions where maxima of the wave from one slit meet the maxima of the
wave fron the other slit. On the other hand it is also possible that maxima of the wave produced on
one slit meet the minima of the wave produced on the other slit, and the signal disappears. This is
the diffraction effect, caused by interference of waves.
We will derive it for a double slit system, but these results are applicabile to any periodic diffraction
grating, since if you'd have more slits spaced by the same distance from eachother, then the phase
difference between subsequent rays emiting from these slits (e.g. between slit 3 and 2, or slit 4 and
3) would be the same as between slit 1 and 2 in the considered case.
To obtain a destructive interference (where minimum meets maximum), we must have:

d sin =  2n1
2
To have a constructive interference (a maximum meets a maximum), we must have:
d sin =n 
Which gives the condition for the subsequent diffraction fringles. It can be easily seen that the
angles become small when the ratio

is small. So, when λ <<d, then we essentially do not
d
observe interference as a result of diffraction. In such circumstances we can handle the light beam
as a geometrical object—a straight line that propagates in space—and this is the regime of the such
called geometrical optics.
14.6. Derivation of the diffraction pattern on a single slit
It is possible to consider a superposition of all elementary waves that form within the slit, not only
those from the edges. To do this, we must consider a formula for the phase shift for any point
located at position y from the upper edge. If y=d, we know that the wave delayed by
L=d sin 
Which can be converted to an angular shift by:
d =L
2 2d
=
sin 


in such case, if the elementary waves can be described by:
E=
E0
cos [ kr −t  y ]
d
(we take the amplitude per unit length, which is suitable for integration). Integrating this over the
whole slit, we obtain
E0
2 sin 
cos kr− t
y dy

d
d /2
E0
2 sin 
E=
sin kr −t
y

2  d sin 
−d / 2
d /2
E =∫−d / 2
[
[ 
]
]
To evaluate the definite integration, we recall a formula for difference of sines:
sin −sin =2 sin
[ ] [ ]
−

cos
2
2
which resuluts in Fraunhofer diffraction pattern:
E=
E0
 d sin 
sin
cos [ kr −t ]
 d sin 

[
]
Which (ignoring the constant prefactor) takes a following dependency on 

=0.1 :
d
This pattern is crossing zero when the sine contains a multiplicity of  , so
the maxima, we substitute =d sin  :
E=
E 0 cos [ kr− t ]

sin
[ ]



And calculate the derivative with respect to  :
dE E 0 cos [ kr −t ]
=
d

cos
[  ]  −sin [  ]
2
The numerator equals zero when:
[ ]


=tan


Since tangent is a periodic function, and grows very fast to infinity
d sin 
=n To find

The solutions
x=tan x , except for x=0, occur approximately at the end of period of the function,
i.e. for
x=2n1

, n=1,2 , ...
2
so:


=2n1 n

2

=2n1
2

d sin =2n1
2
and d sin =0 .
14.7. Diffraction on an obstacle
On the laboratory practice we try to calculate the hair width using diffraction methods. The situation
is quite similar to the diffraction on a single slit, it is an inverse of that. In general, the wave front
carries an intensity
E 0 that spreads over the beam width w. Then, after the diffraction takes place,
the Huygens waves that would normally pass the center, where an obstacle (e.g. the hair) is located
is blocked. Therefore, the diffraction pattern, calculated in 14.6. (for a radiation amplitude per unit
length
E 0 /w ) should be subtracted from the intensity occuring from the whole beam of width w
(the pattern in 14.6. with d=w):
E=
{ [
E 0 cos [ kr− t ]
 w sin 
 d sin 
sin
−sin
 w sin 


] [
]}
The resulting pattern looks like the following:
You should notice that the maxima and minima coincide with maxima and minima for the
Fraunhofer pattern for diffraction on a single slit, but they are extremely small in amplitude, much
darker than the main fringle for =0 . Therefore a measurement of such diffraction should be
carried out in rather dark conditions.
14.6. Thin lens equation.
Consider a spherical surface that accepts a light beam7:
7 Be very careful with this drawing on the lecture/on the exam! If it is inaccurate, you may identify the angles
incorrectly.
Here we can write Snell's law for the angled beam as:
n 1 sin 1=n 2 sin  2
To express the angles we can draw a supporting picture:
since  1= and  2=− , for small angles, where sin   , we have:
n1 =n2 −
n 1 n2 = n2−n1 
expressing the angles as ratios of arc length s to the corresponding radii, we have:
s
s
s
n 1 n2 =n 2−n 1
p
o
r
n 1 n 2 n2 −n1
 =
p o
r
Now take a look at a thick lens:
We can write two equations:
1 n n−1
 =
p o
r1
n
1 1−n
 =
p' o'
r2
We can rewrite this:
1 n n−1
 =
p o
r1
n
1 1−n
 =
L−o o '
r2
going with L to zero and adding these eqations we obtain a thin lens equation:
1 1
1 1
 =n−1 − 
p o'
r1 r2
Which can be also written in more compact form as
1 1 1
 =
p o f
where p is the position of the source object with respect to the lens, o is the position of the image
formed by the lens, and f is the focal length of a lens. This equation allows to determine some rules
for constructions in geometrical optics. Namely
●
The ray that enters the lens in parallel to the observation axis, (i.e. it crosses the axis at
infinity, so
●
p=∞ ) exits in such way that o=f, which follows from the lens equation.
The ray that exits the lens in parallel to the observation axis enters the lens at p=f (by the
same reasoning)
●
The ray that is crossing the center of a convex lens exits the lens unchanged due to
symmetrical refraction on entrance and on exit. This is summarized in the figure below:
14.7. Optical instruments
A simple magnifying glass is constructed of only a single lens and forms a diverging set of rays
which resemble a magnified virtual object at position which resembles the distance of a good sight
(larger than 25cm).
The magnification of the magnifying glass can be referred from the lens equation for case when the
eye is put close to the lens so, that the whole „good sight length” is on the virtual side of the lens.
Then:
∣
1
1 1
 = ⋅25cm
−25cm p f
25cm 25cm
−1
=
p
f
25cm
−1m=
f
25cm
m=1
f
A microscope is formed from two lenses (an objective and an ocular (or eyepiece)), which work in
the following way:
A real object sents light through the objective lens and forms a real image. Then, an occular lens
takes this image on the principles of a magnifying glass to form a virtual image that is observed by
the eye. The magnification of the microscope follows from the magnification of the objective lens,
which results from the triangles in bold, i.e. mob=
magnifying glass (the occular), which is moc =1
m=mob moc=
s
and from the magnification of the
f1
25cm
, so in gneral we have
f2

s
25cm
1
f1
f2

A telescope is constructed in similar way to a microscope but it is designed to magnify objects that
are located infinitely far (or simply: very far) from the observer. The magnification relates to a
change in angles of the incoming rays:
(for sake of explanation of the „magical” ray bending in the focus between lenses i have introduced
the path of real rays in red and magenta, as seen by the geometrical optics standard constructions—
i.e. such bending is formed of two different rays! Red line changes to magenta). Taking a look at the
blue triangles, we can conclude that
h
f1
h
2≈tan  2=
f2

f
m= 2 ≈ 1
1 f 2
1≈tan  1=
It should be also noticed that the image becomes inverted: the rays that originally were directed
upwards, are now directed downwards.
How large can be the magnification of a telescope? It could be constructed as large as possible, but
in astronomy a limit is set by the diameter of the eyepupil (7mm). Astronomical objects are often
dark (e.g. the nebulae, distant galaxies) so it is favourable to increase the amount of light entering
the eye, and certainly not to decrease it. This can be accompanied when the radius of the cone
entering the eyepupil is not larger than 7mm:
h2 h1
=
f2 f1
f 1 h 2 7mm
m= = =
f 2 h1
D
where D=2h1 is the diameter of the telescope's objective.
14.8. Polarization
The electromagnetic wave is a transverse wave with perpendicular directions for the oscillations of
the magnetic field vector and electric field vector (we will learn about it when digesting the
Maxwell equations). The electromagnetic wave may look like the following:
In this example the E field oscillates in a single plane. We say that such wave has a flat polarization.
Typically a light beam is composed of many different modes of oscillations, so it is unpolarized. It
may become flat polarized after passing it through a device that is called a polarizer. Such device
allows to pass only waves with fixed polarization and damps out other components.
Except for the flat polariation it is possible to have circular polarization. Imagine two electrical
fields that are shifted by 90o with respect to each other:
It can be easily seen that the resulting vector (vectorial sum of these waves) circulates around the
axis. This is circular polarization.
14.10. Birefringence
The polarization matters a lot for materials which are birefringent.In such materials there is
typically an uniaxial symmetry, i.e. there is one axis which is symmetrical to rotations to certain
degree. The directions perpendicular to the optical axis are isotropic. If there is an increased order in
particular direction, it is possible to have a different (e.g. smaller) electron density, and
consequently the light wave does not interact with matter to the same degree (acceleration of
charges) and a different (e.g. larger) value of the speed of light is possible (maximum value for the
vacuum) in the direction perpendicular to the symmetry axis (perpendicular, because light is a
transverse wave with oscillations perpendicular to the direction of motion).
The ordinary ray oscillates in the direction perpendicular to the optiacl axis and has a velocity that
does not depend upon direction. The extraordinary ray vibrates in the direction that contains
components paralell to the optical axis and these components move at larger velocity than the rest:
As a result, the extraordinary ray is bent inside of the crystal. The birefringent materials are of great
user for the construction of advanced microscopes, the differential image contrast microscopes,
which use birefringent crystals to split the beam into two narrowly spaced beams that pass through
the sample and are then recombined to a single beam to cause a destructive interference if the
beams are identical, and constructive interference when there is a difference in beams (e.g. when an
edge of two material phases occurs in the sample).
Questions
1. Derive the refraction and reflection law using Huygens principle.
2. Derive the refraction and reflection law using Fermat principle.
3. Derive the conditions for maxima and minima in case of a driffraction on a diffraction
grating.
4. Derive the equation for the intensity distribution in Fraunhofer (single slit) diffraction.
5. Derive the thin lens equation.
6. Explain the operation of the magnifying glass, of the telescope and of the microscope.
7. What is the circular polarization?
8. Explain the origin of the extraordinary ray in birefringent materials.