As I have another Blog in Spanish where I mix Mathematics with law,
literature and no matter what, I am going to be more restrained here and speak
only about Math and Physics.
Since I wrote latest posts, I have learnt much
more, so I think that I can aim at posting only things that should be orthodox,
even if I cannot help being somehow “creative”.
Today’s post is about a visual
way for understanding a differentiation rule, the “power rule”.
Let us choose a well-known function,
like L = f(t), which takes as input the time elapsed and generates as output the length
traversed by a body within that time lapse.
This function, like any function, can be viewed as the product between
the input and the average
change rate. This is self-evident, since L is
obviously v * t
= (L/t) * t = L.
In the simplest manifestation of the function (say for example L = 2t), v is a constant,
so it will be the same no matter the interval.
If the body has traversed 2 meters, it will
have done it in 1 second, so the ratio 2/1 is 2. If
displacement is 1 meter, time required will
be 0.5 seconds, the ratio being again 2. And what
if we keep reducing the interval considered
until it is 0, what we call an “instant” without
duration? Well, division by 0 is not permitted: it leads to an “undefined”,
i.e. an expression that breaches the rules about how to build mathematical expressions…
This shouldn’t come as a surprise: if you define velocity as a rate of change
over time, how come that you now want to calculate it without allowing for the
lapse of any time at all? However, here we can perfectly stipulate that this
playful instantaneous rate is the same as in
all other cases, since it is in fact constant: it is 2
as well.
In a more complicated form, the function is L
= t2. In our example, this will happen when the body is subject to
acceleration. Let
me remind you that the kinematic formula for obtaining the displacement of a
body that starts with velocity “vo” and suffers acceleration “a” is L = vo + at2/2. If we
stipulate that the body starts at rest (vo =0) and is subject to a =
2 m/s2, then we get the above mentioned function, L = t2.
This can still be viewed as L = tt, where t is again the average
velocity, but not the instantaneous
one anymore, as velocity is continuously changing (there is acceleration). (It
may surprise you that I refer to t as a
velocity, but it is fine, because I mean its numerical value; if 3 seconds have elapsed and the body has displaced 3*3=9 meters, what I mean is that it has moved at an
average of v m/s during t seconds, it just happens that the numerical value of
v is the same as the number of seconds t having elapsed).
Because this instantaneous velocity is variable, it will be a function (the
derivative or L’),
where the argument can only be the one that we are in possession of (time), although arranged differently. For this sort of
functions, the so-called “power functions” [where the argument is raised to
progressively higher powers (t, t2, t3…)], the rule for
finding the derivative is the so-called “power rule”, which is L’ = n * tn-1, i.e. (i) reduce the
power by 1 and (ii) multiply by the power.
We can learn this rule by heart or try to understand the logic behind
it.
I asked about such logic here, but I am not very convinced with the answer,
which extracts the proof from another differentiation rule, the product rule.
My intention is however finding a logic that could precisely later illuminate such
other rule.
Another approach is doing the normal operation that you carry out to
find derivatives and then repeat it with a few power functions, until a pattern
emerges.
In particular, with the function L = t2,
you would reason as follows:
If you repeat this operation with L = t3,
the result is L’ = 3t2 and thus
the pattern represented by the power rule (L’ = n *
tn-1) clearly shines up.
I am not very convinced, however, about this idea that the interval “tends to zero in the limit”, because this
suggests that the value obtained for the derivative is an “approximation”. But
it is not such thing, it is an exact value. It is true that in the real world you
can only measure a velocity as a ratio between distance and time, but the
abstract idea of a body having a state of motion at a given “instant” (null
interval) is perfectly valid. Precisely the visual interpretation that I
propose is a description of things where you can dispense with the concept of a
ratio. In particular, the idea is viewing the function
(L = length traversed in our original
example) as the area (A) (or volume or
hyper-volume) of a geometric figure, which is stretched as you pull in the direction of the
input from one or several sides,
which side/s is/are thus providing a sort of path
or rail constraining the growth of the figure.
The
simplest manifestation is what we can call rectangular
growth, as shown in these pictures:
In both cases, the function is the same: it is A
= 3 x
(which is the equivalent of L = 3 t in the original
example). The characteristic of this
function is that there is a neat separation of roles: one dimension (either the
horizontal or the vertical one, the choice is arbitrary) plays the role of input or growing dimension that
pulls from the other side and is logically variable; you take this out and what is
left, the other dimension, is fixed
and one can view it as the side that you pull from
to stretch the figure, which is acting as a path or
rail constraining the growth thereof.
Again the function is the same for both pictures: it is A = x2/2 (which would be L = t2/2 in the original). Here both
dimensions are growing harmoniously at the same time. After taking out one
dimension, what is left is average growth rate,
which is x/2, a half-side. In turn, the derivative will be the side that
you pull from and is therefore determining how much the area grows, which is one side (x). The
choice of which side to pull from is arbitrary: in the first picture I chose to
pull from the vertical one; in the second one we pull from the horizontal side;
but in any case we pull from only one side. The novelty now is that, yes,
certainly, such path is not fixed, it is variable, but that is not a problem.
We said that the derivative would be the side from which we pull and that constitutes the path
for growth of the other dimension and
there is no reason for changing our mind just because such path is variable: it
is in fact continuously widening following the hypotenuse of the triangle and
that is why we say that the derivative, instead of the coefficient 3 or whatever
number, is a variable, xv as in
the left picture or xh as in the
right one (any of them is fine, since both grow harmoniously).
The average growth rate is one side (x), which is what is left after taking
out one dimension. What is the derivative? It
is two sides of the square, that is to say, 2x. The visual reason is that in order to obtain a
square-like growth, as suggested by the expression x2,
you must pull from two sides, because
(unlike what happened in the triangle case) only this way can you achieve to
cover the area that the expression x2
demands. We could expect this by realizing that the square area x2 is the sum of the areas of two
triangles (x2/2 + x2/2),
so the growth path should also be double (xh
+ xv = 2x). And, yes, certainly, these
paths are both variable, but that is not a problem: it just happens that
each side is acting as a rail for the growth
of the other one and growing
itself within the other’s channel. So,
in this case, we have two weird aspects: unlike in the rectangular case, the
path is variable and, unlike in the triangle case, there is a mixture of roles,
as both sides are reciprocally and simultaneously pulling
from and constraining each other. But that is not dramatic, we just address the situation mutatis mutandi: because the path
is variable, the derivative is x; because
there are two paths, it is 2x.
If we now face a cube-like growth,
where the function is V = x3, we
can infer that the average growth rate will
be one side, a face of the cube (x2)
and the derivative, based on the visual
criterion, should be 3 sides (3 faces of x2
each = 3x2). Why? Because that is what
you pull from to make the cube grow in its three dimensions and also grow along
the full volume embraced by those sides, as demanded by the expression of the
function. The sides will be double than
the dimensions, but you just need to pull from the dimensions (half of the
sides), what we could call a “half-skin”.
From here onwards, with 4D or higher-dimensional figures (so called “hyper-cubes”), you apply the same algebraical rules:
- since you must pull from the sides, to find
yourself at the level thereof, you go down one dimension (you reduce the power
by one) and
- since you must pull from as many sides as
dimensions has the figure, you multiply by the number of dimensions (which is
the power itself).
Conclusion: the objective was finding a way to describe the derivative
(instantaneous rate of change), at least for power functions, that is not
relying on a ratio and I think that we have found it in this idea that the
derivative is the side/s that you pull from in order to stretch a geometric
figure.
Some other aspects to be discussed to complete the study:
- What if the input of the function is not a
side of the figure but its radius?
- Can you see the function as a sum of
individual growths in each dimension and the derivative as the sum of the
velocities in each dimension?
- Link with the calculation of initial / final
velocity in kinematic formulas.
- How this approach can serve also to explain
product rule and other differentiation rules.
- The link of all this with Zeno paradoxes.
About the impossibility of instantaneous measurement of speed (or anything else), there has been right now an interesting thread in physicsforums. The conclusion (or the one that I agree with) is in line with what I said above: there is no physical way to make a measurement that does not require a time period, during which the speed of the object may have changed. See as most explanatory, this post: https://www.physicsforums.com/threads/speed-measurement-limitations-to-instantaneous-measurements.977557/post-6237175
ReplyDeleteIn a much more impressionistic way, given my limitations in these fields, I can follow your discussion and "feel" its original and creative ideas. This is much more interesting or inspiring as it may seem, you know that we learn a lot from our basic, amateur intuitions.....thanks!!!
ReplyDelete