Question

Why can't I define $\Delta u=(g(x)+\Delta x)-g(x)$?

Original question: Let u=g(x)u=g(x), then we have

dfdu=f(u+Δu)g(u)Δu\frac{df}{du}=\frac{f(u+\Delta u)-g(u)}{\Delta u}

where Δu=g(x+Δx)g(x)\Delta u=g(x+\Delta x)-g(x), but why can’t I define Δu=(g(x)+Δx)g(x)\Delta u=(g(x)+\Delta x)-g(x). I understand this leads to the issue Δx=Δu\Delta x=\Delta u, however this feels more intuitively right to me, whereas the other one seems completely incorrect. This may be a misunderstanding in what an instantaneous rate of change is.

Expert Verified Solution

thumb_up100%(1 rated)

Key concept: The confusion is very common in chain rule and differential notation. The key idea is that Δu\Delta u describes the change in the output caused by changing the input.

Step by step

The main reason

If u=g(x)u=g(x), then Δu\Delta u must mean the change in uu caused by changing xx from xx to x+Δxx+\Delta x.

So the correct definition is

Δu=g(x+Δx)g(x)\Delta u=g(x+\Delta x)-g(x)

because the new output is g(x+Δx)g(x+\Delta x), not g(x)+Δxg(x)+\Delta x.

Why your idea does not work

Your proposal

Δu=(g(x)+Δx)g(x)\Delta u=(g(x)+\Delta x)-g(x)

simplifies to

Δu=Δx\Delta u=\Delta x

That would mean the change in the output is always exactly the same as the change in the input. That is only true for special functions like g(x)=xg(x)=x, not for a general function.

What the instantaneous rate of change means

The derivative is based on the ratio

ΔuΔx=racg(x+Δx)g(x)Δx\frac{\Delta u}{\Delta x}=rac{g(x+\Delta x)-g(x)}{\Delta x}

and then taking the limit as Δx0\Delta x\to 0.

This measures how fast the function output changes because of the input change.

A quick example

If g(x)=x2g(x)=x^2, then

Δu=(x+Δx)2x2=2xΔx+(Δx)2\Delta u=(x+\Delta x)^2-x^2=2x\Delta x+(\Delta x)^2

This is clearly not equal to Δx\Delta x.

Final idea

So Δu\Delta u cannot be defined as (g(x)+Δx)g(x)(g(x)+\Delta x)-g(x), because that adds the input change directly to the function value instead of applying the function to the new input.

Pitfall alert

A common trap is mixing up the change in the input with the change in the output. Δx\Delta x belongs to the input variable, while Δu\Delta u must come from evaluating the function at the changed input.

Try different conditions

If g(x)=xg(x)=x, then your proposed expression happens to give the same result because the function is the identity. But for nonlinear functions such as x2x^2, sinx\sin x, or exe^x, the two definitions are completely different.

Further reading

chain rule, increment notation, derivative limit

chat