Question

Why can't I define $\Delta u=(g(x)+\Delta x)-g(x)$?

Original question: Let u=g(x)u=g(x), then we have

dfdu=f(u+Δu)g(u)Δu\frac{df}{du}=\frac{f(u+\Delta u)-g(u)}{\Delta u}

where Δu=g(x+Δx)g(x)\Delta u=g(x+\Delta x)-g(x), but why can’t I define Δu=(g(x)+Δx)g(x)\Delta u=(g(x)+\Delta x)-g(x). I understand this leads to the issue Δx=Δu\Delta x=\Delta u, however this feels more intuitively right to me, whereas the other one seems completely incorrect. This may be a misunderstanding in what an instantaneous rate of change is.

Expert Verified Solution

thumb_up100%(1 rated)

Key concept: The confusion is very common in chain rule and differential notation. The key idea is that Δu\Delta u describes the change in the output caused by changing the input.

Step by step

The main reason

If u=g(x)u=g(x), then Δu\Delta u must mean the change in uu caused by changing xx from xx to x+Δxx+\Delta x.

So the correct definition is

Δu=g(x+Δx)g(x)\Delta u=g(x+\Delta x)-g(x)

because the new output is g(x+Δx)g(x+\Delta x), not g(x)+Δxg(x)+\Delta x.

Why your idea does not work

Your proposal

Δu=(g(x)+Δx)g(x)\Delta u=(g(x)+\Delta x)-g(x)

simplifies to

Δu=Δx\Delta u=\Delta x

That would mean the change in the output is always exactly the same as the change in the input. That is only true for special functions like g(x)=xg(x)=x, not for a general function.

What the instantaneous rate of change means

The derivative is based on the ratio

ΔuΔx=racg(x+Δx)g(x)Δx\frac{\Delta u}{\Delta x}=rac{g(x+\Delta x)-g(x)}{\Delta x}

and then taking the limit as Δx0\Delta x\to 0.

This measures how fast the function output changes because of the input change.

A quick example

If g(x)=x2g(x)=x^2, then

Δu=(x+Δx)2x2=2xΔx+(Δx)2\Delta u=(x+\Delta x)^2-x^2=2x\Delta x+(\Delta x)^2

This is clearly not equal to Δx\Delta x.

Final idea

So Δu\Delta u cannot be defined as (g(x)+Δx)g(x)(g(x)+\Delta x)-g(x), because that adds the input change directly to the function value instead of applying the function to the new input.

Pitfall alert

A common trap is mixing up the change in the input with the change in the output. Δx\Delta x belongs to the input variable, while Δu\Delta u must come from evaluating the function at the changed input.

Try different conditions

If g(x)=xg(x)=x, then your proposed expression happens to give the same result because the function is the identity. But for nonlinear functions such as x2x^2, sinx\sin x, or exe^x, the two definitions are completely different.

Further reading

chain rule, increment notation, derivative limit

FAQ

Why is Δu defined as g(x+Δx)-g(x)?

Because Δu represents the change in the output value of u=g(x) when the input changes from x to x+Δx. The new output is g(x+Δx), so the change is g(x+Δx)-g(x).

Why is Δu=(g(x)+Δx)-g(x) not correct in general?

That expression adds the input increment Δx directly to the function value instead of evaluating the function at the new input. It would only match the true change for very special functions like g(x)=x.

chat