Alternatively, it follows from (2) that if the polar coordinates of the point p are (r1,θ1) and those of q are (r2,θ2), then the distance between the points is √[r1² + r2² - 2r1r2cos(θ1 - θ2)]. The '2' is there because it's an average of 2 cells. Because it involves no absolute-value function, this metric is convenient to deal with analytically, but the squares make it very sensitive to large outliers.
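The polar-coordinate distance can be checked numerically. The sketch below assumes angles in radians and takes the square root of the law-of-cosines expression; the names polar_distance and to_cartesian are illustrative and do not come from the source.

```python
import math

def polar_distance(r1, theta1, r2, theta2):
    """Distance between two points given in polar coordinates (radians)."""
    # Law of cosines: d^2 = r1^2 + r2^2 - 2*r1*r2*cos(theta1 - theta2)
    squared = r1**2 + r2**2 - 2 * r1 * r2 * math.cos(theta1 - theta2)
    # Guard against a tiny negative value caused by floating-point rounding.
    return math.sqrt(max(squared, 0.0))

def to_cartesian(r, theta):
    """Convert polar coordinates to Cartesian (x, y)."""
    return (r * math.cos(theta), r * math.sin(theta))

# Cross-check against the Cartesian formula for the same two points.
p = to_cartesian(3.0, 0.0)
q = to_cartesian(4.0, math.pi / 2)
cartesian = math.hypot(q[0] - p[0], q[1] - p[1])
polar = polar_distance(3.0, 0.0, 4.0, math.pi / 2)
print(polar, cartesian)
```

Both values should agree up to rounding, since the two formulas describe the same distance.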

Two dimensions

In the Euclidean plane, if p=(p1,p2) and q=(q1,q2) then the distance is given by d(p,q) = √[(q1-p1)² + (q2-p2)²]. For cells described by more than 1 variable this gets a little hairy to figure out, so it's a good thing we have computer programs to do this for us. The resolution of the raster can be controlled with the Output cell size parameter or the Cell Size environment. Now there are these clusters at stage 4 (the rest are single cells and don't contribute to the SSE):
1. (2 & 19) from stage 1; SSE = 0.278797
2. (8

Because all SSEs have to be added together at each stage, the total SSE is going to be 0.737739. So, the SSE for stage 1 is: 6.

A vector can be described as a directed line segment from the origin of the Euclidean space (vector tail) to a point in that space (vector tip). Distance functions are often used as error or cost functions to be minimized in an optimization problem. If allocation output is desired, use the Euclidean Allocation tool, which can generate all three outputs (allocation, distance, and direction) at the same time. At each stage of cluster analysis the total SSE is minimized with SSEtotal = SSE1 + SSE2 + SSE3 + SSE4 + ... + SSEn.
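The summation of per-cluster SSEs can be sketched as follows. This assumes each cluster's SSE is the sum of squared Euclidean distances from its cells to the cluster's vector mean; the function names and the sample cells are hypothetical and are not the data behind the SSE values quoted in the text.

```python
def cluster_sse(cells):
    """Sum of squared distances from each cell to the cluster's vector mean."""
    n = len(cells)
    dims = len(cells[0])
    # Vector mean: the average of each variable over the cells in the cluster.
    mean = [sum(cell[k] for cell in cells) / n for k in range(dims)]
    return sum((cell[k] - mean[k]) ** 2 for cell in cells for k in range(dims))

def total_sse(clusters):
    """SSEtotal = SSE1 + SSE2 + ... + SSEn over all clusters."""
    return sum(cluster_sse(cluster) for cluster in clusters)

# Hypothetical clusters: two pairs and one single cell.
clusters = [
    [(1.0, 2.0), (3.0, 4.0)],
    [(0.0, 0.0), (0.0, 2.0)],
    [(5.0, 5.0)],  # a single cell sits on its own mean, so its SSE is 0
]
print(total_sse(clusters))
```

A single-cell cluster adds nothing to the total, which is why only merged clusters need to be listed when tallying the SSE at a given stage.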

In this case, the equation becomes d²(p,q) = (p1-q1)² + (p2-q2)². However, instead of determining the distance between 2 cells (i & j), it is between cell i (or j) and the vector mean of cells i & j.
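As a small illustration of the squared distance to the vector mean, the sketch below uses two hypothetical cells. It also shows that, for a pair of cells, the two squared distances to the mean add up to half the squared distance between the cells themselves. The names squared_distance and vector_mean are illustrative only.

```python
def squared_distance(p, q):
    """Squared Euclidean distance: d^2(p, q) = sum of (p_k - q_k)^2."""
    return sum((a - b) ** 2 for a, b in zip(p, q))

def vector_mean(p, q):
    """Component-wise mean of two cells."""
    return tuple((a + b) / 2 for a, b in zip(p, q))

# Hypothetical cells, each described by two variables.
cell_i = (1.0, 2.0)
cell_j = (4.0, 6.0)

mean_ij = vector_mean(cell_i, cell_j)
d2_i = squared_distance(cell_i, mean_ij)  # cell i to the mean
d2_j = squared_distance(cell_j, mean_ij)  # cell j to the mean
pair_sse = d2_i + d2_j

# For two cells, the SSE about the mean is half the squared distance between them.
print(pair_sse, squared_distance(cell_i, cell_j) / 2)
```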

