Andrews plot

From HandWiki
Short description: Type of data visualization
An Andrews curve for the Iris data set

In data visualization, an Andrews plot or Andrews curve is a way to visualize structure in high-dimensional data. It is basically a rolled-down, non-integer version of the Kent–Kiviat radar m chart, or a smoothed version of a parallel coordinate plot. It is named after the statistician David F. Andrews.[1][2][3][4]

A value [math]\displaystyle{ x }[/math] is a high-dimensional datapoint if it is an element of [math]\displaystyle{ \mathbb R ^d }[/math]. We can represent high-dimensional data with a number for each of their dimensions, [math]\displaystyle{ x = \left \{ x_1, x_2, \ldots, x_d \right \} }[/math]. To visualize them, the Andrews plot defines a finite Fourier series:

[math]\displaystyle{ f_x(t) = \frac{x_1}{\sqrt 2} + x_2 \sin(t) + x_3 \cos(t) + x_4 \sin(2t) + x_5 \cos(2t) + \cdots }[/math]

This function is then plotted for [math]\displaystyle{ -\pi \lt t \lt \pi }[/math]. Thus each data point may be viewed as a line between [math]\displaystyle{ -\pi }[/math] and [math]\displaystyle{ \pi }[/math]. This formula can be thought of as the projection of the data point onto the vector:

[math]\displaystyle{ \left ( \frac 1 {\sqrt 2}, \sin(t), \cos(t), \sin(2t), \cos(2t), \ldots \right ) }[/math]

If there is structure in the data, it may be visible in the Andrews curves of the data.

These curves have been utilized in fields as different as biology, neurology, sociology and semiconductor manufacturing. Some of their uses include the quality control of products, the detection of period and outliers in time series, the visualization of learning in artificial neural networks, and correspondence analysis.

Theoretically, it is possible to project them onto an n-sphere. The projection onto the circle results in the aforementioned radar chart.

References

  1. Andrews, D. F. (1972). "Plots of High-Dimensional Data". Biometrics 28 (1): 125–136. doi:10.2307/2528964. https://www.jstor.org/stable/2528964. 
  2. Embrechts, Paul; Herzberg, Agnes M. (1991). "Variations of Andrews' Plots". International Statistical Review / Revue Internationale de Statistique 59 (2): 175–194. doi:10.2307/1403442. ISSN 0306-7734. https://www.jstor.org/stable/1403442. 
  3. García-Osorio, César; Fyfe, Colin (2005). "Visualization of High-Dimensional Data via Orthogonal Curves". Journal of Universal Computer Science 11 (11): 1806–1819. http://www.jucs.org/jucs_11_11/visualization_of_high_dimensional/jucs_11_11_1806_1819_garc_a_osorio.pdf. 
  4. "Andrews Curves". Rip’s Applied Mathematics Blog. 3 October 2011. http://rip94550.wordpress.com/2011/10/03/andrews-curves/.