Cumulative Distribution Transform (CDT)

This tutorial will demonstrate: how to use the forward and inverse operations of the CDT in the the PyTransKit package.

Class:: CDT

Functions:

Forward transform: sig1_hat, sig1_hat_old, xtilde = forward(x0, sig0, x1, sig1)

Inputs:
----------------
x0 : 1d array
    Independent axis variable of reference signal (sig0).
sig0 : 1d array
    Reference signal.
x1 : 1d array
    Independent axis variable of the signal to transform (sig1).
sig1 : 1d array
    Signal to transform.

Outputs:
----------------
sig1_hat : 1d array
    CDT of input signal sig1.
sig1_hat_old : 1d array
    CDT of input signal sig1 using the old definition.
xtilde : 1d array
    Independent axis variable of sig1_hat.

Inverse transform: sig1_recon = inverse(sig1_hat, sig0, x1)

Inputs:
----------------
sig1_hat : 1d array
    CDT of a signal sig1.
sig0 : 1d array
    Reference signal.
x1 : 1d array
    Independent axis variable of the signal to reconstruct.

Outputs:
----------------
sig1_recon : 1d array
    Reconstructed signal.

Definition

Forward Transform

Let \(s(x), x\in\Omega_s\subset\mathbb{R}\) be a positive density function (PDF). The CDT of the PDF \(s(x)\) with respect to a reference PDF \(s_0(x), x\in\Omega_{s_0}\subset\mathbb{R}\) is given by the mass preserving function \(\widehat{s}(x)\) that satisfies:

\begin{equation} \int_{inf(\Omega_s)}^{\widehat{s}(x)}s(u)du = \int_{inf(\Omega_{s_0})}^xs_0(u)du \end{equation}

which yields:

\begin{equation} \widehat{s}(x) = S^{-1}(S_0(x)), \end{equation}

where \(S(x)=\int_{inf(\Omega_s)}^{x}s(u)du\), and \(S_0(x)=\int_{inf(\Omega_{s_0})}^{x}s_0(u)du\). For continuous positive PDFs, \(\widehat{s}\) is a continuous and monotonically increasing function. If \(\widehat{s}\) is differentiable, the above equation can be rewritten as:

\begin{equation} s_0(x) = \widehat{s}'(x)s(\widehat{s}(x)). \end{equation}

Inverse Transform

The inverse transform of the CDT \(\widehat{s}(x)\) is given by:

\begin{equation} s(x) = (\widehat{s}^{-1}(x))'s_0(\widehat{s}^{-1}(x)), \end{equation}

where \(\widehat{s}^{-1}\) is the inverse of \(\widehat{s}\), i.e. \(\widehat{s}^{-1}(\widehat{s}(x)) = x\).

CDT Demo

The examples will cover the following operations: * Forward and inverse operations of the CDT * CDT Properties - translation, scaling, and linear separability in CDT space

Forward CDT

Import necessary python packages

[1]:

import numpy as np
import matplotlib.pyplot as plt

Create reference \(I_0\) (uniform distribution) and a demo signal \(I_1\)

[2]:

N=250
x0 = np.linspace(0, 1, N)
I0= np.ones(x0.size)
x=np.linspace(0, 2, N)
mu=x[len(x)-1]/2.
sigma=0.1
I1=1/(sigma*np.sqrt(2*np.pi))*np.exp(-((x-mu)**2)/(2*sigma**2))

Convert signals to strictly positive PDFs

[3]:

epsilon = 1e-7
I0 = abs(I0) + epsilon
I0 = I0/I0.sum()
I1 = abs(I1) + epsilon
I1 = I1/I1.sum()

Compute forward CDT transform of \(I_1\)

[4]:

import sys
sys.path.append('../')

from pytranskit.optrans.continuous.cdt import CDT

# Create a CDT object
cdt = CDT()

# Compute the forward transform
I1_hat, I1_hat_old, xtilde = cdt.forward(x0, I0, x, I1)

# Plot I1 and I1_hat
fontSize=14
fig, ax = plt.subplots(1, 2, sharex=False, sharey=False, figsize=(15,5))
ax[0].plot(x, I1, 'r-',linewidth=2)
ax[0].set_title('Original signal',fontsize=fontSize)
ax[0].set_yticks([])

ax[1].plot(xtilde, I1_hat, 'b-',linewidth=2)
ax[1].set_title('CDT',fontsize=fontSize)
#ax[1].set_yticks([])

#ax[2].plot(xtilde, I1_hat_old, 'g-',linewidth=2)
#ax[2].set_title('CDT (old definition)',fontsize=fontSize)
#ax[2].set_yticks([])

plt.show()

../_images/tutorials_01_tutorial_cdt_12_0.png

Inverse CDT

Reconstruct the original signal \(I_1\) from \(\hat{I_1}\) using inverse CDT

[5]:

I1_hat[0] = 0
I1_recon = cdt.inverse(I1_hat, I0, x)

# Plot I1 and I1_recon
fig, ax = plt.subplots(1, 2, sharex=True, sharey=False, figsize=(10,5))
ax[0].plot(x, I1, 'r-')
ax[0].set_title('Original signal')

ax[1].plot(x, I1_recon, 'b-')
ax[1].set_title('Reconstructed signal')
#ax[1].set_yticks([])

plt.show()

../_images/tutorials_01_tutorial_cdt_14_0.png

Translation

Generate a second signal \(I_2\) which is a shifted version \(I_1\), i.e. \(I_2 = I_1(t-\tau)\). Then convert the signals into PDFs and compute CDT for both signals

[6]:

from scipy.ndimage.interpolation import shift
tau = 0.5
I1=1/(sigma*np.sqrt(2*np.pi))*np.exp(-((x-mu)**2)/(2*sigma**2))

I2 = np.interp(x-tau, x, I1)

# Convert signals to strictly positive PDFs
I1 = abs(I1) + epsilon
I1 = I1/I1.sum()
I2 = abs(I2) + epsilon
I2 = I2/I2.sum()

# Create a CDT object
cdt1 = CDT()

# Compute the forward transform
I1_hat, I1_hat_old, xtilde = cdt1.forward(x0, I0, x, I1, rm_edge=False)
I2_hat, I2_hat_old, xtilde = cdt1.forward(x0, I0, x+tau, I2, rm_edge=False)

#Plot the signals and CDTs
fig, ax = plt.subplots(1, 2, sharex=False, sharey=False, figsize=(15,7))
ax[0].plot(x, I1, 'b-')
ax[0].plot(x, I2, 'r-')
ax[0].set_title('Original signals')
ax[0].set_yticks([])
ax[0].legend([r'$I_1$',r'$I_2$'])

ax[1].plot(xtilde, I1_hat, 'b-')
ax[1].plot(xtilde, I2_hat, 'r--')
ax[1].set_yticks([])
ax[1].set_title('CDTs')
ax[1].legend([r'$\hat{I}_1$',r'$\hat{I}_2$'])

#ax[2].plot(xtilde, f1_hat, 'b-')
#ax[2].plot(xtilde, f2_hat, 'r--')
#ax[2].set_yticks([])
#ax[2].set_title('Transport map')
#ax[2].legend([r'$f_1$',r'$f_2$'])

plt.show()

../_images/tutorials_01_tutorial_cdt_16_0.png

Scaling

Generating a scaled version of \(I_1\), i.e. \(I_2 = I_1(\alpha t)\), and computing CDTs of both signals.

[7]:

from scipy.stats import norm

x = np.arange(-10,10,0.001)

N=len(x)

x0 = np.linspace(0, 1, round(N/2))
I0= np.ones(x0.size)

I1 = norm.pdf(x,0,1)

# create a scaled version of I1; i.e. I2(t) = I1(alpha*t)
alpha = 0.75
I2 = np.interp(alpha*x, x, I1)

# Convert signals to strictly positive PDFs
I0 = abs(I0) + epsilon
I0 = I0/I0.sum()
I1 = abs(I1) + epsilon
I1 = I1/I1.sum()
I2 = abs(I2) + epsilon
I2 = I2/I2.sum()

# Create a CDT object
cdt2 = CDT()

# Compute the forward transform
I1_hat, I1_hat_old, xtilde = cdt2.forward(x0, I0, x, I1, rm_edge=False)
I2_hat, I2_hat_old, xtilde = cdt2.forward(x0, I0, x/alpha, I2, rm_edge=False)

#Plot the signals and CDTs
fig, ax = plt.subplots(1, 2, sharex=False, sharey=False, figsize=(15,5))
ax[0].plot(x, I1, 'b-')
ax[0].plot(x, I2, 'r-')
ax[0].set_title('Original signals')
ax[0].set_yticks([])
ax[0].legend([r'$I_1$',r'$I_2$'])

ax[1].plot(xtilde, I1_hat, 'b-')
ax[1].plot(xtilde, I2_hat, 'r--')
ax[1].set_yticks([])
ax[1].set_title('CDTs')
ax[1].legend([r'$\hat{I}_1$',r'$\hat{I}_2$'])

#ax[2].plot(xtilde, f1_hat, 'b-')
#ax[2].plot(xtilde, f2_hat, 'r--')
#ax[2].set_yticks([])
#ax[2].set_title('Transport map')
#ax[2].legend([r'$f_1$',r'$f_2$'])

plt.show()

../_images/tutorials_01_tutorial_cdt_18_0.png

Linear Separability

Here we are defining three classes of signal, i.e. class \(k \in \{1,2,3\}\). Each class \(k\) consists of translated versions of a \(k\)-modal Gaussian distribution.

[8]:

N=250
x=np.arange(N)

# Creating reference signal I0
x0 = np.linspace(0, 1, round(N/2))
I0= np.ones(x0.size)
epsilon = 1e-7
I0 = abs(I0) + epsilon
I0 = I0/I0.sum()

# Creating three classes of translated versions of k-modal Gaussian distribution -- skolouri
K=3 # Number of classes
L=500 # Number of datapoints per class

rm_edge = False
if rm_edge:
    rml = 2
else:
    rml = 0

I=np.zeros((K,L,N))
Ihat=np.zeros((K,L,x0.size-rml))
kmodal_shift=[]
kmodal_shift.append(np.array([0]))
kmodal_shift.append(np.array([-15,15]))
kmodal_shift.append(np.array([-30,0,30]))
sigma=5
for k in range(K):
    for i,mu in enumerate(np.linspace(50,200,L)):
        for j in range(k+1):
            I[k,i,:]+=1/((k+1)*sigma*np.sqrt(2*np.pi))*np.exp(-((x-mu-kmodal_shift[k][j])**2)/(2*sigma**2))
        I[k,i,:] = abs(I[k,i,:]) + epsilon
        I[k,i,:] = I[k,i,:]/I[k,i,:].sum()
        Ihat[k,i,:], _,xtilde = cdt.forward(x0, I0, x+mu, I[k,i,:], rm_edge)
        #Ihat[k,i,:]=cdt.transform(I[k,i,:])
fig,ax=plt.subplots(1,3,figsize=(15,5))
for count,index in enumerate([0,int(L/2),L-1]):
    for k in range(K):
        ax[count].plot(x, I[k,index,:]) #template
        ax[count].set_xticks([])
        ax[count].set_yticks([])
        ax[count].legend([r'$I_1\in C_1$',r'$I_2\in C_2$',r'$I_3\in C_3$'])
plt.show()

fig,ax=plt.subplots(1,3,figsize=(15,5))
for count,index in enumerate([0,int(L/2),L-1]):
    for k in range(K):
        ax[count].plot(xtilde, Ihat[k,index,:]) #template
        ax[count].set_xticks([])
        ax[count].set_yticks([])
        ax[count].legend([r'$\hat{I}_1\in C_1$',r'$\hat{I}_2\in C_2$',r'$\hat{I}_3\in C_3$'])
plt.show()

../_images/tutorials_01_tutorial_cdt_20_0.png

../_images/tutorials_01_tutorial_cdt_20_1.png

Now we’ll do linear classification in Signal and CDT spaces. LDA is used for visualization.

[9]:

from sklearn.discriminant_analysis import LinearDiscriminantAnalysis
from sklearn.svm import LinearSVC

import warnings
warnings.filterwarnings('ignore')

X=np.reshape(I,(K*L,N))      #Combine the signals into a features vector X
Xhat=np.reshape(Ihat,(K*L,x0.size-rml))     #Combine the transformed signals into a features vector Xhat
data=[X,Xhat]
label=np.concatenate((np.zeros(L,),np.ones(L,),-1*np.ones(L,))) # Define the labels as -1,0,1 for the three classes
lda=LinearDiscriminantAnalysis(n_components=2)
svm=LinearSVC()
title_=['LDA projections in the signal space','LDA projections in the CDT space']
fig,ax=plt.subplots(1,2,figsize=(15,5))
for i in range(2):
    dataLDA=lda.fit_transform(data[i],label)
    ax[i].plot(dataLDA[:L,0],dataLDA[:L,1],'b*')
    ax[i].plot(dataLDA[L:2*L,0],dataLDA[L:2*L,1],'ro')
    ax[i].plot(dataLDA[2*L:,0],dataLDA[2*L:,1],'kx')
    x_min, x_max = ax[i].get_xlim()
    y_min, y_max = ax[i].get_ylim()
    nx, ny = 400, 200
    xx, yy = np.meshgrid(np.linspace(x_min, x_max, nx),np.linspace(y_min, y_max, ny))
    svm.fit(dataLDA,label)
    Z = svm.predict(np.c_[xx.ravel(), yy.ravel()])
    Z = Z[:].reshape(xx.shape)
    ax[i].pcolormesh(xx, yy, Z,cmap='OrRd')
    ax[i].contour(xx, yy, Z, linewidths=.5, colors='k')
    ax[i].set_xticks([])
    ax[i].set_yticks([])
    ax[i].set_title(title_[i],fontsize=fontSize)
plt.show()

../_images/tutorials_01_tutorial_cdt_22_0.png

[ ]: