Time-Series Learning using Monotonic Logical Properties

Marcell Vazquez-Chanlatte$^{\star (\text{me})}$
Shromona Ghosh$^\star$
Jyotirmoy V. Deshmukh$^{\dagger}$
Alberto Sangiovanni-Vincentelli$^{\star}$
Sanjit A. Seshia$^\star$

Slides @ mjvc.me/RV2018

$\star$: University of California-Berkeley

$\dagger$: University of Southern California

The Verification Problem

Specification Mining

An Incomplete Survery

Survey: Li, Wenchao. Specification mining: New formalisms, algorithms and applications.

$^1$Vazquez-Chanlatte et al, Logic-based Clustering and Learning for Time-Series Data, CAV 2017

$^2$ Bombara et al., Signal Clustering Using Temporal Logics, RV 2017.

Two Messages

Use Machine Learning to learn Formal Specifications.

Specification Mining

Use Formal Specifications to guide Machine Learning.

Logical Lens

This talk is about unifying these two.

Contributions

Relax restrictions of our CAV 2017 work.
Learn "traffic slow-down" spec from simple logical lens.

Logical Lens$^*$ \[\downarrow\] Logic Respecting Distance
or
Features

$^*$Logical Lens = A Parametrized family of Boolean Properties, $\phi : \text{Parameter} \to \big (\text{Data} \to \{0, 1\}\big )$, e.g. PSTL.

In memory of Oded Maler

Overlapped with my internship at Toyota.
Work on "Learning Monotone Partitions of Partially-Ordered Domains" was catalyst for this paper.

Outline

Motivating Example
Logical Lens $\to$ Logical Distance
Logical Lens $\triangleq$ Parametric Trace Property
Computing Logical Distance
Application to Highway 101 data.

Motivating Example

Goal: Detect highway slow downs.

Have rough idea of shapes to look for.

Outline

Motivating Example
Logical Lens $\to$ Logical Distance
Logical Lens $\triangleq$ Parametric Trace Property
Computing Logical Distance
Application to Highway 101 data.

Parametric Logical Formula

\[\phi(\tau, h) \triangleq \square_{[\tau, 20]}(\text{car_speed} \leq h)\]

Different parameters result in different classifications.

Platonic Ideal of a Metric

If two things are logically similar, they should be "close" together.

Platonic Ideal of a Metric

If two things are logically similar, they should be "close" together.

"If it quacks like a duck and looks like a duck$\ldots $ it should have a small distance to a duck."

Domain Independent

Time-Series using PSTL is just an example

Logically Similar

Boolean Features

\[\varphi_i : \text{Data} \to \{0, 1\}\]

Each axis represents a property.

Families of Boolean features

Sometimes need to test $\geq 3$ properties.
Consider the following (infinite) size family of Boolean features.

You must be $h$ meters tall or more to ride.

Families of Boolean features

\[\phi : \text{Height} \to \bigg ( \text{Data} \to \{0, 1\}\bigg)\]

Families of Boolean features

\[\phi : \text{Data} \to \bigg ( \text{Height} \to \{0, 1\}\bigg)\]

Parametric Specifications

\[\phi : \text{Data} \to \bigg ( {[0, 1]}^n \to \{0, 1\}\bigg)\]

Data induces a geometric structure in the parameter space.

Monotonic Specifications

\[\phi : \text{Data} \to \bigg ( {[0, 1]}^n \to \{0, 1\}\bigg)\]

Complicated geometry in general.
Many families are "monotonic".

\[\theta \leq \theta' \implies \phi(x)(\theta) \leq \phi(x)(\theta')\]

Can Forget Interior

It is sufficent to just remember the boundary (surface)

Logical Lens

Map Data to Surfaces.

CAV 2017 work can only pick out a single point.

💰Billion Dollar Question💰

Does similar data induce a similar structure??

Transform data into a surface encoding logical trade-offs.
Does this transformation give us a metric that achieves our goals?

Platonic Ideal 2.0

"If it quacks like a duck and looks like a duck$\ldots $ it should produce a similar validity domain boundary??"

Does similar data induce a similar structure??

It depends...

For many Monotonic Specifications this is true: \[ \big( x \geq \theta\big ) \text{ vs } \big(x + 0.1 \geq \theta\big) \]

Logical Distance Recipe

Define logical lens: $\phi : \text{Data} \to \bigg ( {[0, 1]}^n \to \{0, 1\}\bigg)$
Define distance metric between parameters.
Lift parameter distance to distance between boundaries. (e.g., Hausdorff Distance)

Outline

Motivating Example
Logical Lens $\to$ Logical Distance
Logical Lens $\triangleq$ Parametric Trace Property
Computing Logical Distance
Application to Highway 101 data.

Recursively Computing Boundaries

Sample points from each rectangle.
Compute Hausdorff Distance.

Technique due to Oded Maler.

Oded Maler. Learning Monotone Partitions of Partially-Ordered Domains 2017. 〈hal-01556243〉

Recursively Computing Boundaries

Steady state error example

$\Rightarrow $

Slow Downs Revisited

\[\square_{[\tau, 20]}(\text{car_speed} \leq h) \]

Slow Downs Revisited

\[\square_{[\tau, 20]}(\text{car_speed} \leq h) \]

Outline

Motivating Example
Logical Lens $\to$ Logical Distance
Logical Lens $\triangleq$ Parametric Trace Property
Computing Logical Distance
Application to Highway 101 data.

Real Highway Data Example

Want to detect traffic slow downs.
Too much data to run logical distance on!!!!
Many metric based learning algorithms are will require $O(\#\text{dataPoints}^2)$ queries.

Colyar, J., Halkias, J.: US highway 101 dataset. Federal Highway Administration (FHWA), Tech. Rep. FHWA-HRT-07-030 (2007)

Outline

Motivating Example
Logical Lens $\to$ Coarse Logical Distance
Computing Logical Distance
Application to Highway 101 data.

Coarse Logical Features

Compute intersections along a line.

Coarse Logical Features

Parameterize line between 0 and 1.

$\gamma_i : \text{Data} \to [0, 1]$.

Coarse Logical Features

Can project to multiple lines.

Outline

Motivating Example
Logical Lens $\to$ Logical Distance
Logical Lens $\triangleq$ Parametric Trace Property
Computing Logical Distance
Application to Highway 101 data.

Real Highway Data Example

Project onto two interior lines.
Label using Gaussian Mixture Model.
Label toy data to identify possible slow downs.

Distance to Idealized Slowdown

Zoom in on cluster 4 by computing distances to idealized slow down.

Extract Formula?

Labels $\to$ Parameter Ranges $\to$ Formula

Details in the CAV 2017 paper.

Logic-based Clustering and Learning for Time-Series Data, CAV 2017

Slow Down Formula

Extracting a formula:

\[ \phi(\tau, h) \triangleq \square_{[\tau, 20]}(\text{car_speed} < h) \]

\[ \psi_1 = \phi(0.25, 0.55) \wedge \neg \phi(0.38, 0.55) \wedge \neg \phi(0.25, 0.76) \]

\[ \psi_2 = \phi(0.35, 0.17) \wedge \neg \phi(0.35, 0.31) \wedge \neg \phi(0.62, 0.17) \]

\[\text{slow_down} \triangleq \psi_1 \wedge \psi_2\]

Two Messages

Use Machine Learning to learn Formal Specifications.

Specification Mining

Use Formal Specifications to guide Machine Learning.

Logical Lens

Summary

Relax restrictions of our CAV 2017 work.
Learn "traffic slow-down" spec from simple logical lens.

Logical Lens$^*$ \[\downarrow\] Logic Respecting Distance
or
Features

Future work

Performance improvements.
Kernel trick analog to learn better specifications.

Questions?

Slides @ mjvc.me/RV2018

Code @ github.com/mvcisback/multidim-threshold

A hammer looking for a nail?

A lot of great tools to time-series learning: Dynamic Time Warping, Shaplets, PCA, ...
All of these techniques are fantastic!
But you know what's better? A good feature.

Feature Engineering is King

"Coming up with features is difficult, time-consuming, requires expert knowledge. "Applied machine learning" is basically feature engineering."

- Andrew Ng,
"Machine Learning and AI via Brain simulations"

" …some machine learning projects succeed and some fail. What makes the difference? Easily the most important factor is the features used."

- Pedro Domingos,
"A Few Useful Things to Know about Machine Learning"

... or at least is was before Deep Learning

Designing features is HARD

Need to embed non-trivial domain specific knowledge into the feature.
Domain specific knowledge is typically stuck in the head of a non-ML expert.
ML techniques can be sensitive to encoding of features.

This work focuses on try to make designing features slightly less hard (e.g., time series).

Validity Domain

\[\phi : \text{Data} \to \bigg ( {[0, 1]}^n \to \{0, 1\}\bigg)\]

\[V_\phi(x) \triangleq ~\text{preImg}_{\phi(x)}[\{1\}] = \{\theta \in {[0, 1]}^n~.~\phi(x)(\theta) = 1\}\]

What's to come

(3/5) Derive a "logical distance" under this lens.

\[\phi(\tau, h) \triangleq \square_{[\tau, 20]}(\text{car_speed} \leq h)\]

What's to come

(4/5) Apply to real highway data.

\[\phi(\tau, h) \triangleq \square_{[\tau, 20]}(\text{car_speed} \leq h)\]

What's to come

(5/5) Characterize slow downs in this dataset.

\[\phi(\tau, h) \triangleq \square_{[\tau, 20]}(\text{car_speed} \leq h)\]

\[ \begin{split}\psi_1 \triangleq \phi(0.25, 0.55) &\wedge \neg \phi(0.38, 0.55) \\&\wedge \neg \phi(0.25, 0.76)\end{split} \]

\[ \begin{split}\psi_2 \triangleq \phi(0.35, 0.17) &\wedge \neg \phi(0.35, 0.31) \\&\wedge \neg \phi(0.62, 0.17)\end{split} \]

\[\text{slow_down} \triangleq \psi_1 \wedge \psi_2\]

Distance between Parameters

Infinity Norm: Let the distance between two parameters be the difference in their largest component, e.g,

\[ \text{height} \mapsto h, \text{weight} \mapsto w \] \[ d_\infty((h_1, w_1), (h_2, w_2)) \triangleq \max(|h_1 - h_2|, |w_1 - w_2|)\]

Distance between Boundaries

Hausdorff Distance: Let $A$ and $B$ be two sets of parameters:

\[ d^H_\infty(A, B) \triangleq \max(\sup_{a \in A} \inf_{b \in B} d_\infty(a, b), \sup_{b \in B} \inf_{a \in A} d_\infty(a, b))\]

Distance between Boundaries

\[ d^H_\infty(A, B) \triangleq \max(\sup_{a \in A} \inf_{b \in B} d_\infty(a, b), \sup_{b \in B} \inf_{a \in A} d_\infty(a, b))\]

Interpretation: The curves "$\epsilon$-bisimulate" each other.

Two player game where player 1 selects a point, $a \in A$ and then player 2 selects a point $b\in B$.
Player 1 wants to maximize distance between and $a$ and $b$.
Player 2 wants to minimize the distance.
They then switch roles.

Time-Series Learning using Monotonic Logical Properties

The Verification Problem

Specification Mining

An Incomplete Survery

Two Messages

Use Machine Learning to learn Formal Specifications.

Specification Mining

Use Formal Specifications to guide Machine Learning.

Logical Lens

Contributions

Logical Lens\(^*\) \[\downarrow\] Logic Respecting Distance or Features

In memory of Oded Maler

Outline

Motivating Example

Outline

Parametric Logical Formula

Platonic Ideal of a Metric

Platonic Ideal of a Metric

Domain Independent

Logically Similar

Boolean Features

Families of Boolean features

Families of Boolean features

Families of Boolean features

Parametric Specifications

Monotonic Specifications

Can Forget Interior

Logical Lens

💰Billion Dollar Question💰

Platonic Ideal 2.0

Logical Distance Recipe

Outline

Recursively Computing Boundaries

Recursively Computing Boundaries

Slow Downs Revisited

Slow Downs Revisited

Outline

Real Highway Data Example

Outline

Coarse Logical Features

Coarse Logical Features

Coarse Logical Features

Coarse Logical Features

Outline

Real Highway Data Example

Distance to Idealized Slowdown

Extract Formula?

Slow Down Formula

Two Messages

Use Machine Learning to learn Formal Specifications.

Specification Mining

Use Formal Specifications to guide Machine Learning.

Logical Lens

Summary

Logical Lens\(^*\) \[\downarrow\] Logic Respecting Distance or Features

Future work

Questions?

A hammer looking for a nail?

Feature Engineering is King

Designing features is HARD

Validity Domain

What's to come

What's to come

What's to come

Distance between Parameters

Distance between Boundaries

Distance between Boundaries

Logical Lens\(^*\) \[\downarrow\] Logic Respecting Distance
or
Features

Logical Lens\(^*\) \[\downarrow\] Logic Respecting Distance
or
Features