Converting test-methods into ConTeXt

2018-07-31 10:22:13 +02:00 · 2018-07-31 10:22:13 +02:00 · fefc170d4b
commit fefc170d4b
parent e70c3947e9
7 changed files with 915 additions and 0 deletions
--- a/content.tex
+++ b/content.tex
@ -0,0 +1,28 @@
 \project thesis
 \startproduct content
 \starttext
 \completecontent
 \completelistoffigures
 \completelistoftables
 \completelistofalgorithms
 \showbodyfont
 \chapter{Intro}
 \section{Learning and Testing}
 \section{Nominal Techniques}
 \component content/test-methods
 \chapter{Learning Embedded Stuff}
 \chapter{Separating Sequences}
 \chapter{Learning Nominal Automata}
 \chapter{Ordered Nominal Sets}
 \chapter{Succinct Nominal Automata?}
 \stoptext
 \stopproduct
--- a/content/test-methods.tex
+++ b/content/test-methods.tex
@ -0,0 +1,785 @@
 \project thesis
 \startcomponent test-methods
 %\usepackage[vlined]{algorithm2e}
 %
 %\newcommand{\todo}[1]{{\bf * }\marginpar{#1}}
 %
 %\newcommand{\Def}[1]{\emph{#1}}
 %\newcommand{\bigO}{\mathcal{O}}
 %\newcommand{\N}{\mathbb{N}}
 %\newcommand{\tot}[1]{\xrightarrow{\,\,{#1}\,\,}}
 \startchapter[title={A Framework for FSM-based Test Methods}]
 \startsection[title={Introduction}]
 \stopsection
 \startsection[title={Preliminaries}]
 \startsubsection[title={Mealy machines}]
 We will focus on Mealy machines, as those capture many protocol specifications and reactive systems.
 Note that we restrict ourselves to deterministic and complete machines.
 We fix finite alphabets $I$ and $O$ of inputs respectively outputs.
 We use the usual notation for operations on words:
 $uv$ for the concatenation of two words $u, v \in I^{\ast}$ and $|u|$ for the length of $u$.
 \startdefinition
 A \emph{Mealy machine} $M$ consists of a finite set of \emph{states} $S$, an \emph{initial state} $s_0 \in S$ and two functions:
 \startitemize
 \item a \emph{transition function} $\delta \colon S \times I \to S$, and
 \item an \emph{output function} $\lambda \colon S \times I \to O$.
 \stopitemize
 Both functions are extended to words as $\delta \colon S \times I^{\ast} \to S$ and $\lambda \colon S \times I^{\ast} \to O^{\ast}$.
 By abuse of notation we will often write $s \in M$ instead of $s \in S$ and leave out $S$ altogether.
 For a second Mealy machine $M'$ its members are denoted $S', s'_0, \delta'$ and $\lambda'$ by convention.
 The \emph{behaviour} of a state $s$ is given by the output function $\lambda(s, -) \colon I^{\ast} \to O^{\ast}$.
 Two states $s$ and $t$ are \emph{equivalent} if they have equal behaviours, written $s \sim t$, and two Mealy machines are equivalent if their initial states are equivalent.
 \stopdefinition
 An example Mealy machine is given in \in{Figure}[fig:running-example].
 \startplacefigure[title={An example specification with input $I=\{a,b,c\}$ and output $O=\{0,1\}$.},reference=fig:running-example,list={An example specification.}]
 \hbox{
 \starttikzpicture[shorten >=2pt,node distance=0.9cm and 3cm,bend angle=20]
 	\tikzstyle{every state}=[draw=black,text=black,inner
 	sep=2pt,minimum size=12pt,initial text=]
 	\node[state] (0) {$s_0$};
 	\node[state] (1) [above left = of 0] {$s_1$};
 	\node[state] (2) [below left = of 0] {$s_2$};
 	\node[state] (3) [above right = of 0] {$s_3$};
 	\node[state] (4) [below right = of 0] {$s_4$};
 	\node        (5) [above = of 0] {};
 	\path[->]
 	(5) edge (0)
 	(0) edge [bend left=]    node [below] {${a}/1$} (1)
 	(0) edge [bend right]    node [below] {${b}/0$} (4)
 	(0) edge [loop below]    node [below] {${c}/0$} (0)
 	(1) edge                 node [left ] {${a}/0$} (2)
 	(1) edge [bend left]     node [above] {${b}/0$, ${c}/0$} (0)
 	(2) edge [bend right]    node [below] {${b}/0$, ${c}/0$} (0)
 	(2) edge [loop left]     node [left ] {${a}/1$} (1)
 	(3) edge [bend left=30]  node [right] {${a}/1$} (4)
 	(3) edge [bend right]    node [above] {${b}/0$} (0)
 	(3) edge [loop right]    node [right] {${c}/1$} (3)
 	(4) edge [bend left]     node [right] {${a}/1$} (3)
 	(4) edge [loop right]    node [right] {${b}/1$} (4)
 	(4) edge [bend right]    node [above] {${c}/0$} (0);
 \stoptikzpicture
 }
 \stopplacefigure
 \stopsubsection
 \startsubsection[title={Testing}]
 In conformance testing we have a specification modelled as a Mealy machine and an implementation (the system under test, or SUT) which we assume to behave as a Mealy machine \cite{LeeYannakakis}.
 Tests are generated from the specification and applied to the implementation. We assume that we can reset the implementation before every test.
 If the output is different than the specified output, then we know the implementation is flawed.
 The goals is to test as little as possible, while covering as much as possible.
 \startdefinition[reference=test-suite]
 A \emph{test suite} is a finite subset $T \subseteq I^{\ast}$.
 A test $t \in T$ is called \emph{maximal} if it is not a proper prefix of another $s \in T$.
 We denote the set of maximal tests of $T$ with $max(T)$.
 \todo{In de voorbeelden geef ik altijd $max(X)$,
 	maar in de bewijzen is juist $Pref(X)$ handig.
 	Moet dit consistent houden...}
 \stopdefinition
 The maximal tests are the only tests in $T$ we actually have to apply to our SUT as we can record the intermediate outputs.
 Also note that we do not have to consider outputs in the test suite, as those follow from the deterministic specification.
 We define the size of a test suite as usual \cite{Dorofeeva,PLGHO14}.
 The size of a test suite is measured as the sum of the lengths of all its maximal tests plus one reset per test.
 \startdefinition[reference=test-suite-size]
 The \emph{size} of a test suite $T$ is defined to be
 $||T|| = \sum\limits_{t \in max(T)} (|t| + 1)$.
 \stopdefinition
 \stopsubsection
 \startsubsection[title={Completeness of test suites}]
 \startexample[reference=incompleteness]
 {\bf No test suite is complete.}
 Consider the specification in \in{Figure}{a}[fig:incompleteness-example].
 This machine will always produce zeroes.
 For any test suite we can make a faulty implementation which passes the test suite.
 Such an implementation might look like \in{Figure}{b}[fig:incompleteness-example] with $n$ big enough.
 This justifies the following definition.
 \startplacefigure[title={A basic example showing that finite test suites are incomplete. The	implementation on the right will pass any test suite if we choose $n$ big enough.},reference=fig:incompleteness-example]
 \startcombination[2*1]
 {\hbox{
 	\starttikzpicture[shorten >=2pt,node distance=2cm]
 	\tikzstyle{every state}=[draw=black,text=black,inner
 	sep=2pt,minimum size=12pt,initial text=]
 	\node[state] (0) {$s_0$};
 	\path[->]
 	(0) edge [loop] node [below] {${a}/0$} (0);
 	\stoptikzpicture
 }} {a}
 {\hbox{
 	\starttikzpicture[shorten >=2pt,node distance=2cm]
 	\tikzstyle{every state}=[draw=black,text=black,inner
 	sep=2pt,minimum size=12pt,initial text=]
 	\node[state] (0) {$s'_0$};
 	\node[state] (1) [right of=0] {$s'_1$};
 	\node        (2) [right of=1] {$\cdots$};
 	\node[state] (3) [right of=2] {$s'_n$};
 	\path[->]
 	(0) edge [bend left=20 ] node [below] {${a}/0$} (1)
 	(1) edge [bend left=20 ] node [below] {${a}/0$} (2)
 	(2) edge [bend left=20 ] node [below] {${a}/0$} (3)
 	(3) edge [loop         ] node [below] {${a}/1$} (3);
 	\stoptikzpicture
 }} {b}
 \stopcombination
 \stopplacefigure
 \stopexample
 \startdefinition[reference=completeness]
 Let $M$ be a Mealy machine and $T$ be a test suite.
 We say that $T$ is \emph{$m$-complete (for $M$)} if for all inequivalent machines $M'$ with at most $m$ states there exists a $t \in T$ such that $\lambda(s_0, t) \neq \lambda'(s'_0, t)$.
 \stopdefinition
 We are often interested in the case of $m$-completeness, where $m = n + k$ for some $k \in \naturalnumbers$ and $n$ is the number of states in the specification.
 Here $k$ will stand for the number of \emph{extra states} we can test.
 The issue of an unknown bound is addressed later in the paper.
 \stopsubsection
 \startsubsection[title={State identifiers, access sequences and sets of words}]
 \define[1]\Fam{\mathcal{#1}}
 In order to define a test suite modularly, we introduce notation for combining sets of words.
 We require all sets to be \emph{prefix-closed}, this is very convenient in later proofs.
 For sets of words $X$ and $Y$, we define:
 \startitemize
 \item their concatenation $X \cdot Y = \{ xy \mid x \in X, y \in Y \}$,
 \item iterated concatenation $X^0 = \{ \epsilon \}$ and $X^{n+1} = X \cdot X^{n}$,
 \item bounded concatenation $X^{\leq n} = \bigcup_{i \leq n} X^i$, and
 \item prefix closure $Pref(X) = \{ y \mid y \text{ is a prefix of } x, x \in X \}$.
 \stopitemize
 For a set $S$, a family of sets $\Fam{X}$ is a set of sets indexed by $S$:
 $\Fam{X} = \{ X_s \}_{s \in S}$.
 We will use families to define words relevant for a state $s$.
 We define:
 \startitemize
 \item flattening: $\bigcup \Fam{X} = \{ x \mid x \in X_s, s \in S \}$,
 \item union: $\Fam{X} \cup \Fam{Y}$ is defined point-wise:
 $(\Fam{X} \cup \Fam{Y})_s = X_s \cup Y_s$, and
 \item $\Fam{X}$-equivalence:
 $x \sim_\Fam{X} y$ if $\lambda(x,w) = \lambda(y,w)$ for all $w \in X_x \cap X_y$.
 \todo{Hangt af van $M$, maar is nog niet gefixed}
 \stopitemize
 \startdefinition
 An \emph{access sequence for state $s$} is a word $w$ such that $\delta(q_0, w) = s$.
 A set $P$ consisting of an access sequence for each state is called a \emph{state cover}.
 If $P$ is a state cover, $P \cdot I$ is called a \emph{transition cover}.
 \stopdefinition
 \startdefinition
 A family of sets $\Fam{X}$ is a \emph{separating family} (or set of \emph{harmonized state identifiers} \cite{YP90}) if $x \sim_\Fam{X} y$ implies $x \sim y$.
 \stopdefinition
 In other words, a family of words is a separating family if for each pair of inequivalent states it contains a word separating those states.
 Following the definitions in \cite{LeeYannakakis}, a separating family where each set is a singleton is an \emph{adaptive distinguishing sequence} (ads).
 An ads is of special interest since they can identify a state using a single word.
 Separating families always exist for Mealy machines, but an ads might not exist.
 Given a specification $M$ (with states $S$) we define two operations:
 \startitemize
 \item concatenation:
 $X \odot \Fam{Y} = \{ xy \mid x \in X, y \in Y_{\delta(s_0, x)} \}$ and
 \item refinement: $\Fam{X} ; \Fam{Y}$ defined by
 $(\Fam{X} ; \Fam{Y})_s = X_s \cup \bigcup_{y \sim_\Fam{X} x} D(\Fam{Y}, x, y)$, where
 $D(\Fam{Y}, x, y) = Pref\{w \mid \lambda(x,w) \neq \lambda(y,w), w \in Y_x \cap Y_y\}$.
 \stopitemize
 Note that these operations depend on the specification $M$.
 We omit $M$ from the notation as the specification is always clear from the context.
 We will use the following facts in defining our new test method.
 \startlemma[reference=lemma:refinement-props]
 For all families $\Fam{X}$ and $\Fam{Y}$:
 \startitemize
 \item $\Fam{X} ; \Fam{X} = \Fam{X}$,
 \item $\Fam{X} ; \Fam{Y} = \Fam{X}$, whenever $\Fam{X}$ is a separating family, and
 \item $\Fam{X} ; \Fam{Y}$ is a separating family whenever $\Fam{Y}$ is a separating family.
 \stopitemize
 \stoplemma
 \stopsubsection
 \stopsection
 \startsection[title={Hybrid adaptive distinguishing sequences}]
 In this section we describe a new test generation method for Mealy machines.
 Its completeness will be proven in a later section, together with completeness for related, known, methods.
 From a high level perspective, the method uses the algorithm by Lee and Yannakakis \cite{LeeYannakakis} to obtain an ads.
 If no ads exists, their algorithm still provides some sequences which separates some inequivalent states.
 Our extension is to refine the set of sequences by using pairwise separating sequences.
 The reason we do this is the fact that the ADS-method generally constructs small test suites \cite{Dorofeeva}.
 The test suites are small since an ads can identify a state with a single word, instead of a set of words which is generally needed.
 Even if the ads does not exist, using the partial result of Lee and Yannakakis' algorithm can reduce the size of test suites.
 We will describe how to obtain a separating families and adaptive distinguishing sequences for Mealy machines.
 For the former, one typically uses Moore's or Hopcroft's minimization algorithm \cite{Smetsers}.
 For the latter, an efficient algorithm is described in \cite{LeeYannakakis}.
 Both algorithms use a splitting tree as data structure.
 \startdefinition[reference=splitting-tree]
 A \defn{splitting tree (for $M$)} is a rooted tree where each node $u$ has
 \startitemize
 	\item a set of states $l(u) \subseteq M$, and
 	\item if $u$ is not a leaf, a sequence $\sigma(u) \in I^{\ast}$.
 \stopitemize
 We require that if a node $u$ has children $C(u)$ then
 \startitemize
 	\item the sets of states of the children of $u$ partition $l(u)$, i.e., the set
 	$P(u) = \{l(v) \,|\, v \in C(u)\}$ is a partition of $l(u)$, and
 	\item the sequence $\sigma(u)$ witnesses the partition $P(u)$, meaning that
 	$\lambda(s, \sigma(u)) = \lambda(t, \sigma(u))$ iff $p = q$ for all $s \in p, t \in q$ for all $p, q \in P(u)$.
 \stopitemize
 A splitting tree is called \defn{complete} if all inequivalent states belong to	different leaves.
 \stopdefinition
 The splitting tree records the execution of a partition refinement algorithm (such as Moore's or Hopcroft's algorithm).
 Each non-leaf node encode a \defn{split} together with a witness.
 Since a complete splitting tree contains witnesses for all inequivalences, we can extract a separating family from it.
 Briefly, for each state we define the set of words as follows: locate the leaf containing the state and collect all the sequences you read when traversing to the root of the tree.
 We refer to \cite{Smetsers} for more details on these algorithms.
 For adaptive distinguishing sequences an additional requirement is put on the splitting tree:
 for each non-leaf node $u$, the sequence $\sigma(u)$ defines an injective map $x \mapsto (\delta(x, \sigma(u)), \lambda(x, \sigma(u)))$ on the set $l(u)$.
 In \cite{LeeYannakakis} such splits are called \defn{valid}.
 \in{Figure}[fig:example-splitting-tree] shows both a valid and invalid split.
 Validity precisely ensures that after performing a split, the states are still distinguishable.
 \startplacefigure[title={A complete splitting tree with shortest witnesses for the specification of \in{Figure}[fig:running-example].	Only the splits $a$ and $aa$ are valid.},
 reference=fig:example-splitting-tree]
 \hbox{
 \starttikzpicture[node distance=1.4cm]
 \node (0) [text width=2cm, align=center, ]                 {$s_0, s_1, s_2, s_3, s_4$ $a$};
 \node (1) [text width=1cm, align=center, below left of=0 ] {$s_1$};
 \node (2) [text width=2cm, align=center, below right of=0] {$s_0, s_2, s_3, s_4$ $b$};
 \node (3) [text width=1cm, align=center, below left of=2 ] {$s_0, s_2, s_3$ $c$};
 \node (4) [text width=1cm, align=center, below right of=2] {$s_4$};
 \node (5) [text width=1cm, align=center, below left of=3 ] {$s_0, s_2$ $aa$};
 \node (6) [text width=1cm, align=center, below right of=3] {$s_3$};
 \node (7) [text width=1cm, align=center, below left of=5 ] {$s_0$};
 \node (8) [text width=1cm, align=center, below right of=5] {$s_2$};
 \path[->]
 (0) edge (1)
 (0) edge (2)
 (2) edge (3)
 (2) edge (4)
 (3) edge (5)
 (3) edge (6)
 (5) edge (7)
 (5) edge (8);
 \stoptikzpicture
 }
 \stopplacefigure
 \startfact
 A complete splitting tree with valid splits exists if and only if there exists an adaptive distinguishing sequence \cite{LeeYannakakis}.
 \stopfact
 Our method uses the exact same algorithm as the one by Lee and Yannakakis.
 However, we also apply it in the case when the splitting tree with valid splits is not complete (and hence no adaptive distinguishing sequence exists).
 Their algorithm still produces a family of sets, but is not necessarily a separating family.
 In order to recover separability, we refine that family of sets.
 Let $\Fam{Z'}$ be the result of Lee and Yannakakis' algorithm (to distinguish from their notation, we add a prime) and let $\Fam{H}$ be a separating family extracted from an ordinary splitting tree.
 The hybrid ADS family is defined as $\Fam{Z'} ; \Fam{H}$, and can be computed as sketched in \in{Algorithm}[alg:hybrid].
 By \in{Lemma}[lemma:refinement-props] we note the following: in the best case this family is an adaptive distinguishing sequence; in the worst case it is equal to $\Fam{H}$; and in general it is a combination of the two families.
 For this reason we call the method a hybrid method.
 \startplacealgorithm[title={Obtaining the hybrid separating family $\Fam{Z'} ; \Fam{H}$},reference=alg:hybrid]
 \startalgorithmic
 	\REQUIRE{A Mealy machine $M$}
 	\ENSURE{A separating family $Z'$}
 	\STATE{$T_1 \leftarrow$ splitting tree for Moore's minimization algorithm}
 	\STATE{$\Fam{H} \leftarrow$ separating family extracted from $T_1$}
 	\STATE{$T_2 \leftarrow$ splitting tree with valid splits (see \cite{LeeYannakakis})}
 	\STATE{$\Fam{Z'} \leftarrow$ (incomplete) family as constructed from $T_2$}
 	\FORALL{inequivalent states $s, t$ in the same leaf of $T_2$}{
 		\STATE{$Z'_s \leftarrow Z_s \cup H_s$}
 		\STATE{$Z'_t \leftarrow Z_t \cup H_t$}
 	}
 	\ENDFOR
 	\RETURN{$Z'$}
 \stopalgorithmic
 \stopplacealgorithm
 With the hybrid family we can define the test suite as follows.
 In words, the suite consists of tests of the form $p w s$, where $p$ brings the SUT to a certain state, $w$ is to detect $k$ extra states and $s$ is to identify the state.
 Its $m$-completeness is proven in Section~\ref{sec:completeness}.
 \startdefinition
 Let $P$ be a state cover,
 $\Fam{Z'}$ be a family of sets constructed with the Lee and Yannakakis algorithm and
 let $\Fam{H}$ be a separating family.
 The \defn{hybrid ADS} test suite is
 \startformula
 T = P \cdot I^{\leq k+1} \odot (\Fam{Z'} ; \Fam{H}).
 \stopformula
 \stopdefinition
 \startsubsection[title={Example}]
 \todo{Beter opschrijven, en uiteindelijke test suite geven.
 	En vergelijken met andere methode? Misschien later pas.}
 In the figure we see the (unique) result of Lee and Yannakakis' algorithm.
 We note that the states $s_2, s_3, s_4$ are not split, so we need to refine the family for those states.
 \todo{Hier was ik gebleven}
 \stopsubsection
 \stopsection
 \stopchapter
 \stopcomponent
 \begin{figure}
 \centering
 \begin{subfigure}{0.45\textwidth}
 	\centering
 	\begin{tikzpicture}[node distance=1.4cm]
 	\node (0) [text width=2cm, align=center, ]                 {$s_0, s_1, s_2, s_3, s_4$\\$a$};
 	\node (1) [text width=1cm, align=center, below left of=0 ] {$s_1$};
 	\node (2) [text width=2cm, align=center, below right of=0] {$s_0, s_2, s_3, s_4$\\$aa$};
 	\node (3) [text width=1cm, align=center, below left of=2 ] {$s_0$};
 	\node (4) [text width=1cm, align=center, below right of=2] {$s_2, s_3, s_4$};
 	\path[->]
 	(0) edge (1)
 	(0) edge (2)
 	(2) edge (3)
 	(2) edge (4);
 	\end{tikzpicture}
 	\caption{Largest splitting tree with only valid splits for Figure~\ref{fig:running-example}.}
 	\label{fig:example-splitting-tree-valid}
 \end{subfigure}~
 \begin{subfigure}{0.45\textwidth}
 	\centering
 	\begin{tikzpicture}[node distance=1.4cm]
 	\node (0) [text width=2cm, align=center, ]                 {\tiny $s_0, s_1, s_2, s_3, s_4$\\$s_0, s_1, s_2, s_3, s_4$\\$a$};
 	\node (1) [text width=1cm, align=center, below left of=0 ] {\tiny $s_1$\\$s_2$};
 	\node (2) [text width=2cm, align=center, below right of=0] {\tiny $s_0, s_2, s_3, s_4$\\$s_1, s_2, s_4, s_3$\\$a$};
 	\node (3) [text width=1cm, align=center, below left of=2 ] {\tiny $s_0$\\$s_2$};
 	\node (4) [text width=1cm, align=center, below right of=2] {\tiny $s_2, s_3, s_4$\\$s_2, s_3, s_4$};
 	\path[->]
 	(0) edge (1)
 	(0) edge (2)
 	(2) edge (3)
 	(2) edge (4);
 	\end{tikzpicture}
 	\caption{The adaptive distinguishing tree in notation of \cite{LeeYannakakis}.}
 	\label{fig:example-splitting-tree-ads}
 \end{subfigure}
 \end{figure}
 From the splitting tree in Figure~\ref{fig:example-splitting-tree}, we obtain the following separating family $\Fam{H}$.
 From the figure above we obtain the family $\Fam{Z}$.
 These families and the refinement $\Fam{Z'};\Fam{H}$ are given below:
 \begin{center}
 \begin{tabular}{l @{\hspace{1cm}} l @{\hspace{1cm}} l}
 	$H_0 = \{aa,b,c\}$ & $Z'_0 = \{aa\}$ & $(Z';H)_0 = \{aa\}$ \\
 	$H_1 = \{a\}$      & $Z'_1 = \{a\}$  & $(Z';H)_1 = \{a\}$  \\
 	$H_2 = \{aa,b,c\}$ & $Z'_2 = \{aa\}$ & $(Z';H)_2 = \{aa,b,c\}$ \\
 	$H_3 = \{a,b,c\}$  & $Z'_3 = \{aa\}$ & $(Z';H)_3 = \{a,b,c\}$ \\
 	$H_4 = \{a,b\}$    & $Z'_4 = \{aa\}$ & $(Z';H)_4 = \{aa,b\}$
 \end{tabular}
 \end{center}
 \subsection{When $k$ is not known}
 In many of the applications described in Section~\ref{sec:applications} no bound on the number of states of the SUT was known.
 In such cases it is possible to randomly select test cases from an infinite test suite.
 Unfortunately, we lose the theoretical guarantees of completeness with random generation.
 Still, for the applications in Section~\ref{sec:applications} it has worked well in finding flaws.
 We can randomly test cases as follows.
 In the above definition for the hybrid ADS test suite we replace $I^{\leq k}$ by $I^{\ast}$ to obtain an infinite test suite.
 Then we sample tests as follows:
 1) sample an element $p$ from $P$ uniformly,
 2) sample a word $w$ from $I^{\ast}$ with a geometric distribution then
 3) sample uniformly from $(\Fam{Z'} ; \Fam{H})_s$ for the state $s = \delta(s_0, pw)$.
 \section{Related test methods}
 \label{sec:methods}
 In this section, we briefly review the classical conformance testing methods:
 the W, Wp, UIO, UIOv, HSI, ADS methods.
 Our method described is very similar to some of these methods, so we will
 relate them by describing them uniformly.
 There are many more test generation methods. Literature shows, however, that not
 all of them are complete. For example, the methods in \cite{Bernhard} are
 falsified by \cite{Petrenko} and the UIO-method \cite{Sabnani} is shown to be
 incomplete in \cite{Vuong}. For that reason, completeness of the correct methods
 is shown (again) in the next section.
 We fix a state cover $P$ throughout this section and take the transition cover $Q = P \cdot I$.
 \subsection{W-method \cite{Chow,Vasilevskii}}
 Possibly one of the earliest $m$-complete test methods.
 \begin{mdefinition}\label{w-method}
 	A set of words $W$ is a \Def{characterization set} if for each pair of inequivalent states $s$ and $t$ there exists a word $w \in W$ with $\lambda(s,w) \neq \lambda(t,w)$.
 	\newline
 	Let $W$ be a characterization set, the \Def{W test suite} is
 	defined as $(P \cup Q) \cdot I^{\leq k} \cdot W$.
 \end{mdefinition}
 If we have a separating family $\Fam{W}$, we can obtain a characterization
 set by flattening: take $W = \bigcup \Fam{W}$.
 \subsection{The Wp-method \cite{Fujiwara}}\label{sec:wp}
 The W-method was refined by Fujiwara to use smaller sets when identifying states.
 In order to do that he defined state-local sets of words.
 \begin{mdefinition}
 \label{state-identifier}
 \label{wp-method}
 A \Def{state identifier for $s$} is a set $W_s$ such that for every inequivalent
 state $t$ there is a $w \in W_s$ such that $\lambda(s, w) \neq \lambda(t, w)$.
 \newline
 Let $\Fam{W}$ be a family of state identifiers, the \Def{Wp test suite} is
 defined as $P \cdot I^{\leq k} \cdot \bigcup \Fam{W} \,\cup\, Q \cdot I^{\leq k}
 \odot \Fam{W}$.
 \end{mdefinition}
 Note that $\bigcup \Fam{W}$ is a characterization set as defined for the W-method.
 It is needed for completeness to test states with the whole set $\bigcup \Fam{W}$.
 Once states are tested as such, we can use the smaller sets $W_s$ for testing transitions.
 \subsection{The HSI-method \cite{Luo,YP90}}\label{sec:hsi}
 The Wp-method in turn was refined by Yevtushenko and Petrenko.
 They make use of so called \emph{harmonized} state identifiers (which are called
 separating families in \cite{LeeYannakakis} and in present paper).
 By having this global property of the family, less tests need to
 be executing when testing a state.
 \begin{mdefinition}\label{hsi-method}
 Let $\Fam{H}$ be a separating family, the \Def{HSI test suite} is defined as
 $(P \cup Q) \cdot I^{\leq k} \odot \Fam{H}$.
 \end{mdefinition}
 Our newly described test method is an instance of the HSI-method.
 However, in \cite{Luo,YP90} they describe the HSI-method together with a
 specific way of generating the separating families. Namely, the set obtained
 by a splitting tree with shortest witnesses. In present paper that
 is generalized, allowing for our extension to be an instance.
 \subsection{The ADS-method \cite{LeeYannakakis}}\label{sec:ads}
 As discussed before, when a Mealy machine admits a adaptive distinguishing sequence,
 only one test has to be performed for identifying a state.
 This is exploited in the ADS-method.
 \begin{mdefinition}\label{ds-method}
 Let $\Fam{Z}$ be a separating family where every set is a singleton, the
 \Def{ADS test suite} is defined as $(P \cup Q) \cdot I^{\leq k} \odot \Fam{Z}$.
 \end{mdefinition}
 \subsection{The UIOv-method \cite{Vuong}}\label{sec:uiov}
 Some Mealy machines which do not admit an adaptive distinguishing sequence,
 may still admit state identifiers which are singletons.
 These are exactly UIO sequences and gives rise to the UIOv-method.
 In a way this is a generalization of the ADS-method, since the requirement
 that state identifiers are harmonized is dropped.
 \begin{mdefinition}\label{uio}\label{uiov-method}
 	Given $s \in M$, we say that a word $w \in I^{\ast}$ is an \Def{UIO sequence for
 		$s$} if for all inequivalent $t \in M$ we have $\lambda(s, w) \neq \lambda(t,
 	w)$.
 	\newline
 	Let $\Fam{U} = \{ \text{a single UIO for } s \}_{s \in S}$ be a family of UIO
 	sequences, the \Def{UIOv test suite} is defined as $P \cdot I^{\leq k} \cdot
 	\bigcup \Fam{U} \,\cup\, Q \cdot I^{\leq k} \odot \Fam{U}$.
 \end{mdefinition}
 \subsection{Overview}
 \label{sec:overview}
 We give an overview of the aforementioned test methods.
 We classify them in two directions,
 1) whether they use harmonized state identifiers or not and
 2) whether it used singleton state identifiers or not.
 \begin{theorem}\label{thm:completeness}
  The following test suites are all $n+k$-complete:
 \begin{center}
 \begin{tabular}{l|p{4.5cm}|p{4.5cm}|}
 % empty cell
  & Arbitrary
  & Harmonized
  \\ \hline
 Many / pairwise
  & Wp  \newline $ P \cdot I^{\leq k} \cdot \bigcup \Fam{W} \,\cup\, Q \cdot I^{\leq k} \odot \Fam{W} $
  & HSI \newline $ (P \cup Q) \cdot I^{\leq k} \odot \Fam{H} $
  \\ \hline
 Hybrid
  & 
  & Hybrid ADS \newline
    $ (P \cup Q) \cdot I^{\leq k} \odot (\Fam{Z'} ; \Fam{H}) $
  \\ \hline
 Single / global
  & UIOv \newline $ P \cdot I^{\leq k} \cdot \bigcup \Fam{U} \,\cup\, Q \cdot I^{\leq k} \odot \Fam{U} $
  & ADS  \newline $ (P \cup Q) \cdot I^{\leq k} \odot \Fam{Z} $
  \\ \hline
 \end{tabular}
 \end{center}
 \end{theorem}
 \todo{Iets zeggen over de hybrid UIO method}
 \todo{Geef groottes van suites voor het voorbeeld. Merk op dat ADS and UIOv niet van toepassing zijn (state 3 heeft geen UIO)}
 The incomplete \Def{UIO test suite} is defined as $(P \cup Q) \cdot
 I^{\leq k} \odot \Fam{U}$, incompleteness is shown in \cite{Vuong}.
 It should be noted that the ADS-method is a specific instance of the HSI-method
 and similarly the UIOv-method is an instance of the Wp-method. What is
 generally meant by the Wp-method and HSI-method is the above formula together
 with a particular way to obtain the (harmonised) state identifiers.
 We are often interested in the size of the test suite. In the worst case, all
 methods generate a test suite with a size in $\bigO(pn^3)$ and this bound is
 tight \cite{Vasilevskii}. Nevertheless we expect intuitively that the right column
 performs better, as we are using a more structured set (given a separating
 family for the HSI-method, we can always forget about the common prefixes and
 apply the Wp-method, which will never be smaller if constructed in this way).
 Also we expect the bottom row to perform better as there is a single test for
 each state. Small experimental results confirm this intuition
 \cite{Dorofeeva}.
 \section{Proof of completeness}
 \label{sec:completeness}
 We fix a specification $M$ which has a minimal representative with $n$ states and an implementation $M'$ with at
 most $n+k$ states. We assume that all states are reachable from the initial
 state in both machines (i.e. both are \Def{connected}). We define the following
 notation:
 \begin{mdefinition}
 	Let $W \subseteq I^{\ast}$ be a set of words. Two states $x, y$ (of possibly
 	different machines) are $W$-equivalent, written $x \sim_W y$, if $\lambda(x,
 	w) = \lambda(y, w)$ for all $w \in W$.
 \end{mdefinition}
 We note that for a fixed set $W$ the relation $\sim_W$ is a equivalence relation
 and that $W \subseteq V$ gives $x \sim_V y \Rightarrow x \sim_W y$. The next lemma
 gives sufficient conditions for a test suite of the given anatomy to be
 complete. We then prove that these conditions hold for the test suites in this
 paper.
 \begin{lemma}
 	Let $\Fam{W}$ and $\Fam{W'}$ be two families of words and $P$ a state cover
 	for $M$. Let $T = P \cdot I^{\leq k} \odot \Fam{W} \,\cup\, P \cdot I^{\leq
 		k+1} \odot \Fam{W'}$ be a test suite.
 	If
 	\vspace{-3mm}
 	\begin{itemize}
 		\item[1] for all $x, y \in M:$ $x \sim_{W_x \cap W_y} y$ implies $x \sim y$,
 		\item[2] for all $x, y \in M$ and $z \in M'$: $x \sim_{W_x} z$ and $z
 		\sim_{W'_y} y$ implies $x \sim y$ and
 		\item[3] the machines $M$ and $M'$ agree on $T$,
 	\end{itemize}\vspace{-3mm}
 	then $M$ and $M'$ are equivalent.
 \end{lemma}
 \begin{proof}
 	First, we prove that $P \cdot I^{\leq k}$ reaches all states in $M'$.
 	For	$p, q \in P$ and $x = \delta(s_0, p), y = \delta(s_0, q)$ such that $x
 	\not\sim_{W_x \cap W_y} y$, we also have $\delta'(m'_0, p) \not\sim_{W_x
 		\cap W_y} \delta'(m'_0, q)$ in the implementation $M'$.
 	By (1) this means that there
 	are at least $n$ different behaviours in $M'$, hence at least $n$ states.
 	Now $n$ states are reached by the previous argument (using the set $P$).
 	By assumption $M'$ has at most $k$ extra states and so we can reach all those extra states by using $I^{\leq k}$ after $P$.
 	Second, we show that the reached states are bisimilar.
 	Define the relation $R = \{(\delta(q_0, p), \delta'(q_0', p)) \mid p \in P \cdot I^{\leq k}\}$.
 	Note that for each $(s, i) \in R$ we have $s \sim_{W_s} i$.
 	For each state $i \in M'$ there is a state $s \in M$ such that $(s, i) \in R$, since we reach all states in both machines by $P \cdot I^{\leq k}$.
 	We will prove that this relation is in fact a bisimulation up-to equivalence.
 	For output, we note that $(s, i) \in R$ implies $\lambda(s, a) = \lambda'(i, a)$ for all $a$,
 	since the machines agree on $P \cdot I^{\leq k+1}$.
 	For the successors, let $(s, i) \in R$ and $a \in I$ and consider the successors $s_2 = \delta(s, a)$ and $i_2 = \delta'(i, a)$.	
 	We know that there is some
 	$t \in M$ with $(t, i_2) \in R$. We also know that we
 	tested $i_2$ with the set $W'_t$. So we have:
 	$$ s_2 \sim_{W'_{s_2}} i_2 \sim_{W_t} t. $$ By the
 	second assumption, we conclude that $s_2 \sim t$.
 	So $s_2 \sim t$ and $(t, i) \in R$, which means that $R$ is a bisimulation up-to equivalence.
 	Using the theory of up-to techniques \cite{RotUpTo} we know that there exists a bisimulation relation containing $R$.
 	In particular $q_0$ and $q'_0$ are bisimilar.
 	And so the machines $M$ and $M'$ are equivalent.
 	\qed
 \end{proof}
 Before we show that the conditions hold for the test methods described in this paper, we reflect on the above proof first.
 This proof is very similar to the completeness proof in \cite{Chow}.
 (In fact, it is also similar to Lemma 4 in \cite{Angluin} which proves termination in the L* learning algorithm. This correspondence was noted in \cite{BergCorrespondence}.)
 In the first part we argue that all states are visited by using some sort of counting and reachability argument.
 Then in the second part we show the actual equivalence.
 To the best of the authors knowledge, this is first $m$-completeness proof which explicitly uses the concept of a bisimulation.
 Using a bisimulation allows us to slightly generalize and use bisimulation up-to equivalence, dropping the the often-assumed requirement that $M$ is minimal.
 \begin{lemma}
 	Let $\Fam{W'}$ be a family of state identifiers for $M$. Define the family
 	$\Fam{W}$ by $W_s = \bigcup \Fam{W'}$. Then the conditions (1) and (2) in the
 	previous lemma are satisfied.
 \end{lemma}
 \begin{proof}
 	The first condition we note that $W_x \cap W_y = W_x = W_y$, and so $x
 	\sim_{W_x \cap W_y} y$ implies $x \sim_{W_x} y$, now by definition of state
 	identifier we get $x \sim y$.
 	For the second condition, let $x \sim_{\bigcup \Fam{W'}} z \sim_{W'_y} y$.
 	Then we note that $W'_y \subseteq \bigcup{W'}$ and so we get $x \sim_{W'_y} z
 	\sim_{W'_y} y$. By transitivity we get $x \sim_{W'_y} y$ and so by definition
 	of state identifier we get $x \sim y$.
 	\qed
 \end{proof}
 \begin{corollary}
 	The W, Wp, UIOv and hybrid UIOv test suites are $n+k$-complete.
 \end{corollary}
 \begin{lemma}
 	Let $\Fam{H}$ be a separating family and take $\Fam{W} = \Fam{W'} = \Fam{H}$.
 	Then the conditions (1) and (2) in the previous lemma are satisfied.
 \end{lemma}
 \begin{proof}
 	Let $x \sim_{H_x \cap H_y} y$, then by definition of separating family $x \sim
 	y$.
 	For the second condition, let $x \sim_{H_x} z \sim_{H_y} y$. Then we get $x
 	\sim_{H_x \cap H_y} z \sim_{H_x \cap H_y} y$ and so by transitivity $x
 	\sim_{H_x \cap H_y} y$, hence again $x \sim y$.
 	\qed
 \end{proof}
 \begin{corollary}
 	The HSI, ADS and hybrid ADS test suites are $n+k$-complete.
 \end{corollary}
 \section{Related Work}
 \todo{Opnieuw lezen, want verouderd}
 Comparison of test methods already appeared in the recent papers
 \cite{Dorofeeva} and \cite{Endo}. Their work is mainly evaluated on randomly
 generated Mealy machines. We continue their work by evaluating on many
 specifications from industry. In addition, we evaluate in the context of
 learning, which has been lacking so far.
 Many of the methods describe here are benchmarked on small Mealy machines in \cite{Dorofeeva} and \cite{Endo}.
 Additionally, they included the P, H an SPY methods.
 Unfortunately, the P and H methods do not fit naturally in the overview presented in Section~\ref{sec:overview}.
 The P method is not able to test for extra states, making it less usable.
 And the H method 
 The SPY method builds upon the HSI-method and changes the
 prefixes in order to minimize the size of a test suite, exploiting overlap in test sequences.
 We believe that this technique is independent of the HSI-method and can in fact be applied to all methods presented in this paper.
 As such, the SPY method should be considered as an optimization technique, orthogonal to the work in this paper.
 More recently, a novel test method was devised which uses incomplete distinguishing sequences \cite{Hierons}.
 They use sequences which can be considered to be adaptive distinguishing sequences on a subset of the state space.
 With several of those one can cover the whole state space, obtaining a $m$-complete test suite.
 This is a bit dual to our approach, as our "incomplete" adaptive distinguishing sequences define a course partition of the state space.
 Our method becomes complete by refining the tests with pairwise separating sequences.
 Some work is put into minimizing the adaptive distinguishing sequences themselves.
 In \cite{TY14} the authors describe greedy algorithms which construct small adaptive distinguishing sequences.
 \todo{We expect that similar
 	heuristics also exist for the other test methods and that it will improve the
 	performance.}
 However, they show that finding the minimal adaptive distinguishing sequence is 
 NP-complete, even approximation is NP-complete.
 We would like to incorporate their greedy algorithms in our implementation.
 \section{Applications}
 \label{sec:applications}
 The presented test generation methods is implemented and used in a couple of applications.
 The implementation can be found on \url{https://gitlab.science.ru.nl/moerman/Yannakakis}.
 This implementations has been used in several model learning applications: learning embedded controller software \cite{Smeenk}, learning the TCP protocol \cite{FiteruauTCP} and learning the MQTT protocol \cite{TapplerMQTT}.
 \todo{Enkele resultaten bespreken, test-suite-groottes vergelijken}
 \todo{Future work? Meer benchmarks? Andere automaat-modellen?}
 \bibliographystyle{splncs03}
 \bibliography{references}
 % \subsection{Known algorithms}
 % We expose the well-known classical algorithms to find state identifiers here.
 % \begin{algorithm}[h]
 %     \DontPrintSemicolon
 %     \KwIn{A Mealy machine $M$}
 %     \KwResult{A complete splitting tree $T$}
 %     initialize with single node\;
 %     \While{true}{
 %       \ForEach{symbol $a \in I$ and leaf $u \in T$}{
 %         \If{if $\lambda(l(u), a)$ hits two or more outputs}{
 %           split $u$ accordingly\;
 %           store $a$ as a witness in $\sigma(u)$\;
 %         } \ElseIf{$\delta(l(u), a)$ lands in labels of different leaves}{
 %           $v \leftarrow lca(\delta(l(u), a))$\;
 %           split $u$ similar to $v$\;
 %           store $a \sigma(v)$ as witness in $\sigma(u)$\;
 %         }
 %       }
 %       \If{no split has been made}{
 %         \Return{constructed tree}
 %       }
 %     }
 %     \caption{Moore's minimization algorithm adopted to record witnesses. When
 %     implemented efficiently it runs in $\bigO(pn^2)$. A more involved algorithm
 %     based on ideas of Hopcroft exists and runs in $\bigO(pn \log n)$.}
 %     \label{moore}
 % \end{algorithm}
 % \begin{algorithm}[h]
 %     \DontPrintSemicolon
 %     \KwIn{A Mealy machine $M$}
 %     \KwResult{A (in)complete valid splitting tree $T$}
 %     initialize $T$ with single node\;
 %     \While{true}{
 %       \ForEach{symbol $a \in I$ and leaf $u \in T$}{
 %         \If{if $\lambda(l(u), a)$ hits two or more outputs and $\delta \times \lambda(-, a)$ is injective on $l(u)$}{
 %           split $u$ accordingly\;
 %           store $a$ as a witness in $\sigma(u)$\;
 %         } \ElseIf{$\delta(l(u), a)$ lands in labels of different leaves and $\delta \times \lambda(-, a)$ is injective on $l(u)$}{
 %           $v \leftarrow lca(\delta(l(u), a))$\;
 %           split $u$ similar to $v$\;
 %           store $a \sigma(v)$ as witness in $\sigma(u)$\;
 %         }
 %       }
 %       \If{no split has been made}{
 %         \Return{constructed tree}
 %       }
 %     }
 %     \caption{(Simplified) algorithm by Lee and Yannakakis. Similarly to
 %     algorithm~\ref{moore} this creates a splitting tree, but additionally only
 %     allows \emph{valid} splits.}
 %     \label{lee-yannakakis}
 % \end{algorithm}
 \end{document}
--- a/environment.tex
+++ b/environment.tex
@ -0,0 +1,46 @@
 \startenvironment environment
 % Bug in context? Zonder de ../ kunnen componenten de files niet vinden
 \usepath[{environment,../environment}]
 \environment font
 \environment pdf
 \environment paper
 \mainlanguage[en]
 \setuppagenumbering[alternative=doublesided]
 \defineenumeration[definition][text=Definition]
 \defineenumeration[example][text=Example]
 \defineenumeration[lemma][text=Lemma]
 \defineenumeration[fact][text=Fact?] % niet nodig?
 \setupenumeration[definition,example,lemma,fact][alternative=serried,width=fit,right=.]
 \definefloat[algorithm][algorithms][figure]
 \setuplabeltext[en][algorithm=Algorithm ]
 \setupcombinedlist[content][list={chapter,section}]
 % The black dot is missing from Euler?
 \setupitemize[2,joinedup,nowhite,after]
 % Standaard \colon had niet veel ruimte
 \define\colon{{\,{:}\,\,}}
 \define[1]\defn{\emph{#1}}
 \define[1]\todo{\color[red]{\ss \bf #1}}
 \usemodule[tikz]
 \usetikzlibrary[automata, arrows, positioning]
 \usemodule[algorithmic]
 % Debugging
 %\enabletrackers[references.references, visualizers.justification]
 %\showmakeup
 %\showlayout
 %\showboxes
 %\showframe
 \stopenvironment
--- a/environment/font.tex
+++ b/environment/font.tex
@ -0,0 +1,26 @@
 \startenvironment font
 % Punctuatie mag van de regel afvallen.
 % Old-style getallen
 \definefontfeature[default][default][expansion=quality,protrusion=quality,onum=yes]
 \setupalign[hz,hanging]
 % Standaard gebruiken we Pagella (een Open Source versie van Palatino)
 \definefontfamily[mainfamily] [rm] [TeX Gyre Pagella]
 % Voor sans-serif en teletype gebruiken we de Latin Modern fonts
 % Het teletype font is iets groter gemaakt
 \definefontfamily[mainfamily] [ss] [Latin Modern Sans]
 \definefontfamily[mainfamily] [tt] [Latin Modern Mono][rscale=1.05]
 % (Neo) Euler heeft geen mathbb symbolen, dus die halen we uit Pagella
 \definefallbackfamily[mainfamily] [mm] [TeX Gyre Pagella Math][range=uppercasedoublestruck]
 % Eventueel kunnen we mathcal vervangen door een ander font
 %\definefallbackfamily[myfamily] [mm] [TeX Gyre Pagella Math][range=uppercasescript,rscale=1.05]
 % (Neo) Euler heeft niet alle symbolen, maar Pagella wel
 \definefallbackfamily[mainfamily] [mm] [TeX Gyre Pagella Math][range=0x0266D-0x0266F,rscale=0.95] % sharp and flat
 % Neo Euler voor wiskunde. Merk op dat dit niet cursief is!
 \definefontfamily[mainfamily] [mm] [Neo Euler]
 % Gebruik het font
 \setupbodyfont[mainfamily,10pt]
 \stopenvironment
--- a/environment/paper.tex
+++ b/environment/paper.tex
@ -0,0 +1,13 @@
 \startenvironment paper
 % Proefschrift formaat is 17x24cm
 % We doen aan beide zijden 5mm afloop, zodat we plaatjes vol kunnen printen
 \definepapersize[proefschrift+rand][width=180mm, height=250mm]
 \definepapersize[proefschrift][width=170mm, height=240mm]
 \setuppapersize[proefschrift][proefschrift+rand]
 \setuplayout[location={middle,middle}]
 % Snijranden
 \setuplayout[marking=on]
 \stopenvironment
--- a/environment/pdf.tex
+++ b/environment/pdf.tex
@ -0,0 +1,13 @@
 \startenvironment pdf
 % Synctex lijkt niet goed te werken voor TeXstudio.
 % Uitzetten voor uiteindelijke versie!
 %\setupsynctex[state=start]
 % Klikbare referenties
 \setupinteraction[state=start,focus=standard,contrastcolor=darkgreen]
 % Bookmarks in de pdf
 \placebookmarks[chapter,section]
 \stopenvironment
--- a/thesis.tex
+++ b/thesis.tex
@ -0,0 +1,4 @@
 \environment environment
 \startproject thesis
 \product main
 \stopproject