@InCollection{barett2009smt,
  author    = {Clark W. Barrett and Roberto Sebastiani and Sanjit A. Seshia and Cesare Tinelli},
  title     = {Satisfiability Modulo Theories},
  booktitle = {Handbook of Satisfiability},
  editor    = {Armin Biere and Marijn Heule and Hans van Maaren and Toby Walsh},
  series    = {Frontiers in Artificial Intelligence and Applications},
  volume    = {185},
  pages     = {825--885},
  publisher = {{IOS} Press},
  year      = {2009},
  doi       = {10.3233/978-1-58603-929-5-825},
}
@InProceedings{abraham17smt,
  author    = {Erika {\'{A}}brah{\'{a}}m and Gereon Kremer},
  title     = {{SMT} Solving for Arithmetic Theories: Theory and Tool Support},
  booktitle = {19th International Symposium on Symbolic and Numeric Algorithms for Scientific Computing, {SYNASC} 2017, Timisoara, Romania, September 21-24, 2017},
  editor    = {Tudor Jebelean and Viorel Negru and Dana Petcu and Daniela Zaharie and Tetsuo Ida and Stephen M. Watt},
  pages     = {1--8},
  publisher = {{IEEE} Computer Society},
  year      = {2017},
  doi       = {10.1109/SYNASC.2017.00009},
}
@Book{walsh2009smt,
  editor    = {Armin Biere and Marijn Heule and Hans van Maaren and Toby Walsh},
  title     = {Handbook of Satisfiability},
  series    = {Frontiers in Artificial Intelligence and Applications},
  volume    = {185},
  publisher = {{IOS} Press},
  year      = {2009},
  isbn      = {978-1-58603-929-5},
}
This chapter will present the implementation of a (sub-\gls{scc})
partial evaluation with property-based abstraction as described in
\Cref{ch:theory}. In \Sref{sec:implementation} the implementation is described
using pseudo-code, focusing mostly on the practical aspects of the partial
evaluation. \Sref{sec:abstract_domain} will briefly discuss the possible
abstract domains and the reasons why we decided to use the abstract domain of
polyhedra. \Sref{sec:loop_heads} will focus on the different possibilities to
mark locations for abstraction, which is part of the history-dependent aspect of
the abstraction layer. \Sref{sec:technical_details} describes the technical
details of the actual implementation, and finally in \Sref{sec:performance} the
correctness and performance of the implementation are evaluated in practice.
Recall Definition \ref{def:runtime_complexity}; the newly constructed refined
program shall have the same worst-case expected runtime complexity as the
original program, which implies that runtime-complexity bounds found for the
refined program are also runtime-complexity bounds for the original one.
Naturally, the control-flow refinement will add and replace transitions and
locations, hence the number of transitions changes and the runtime-complexity
bounds and size-bounds of individual transitions won't be preserved. However,
we will show that each transition in the refined program is similar to a
transition in the original program, and hence every run in the new program has
a run of equal runtime-complexity and probability in the original program and
vice-versa.
are temporary variables whose values are reassigned by the scheduler on
every transition and whose values do not propagate throughout the program.
complex, since the transition taken depends not only on the current state
but also on the answer of the scheduler and some random event. Instead of
simulating every single possible run, it would be nice to treat \emph{similar}
states as one. Instead of simulating a single assignment, the set of all
possible assignments after a step is described by a set of constraints. The
resulting graph is called a \emph{partial evaluation graph}. It was formalized
for \glspl{chc} by \citeauthor{gallagher2019eptcs}~\cite{gallagher2019eptcs}. In
this thesis the definitions are adapted to better fit the algorithm and the
formalism of \Glspl{pip}.
For any $\varphi\in\Phi_\PV$ and probabilistic update $\eta : \PV
\rightarrow \Z[\V\union\D]$, if there exists an $s\in \Sigma$ with
$s\models\varphi$ then
\textit{true}\rangle$ is evaluated over the transition $t_1$, resulting in a
single new version $\langle \ell_1, \textit{true}\rangle$. The second
unfolding step unfolds $\langle \ell_1, \textit{true}\rangle$ over $t_3$
to $\langle \ell_2, x\leq0\rangle$ and over $t_2$ to $\langle \ell_1,
x>1\rangle$. $\ell_2$ has no outgoing transitions, hence no new transitions
are added. $\langle \ell_1, x>1\rangle$ is unfolded to $\langle \ell_1,
x>2\rangle$ over $t_2$ only, since the guard of $t_3$ is not satisfiable from
within $x > 2$. The new version is unfolded over and over again, without the
evaluation ever terminating.
\textbf{Claim:} Let $s_0\in\Sigma$ be an initial state. For any finite
prefix $\varf \in \fpath_{\Prog'}$ with probability $\prSs(\varf) > 0$ there
exists a finite prefix $\varf' \in \fpath_\Prog$ for which
\textbf{Induction start:} $n = 0$. Let $\varf_0 = c_0 = (\langle \ell_0,
\texttt{true}\rangle, t_\text{in}, s_0) \in \fpath_{\Prog'}$, which is the only
initial configuration with non-zero probability, $\prSs(\varf_0) = 1$. We set
$\varf'_0 = c'_0 = (\ell_0, t_\text{in}, s_0)$, which is by construction the
only valid starting configuration of $\Prog$ with $\prSns(c'_0) = 1$.
\textbf{Induction step.} Assume the induction hypothesis holds for $n \in
\N$. Let $\varf_{n+1} = c_0\dots{}c_nc_{n+1} \in \fpath_{\Prog'}$ of length
$n+1$ be an admissible prefix with probability $\prSs(\varf_{n+1})>0$; then
$\varf_n = c_0\dots{}c_{n}$ of length $n$ is also admissible with
$\prSs(\varf_n) > 0$. By the induction hypothesis a finite prefix $\varf'_n =
c'_0\dots{}c'_{n} \in \fpath_{\Prog}$ of equal length and probability exists.
Let $c_{n+1} = (\langle \ell_{n+1}, \_ \rangle, t_{n+1}, s_{n+1}) \in
\confs_{\Prog'}$. We set $c'_{n+1} = (\ell_{n+1}, \bar{t}_{n+1}, s_{n+1})$
to be the last configuration of $\varf'_{n+1}=\varf'_{n}c'_{n+1}$.
Let $g\in\GTPn$ and $\tilde{s} \in \Sigma$ be the general transition and
assignment selected by the scheduler, $\scheduler(c_{n+1}) = (g, \tilde{s})$.
\todo{continue here}
$s_0 \in \Sigma$ be any initial state. For any scheduler $\scheduler \in
\MDS_{\Prog'}$ let $\scheduler' \in \HDS_\Prog$ be constructed as in
Lemma \ref{lem:constrfgeq}. Analogously to Theorem \ref{thm:correctness}
the following relation holds:
It is known from the literature that for a Markov decision process the
best expected reward using a history-dependent scheduler is the same as
the best expected reward using a Markovian scheduler (see Theorem
\todo{cite literature} of \cite{puterman1994markov}). Since a run of the
program is equivalent to a Markov decision process, as explained in
\Sref{ssec:markov}, the theorem applies to the expected value and the
following holds:
By Lemma \ref{lem:hdsvsmds} the fact that we used a history-dependent
scheduler does not matter in the supremum, and we get
\begin{align*}
    \sup_{\scheduler \in \MDS}(\ESs(\Rt(\Prog)))
    \leq \sup_{\scheduler' \in \HDS}(\ESns(\Rt(\Prog')))
    = \sup_{\scheduler' \in \MDS}(\ESns(\Rt(\Prog')))
\end{align*}
new program has a worst-case expected runtime and worst-case expected cost at
least as large as the original program. Hence any bounds found for the partial
evaluation are bounds for the original program. Theorem \ref{thm:thightness}
additionally showed that the worst-case expected runtime and costs do not get
worse with a partial evaluation.
\section{Abstraction}\label{sec:abstraction}
In \Sref{sec:partialevaluation} the abstraction was represented by an oracle
that decides whether a location is abstracted, and by an abstraction function.
It was
stated that Algorithm \ref{alg:evaluate_abstr} terminates if
\begin{enumerate}
\item the oracle eventually always decides to abstract, and
\item the abstraction has only a finite number of results.
\end{enumerate}
First, we will discuss some possibilities to select the locations for
abstraction, and then adapt the property-based abstraction developed by
\citeauthor{Domenech19}~\cite{Domenech19} to probabilistic integer programs.
\begin{comment}
pro: easy to implement
contra:
- abstractions are expensive
- abstractions loose precision
open question:
- how to select properties
\end{comment}
The naive approach would be to abstract at every location, as is also stated in
\cite{Domenech19}. The advantage is that the implementation is easy, the
evaluated program is small, and its analysis fast. The disadvantage is
that a lot of evaluated control-flow is lost to the abstraction.
The property-based abstraction presented in
\Sref{sec:propertybasedabstraction} requires $\Theta(n)$ calls to the
\gls{smt}-solver, which is computationally expensive.
The primary goal of the presented \gls{cfr} via partial evaluation is to
evaluate overlapping loops in order to find smaller loops that are easier to
analyse. In practice, the control-flow of a loop is mostly linear. Abstracting
the linear control-flow could lead to two linear sections merging that would
otherwise stay separated, thus making it harder to find tight bounds.
\section{History Dependent Abstractions}
In \Sref{sec:partialevaluation} the details of the abstraction were omitted.
Recall Definition \ref{def:abstraction}: an abstraction space must be finite
and the abstraction function must map any constraint to elements of this
abstraction space. The abstraction function was kept simple in order to keep
the proofs concise. In this section we will expand it to take the evaluation
history into account, which opens the possibility for more refined abstraction
functions.
\subsection{Abstract Loop heads}\label{ssec:loopheads}
The second method proposed by \cite{Domenech19} is to abstract only on loop
heads. Termination is guaranteed since every infinite path in the evaluation
tree of a finite program must visit some loop (and in turn its head) an infinite
number of times. With only finitely many possible abstraction results, it is
guaranteed that at some point some version repeats and the evaluation
backtracks.
The exact definition of a loop head is unclear. For the sake of termination it
is sufficient that every loop contains at least one (but otherwise arbitrary)
location marked for abstraction. The intuitive candidate is the head of the
loop, since that is the location where most branching will probably occur in
practice. For this implementation a loop head is defined as follows.
\begin{definition}
    A \textit{loop head} of a loop is the first location of the loop encountered
    during a depth-first search starting at the start location $l_0$.
\end{definition}
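This definition can be computed directly on top of a \gls{dfs}. The following
Python sketch illustrates the idea on a toy adjacency-map encoding of the
program graph (the function name and the graph representation are illustrative,
not part of the actual implementation): every target of a back edge, i.e.\ a
successor that still lies on the current \gls{dfs} path, is the first location
of its loop encountered by the search and is therefore a loop head.
\begin{verbatim}
# Sketch: compute loop heads by a depth-first search from the start
# location. A loop head is the target of a back edge, i.e. the first
# location of its loop encountered by the DFS.
def loop_heads(graph, start):
    heads, visited, on_path = set(), set(), set()

    def dfs(loc):
        visited.add(loc)
        on_path.add(loc)
        for succ in graph.get(loc, []):
            if succ in on_path:        # back edge: succ heads a loop
                heads.add(succ)
            elif succ not in visited:
                dfs(succ)
        on_path.remove(loc)

    dfs(start)
    return heads

# l0 -> l1 -> l2 -> l1 yields the loop head l1.
print(loop_heads({"l0": ["l1"], "l1": ["l2"], "l2": ["l1"]}, "l0"))
\end{verbatim}
Since every cycle of a directed graph contains at least one back edge with
respect to a \gls{dfs} forest, the computed set cuts every loop reachable from
the start location.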
\begin{definition}[History dependent abstraction]
A history dependent abstraction function is a function $\G \rightarrow$
When selecting all loop heads for abstraction, multiple loop heads can end up on
a single loop, for example when the loop overlaps with another loop.
\begin{figure}
\input{figures/ch3_loopheads}
\end{figure}
In this section we will present the property-based abstraction
proposed by \citeauthor{Domenech19}~\cite{Domenech19} for non-probabilistic
programs, which can be applied equally to probabilistic programs.
(see examples \todo{} \todo{}). Detecting loops and loop heads is further
discussed in \Sref{sec:findingloops}.
\todo{prove upper bound on program size}
% Since
% In addition control-flow refinement was introduced to unroll loops in such a way
% that hidden branching is resolved. The heuristics presented by \cite{Domenech19}
% are focused on loops and the properties are selected for the loop heads.
% Constructing a heuristic, that is well suited for every location in the program
% is not obvious is left open for future work.
\subsection{Property based abstraction}\label{sec:propertybasedabstraction}
\section{Sub-\gls{scc} level evaluation}
For the sub-\gls{scc} level evaluation, Algorithm \ref{alg:abstr_eval} is
adapted to stop the evaluation when leaving the component, and instead of
starting only from the initial location of the program, the evaluation starts
from every entry transition of the component. The exact algorithm is displayed
in Algorithm \ref{alg:eval_component}.
\begin{algorithm}
\caption{Evaluate the component $S \subseteq \T_\Prog$ of a program $\Prog$
with abstraction $\alpha$.\label{alg:eval_component}}
\KwData{A \gls{pip} $\Prog$, a component $S$, an abstraction function $\alpha$,
and an abstraction oracle $S_\alpha$}
\KwResult{A graph $G$ and a set of pairwise disjoint general transitions
$\GTG \subset \GT_\text{PE}$}
$\ET \leftarrow \Tout(S)$\;
\SetKwFunction{FEvaluate}{evaluate}
\SetKwProg{Fn}{Function}{:}{end}
$\GTG \leftarrow \emptyset$\;
\Fn{\FEvaluate{$G$, $v$}}{
$\langle l, \varphi\rangle \leftarrow v$\;
\For{$g \in \GTout(l)$} {
$g' \leftarrow \unfold^*(v, g)$\label{alg:abstr_line_unfold_g}\;
\If {$g' \neq \emptyset$ \label{alg:abstr_line_filteremptyg}}{
    $g^* \leftarrow \emptyset$\;
    \For{$t = (v, p, \tau, \eta, \langle l', \varphi'\rangle) \in g'$}{
        \uIf {$\bar{t} \in \ET$} {
            $t \leftarrow (v, p, \tau, \eta, \langle l',
            \texttt{true}\rangle)$\label{alg:abstr_line_exit}\;
        }
        \ElseIf {$S_\alpha(l')$} {
            $t \leftarrow (v, p, \tau, \eta, \langle l',
            \alpha(\varphi')\rangle)$\label{alg:abstr_line_abstraction}\;
        }
        $g^* \leftarrow g^* + t$\label{alg:abstr_line_addtog}\;
        $v' \leftarrow \text{target version of } t$\;
        \uIf {$v' \not\in V(G)$}{
$G \leftarrow G + t$\;
$G \leftarrow \FEvaluate{G, v'}$\;
}
\Else {
$G \leftarrow G + t$\label{alg:abstr_line_addt}\;
}
}
$\GTG \leftarrow \GTG + g^*$\;
}
}
\Return {$G$}
}
$G \leftarrow \text{ graph of }\Prog \text{ lifted to a partial evaluation
graph}$\;
$G \leftarrow G - S$\;
\For{$t = (\ell, p, \tau, \eta, \ell')\in \Tin(S)$} {
$G \leftarrow \FEvaluate{G, \langle\ell',\texttt{true}\rangle}$\;
}
\Return $G, \GTG$
\end{algorithm}
The partial evaluation starts at every entry transition in $\Tin(S)$ of the
component. When unfolding an exit transition $t = (\ell, \_, \_, \_, \ell') \in
\Tout(S)$ from a version $\langle \ell, \varphi\rangle$ to $\unfold^*(\langle
\ell, \varphi\rangle, t) = \{(\langle\ell,\varphi\rangle, \_, \_, \_,
\langle\ell', \varphi'\rangle)\}$, the target version
$\langle\ell',\varphi'\rangle$ is replaced by the trivial overapproximation
$\langle\ell',\texttt{true}\rangle$. This target is always already contained in
the graph by construction, and the algorithm always backtracks.
Let us consider an admissible finite prefix $f \in \fpath_\Prog$ with
\begin{align*}
    f &= c_0c_1\dots{}c_k\dots{}c_m\dots{}c_n \\
      &=
      (\ell_0,t_\text{in},s_0)(\ell_1,t_1,s_1)\dots\underbrace{(\ell_k,t_k,s_k)\dots(\ell_m,t_m,s_m)}_{t_k,\dots,t_m
      \in S}\dots(\ell_n,t_n,s_n)
\end{align*}
with an infix $c_k\dots{}c_m$ going through the component.
By construction, up to the component the partial evaluation $\Prog'$ contains a
similar prefix
$(\langle\ell_0,\texttt{true}\rangle,t_\text{in},s_0)(\langle\ell_1,\texttt{true}\rangle,t_1,s_1)\dots(\langle\ell_{k-1},\texttt{true}\rangle,t_{k-1},s_{k-1})
\in \fpath_{\Prog'}$. The component is entered via the transition
$t_k\in\Tin(S)$, whose target was used as a starting point for the evaluation
with the trivial version $\langle\ell_k, \texttt{true}\rangle$ in Algorithm
\ref{alg:eval_component}. By the same argument as in Theorem
\ref{thm:correctness}, a similar prefix
$(\langle\ell_0,\texttt{true}\rangle,t_\text{in},s_0)\dots(\langle\ell_{k-1},\texttt{true}\rangle,t_{k-1},s_{k-1})(\langle\ell_k,\texttt{true}\rangle,t_k,s_k)(\langle\ell_{k+1},\varphi_{k+1}\rangle,t_{k+1},s_{k+1})\dots(\langle\ell_{m-1},\varphi_{m-1}\rangle,t_{m-1},s_{m-1})
\in \fpath_{\Prog'}$ through the component exists in $\Prog'$. The exit
transition $t_m$ must have been unfolded before backtracking, and afterwards the
similar prefix trivially follows the lifted versions again, yielding a similar
finite prefix
$(\langle\ell_0,\texttt{true}\rangle,t_\text{in},s_0)\dots(\langle\ell_k,\texttt{true}\rangle,t_k,s_k)\dots(\langle\ell_{m-1},\varphi_{m-1}\rangle,t_{m-1},s_{m-1})(\langle\ell_m,\texttt{true}\rangle,t_m,s_m)\dots(\langle\ell_n,\texttt{true}\rangle,t_n,s_n)
\in \fpath_{\Prog'}$.
For any finite prefix visiting the component more than once, the similar finite
prefix of $\Prog'$ is found by applying the argument above repeatedly.
\section{Foo}
\citeauthor{Domenech19} described the property-based abstraction that will be
used for this algorithm. The idea is the following: the solution space is cut
into pieces by a finite number of constraints, the so-called properties. The
abstraction then computes which of those properties are entailed by the
version; those are chosen as the abstract representation of the version. Since
the number of properties is finite, the powerset of properties is finite as
well, and hence so is the number of all possible abstraction results.
In addition, the abstraction itself is entailed by the version. \todo{rewrite}
\begin{definition}[Property based abstraction]
\todo{definition}
\end{definition}
\begin{example}
\todo{picture with properties}
\end{example}
Every abstraction is expensive, because it requires numerous entailment checks
(one per property), each of which requires a call to an \gls{smt}-solver. The
more properties are chosen, the more precise the abstraction gets, but the more
expensive the computation is. Choosing the relevant properties for the loops is
done heuristically and will be described in the following subsection.
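As an illustration of these entailment checks, the following Python sketch
implements the core of such an abstraction with the Z3 bindings (the function
name and the concrete properties are made-up examples, not the heuristic
actually used): a property $p$ is entailed by the version's constraint
$\varphi$ if and only if $\varphi \wedge \neg p$ is unsatisfiable.
\begin{verbatim}
# Sketch: property-based abstraction via entailment checks.
# phi |= p holds iff (phi and not p) is unsatisfiable.
from z3 import Int, Solver, And, Not, unsat

def abstract(phi, properties):
    entailed = []
    for p in properties:
        s = Solver()
        s.add(And(phi, Not(p)))
        if s.check() == unsat:   # phi entails p
            entailed.append(p)
    return entailed

x = Int("x")
# The constraint x > 5 entails x > 0 but not x < 10,
# so the abstraction is [x > 0].
print(abstract(x > 5, [x > 0, x < 10]))
\end{verbatim}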
\begin{comment}
motivate abstraction
\end{comment}
Selecting the locations to abstract is crucial for a good implementation. From
a theoretical point of view, the selection of locations to abstract guarantees
termination of the control-flow refinement. Yet, with every abstraction the
refined program loses precision compared to the evaluated control-flow. In the
following, some different possibilities to select the locations marked for
abstraction are discussed.
Finding loops in a program can be done with the algorithm described by
\citeauthor{johnson1975}~\cite{johnson1975}.
The algorithm is linear in the number of nodes and edges per cycle,
specifically $\mathcal{O}((n+e)(c+1))$ for $n$ nodes, $e$ edges, and $c$
cycles.
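For illustration, an implementation of this algorithm is readily available,
e.g.\ in the networkx library; the following sketch enumerates the simple
cycles of a toy graph (the graph itself is a made-up example):
\begin{verbatim}
# Sketch: enumerate all simple cycles of a directed graph with
# networkx' implementation of Johnson's algorithm.
import networkx as nx

G = nx.DiGraph([("l0", "l1"), ("l1", "l2"), ("l2", "l1"), ("l1", "l1")])
print(list(nx.simple_cycles(G)))   # [['l1'], ['l1', 'l2']], order may vary
\end{verbatim}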
\subsection{Feedback Vertex Set}
As stated in the previous section \Sref{ssec:loopheads}, it is sufficient to
mark only a set of locations that covers every loop in the program. This
problem is one of Karp's 21 NP-complete problems and is commonly called
\textit{Feedback Vertex Set} or \textit{Feedback Node
Set}~\cite{karp2010reducibility}.
\begin{definition}[Feedback Vertex Set Problem]
Given a directed graph $G = (V, E)$. Find the minimal subset $R \subseteq
V$ such that every (directed) cycle of $G$ contains a node in $R$.
\end{definition}
Detecting all loops (cycles) in a graph can easily be done with a \gls{dfs} on
the graph, and the problem can instead be formulated as a hitting set problem.
\begin{definition}[Hitting Set Problem]
Given a finite collection of finite sets $S = \{S_1, \dots, S_n\}$, where
each $S_i \subseteq U$ for some set $U$, find a minimal set $R$ such that
$R\intersect S_i \neq \emptyset$ for all $i = 1,\dots,n$.
\end{definition}
A hitting set problem can be formulated as an \gls{ilp} and solved by readily
available \gls{smt} solvers. Recall Definition \ref{def:loop}. In the following,
the \gls{ilp} used for finding the feedback vertex set of an integer program
with locations $L$ and a total of $N$ loops $a_1, \dots, a_N$ is constructed. It
is assumed that all loops have been found by \gls{dfs} as described in
\Sref{sec:findingloops}.
\begin{mini!}
    {}{\sum_{i=1}^{n} x_i\label{eq:ilp-goal}}{\label{eq:ilp-vfs}}{}
    \addConstraint{\sum_{i : l_i \in \text{Loc}(a_j)} x_i}{\geq
    1,\protect\quad\label{eq:ilp-hit}}{j = 1,\dots,N}
    \addConstraint{ x_i }{\leq 1,\protect\quad\label{eq:ilp-leq1}}{i = 1,\dots,n}
    \addConstraint{ x_i }{\geq 0,\protect\quad\label{eq:ilp-geq0}}{i = 1,\dots,n}
\end{mini!}
Every variable $x_i$ of the \gls{ilp} \ref{eq:ilp-vfs} indicates whether the
location $l_i$ is part of the hitting set, in which case the variable is
assigned the value 1, or not, in which case it is assigned the value 0.
Constraint \ref{eq:ilp-hit} guarantees that every loop is \enquote{hit}, while
\ref{eq:ilp-leq1} and \ref{eq:ilp-geq0} enforce that every location is selected
at most once. The optimization goal \ref{eq:ilp-goal} is to select the fewest
locations possible.
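Since the \gls{ilp} is small, it can be handed directly to an optimizing
\gls{smt} solver. The following Python sketch solves it with Z3's optimization
engine (the location names and loops are a toy input, not taken from a real
program):
\begin{verbatim}
# Sketch: solve the feedback vertex set ILP with Z3. One 0/1 variable
# per location; every loop must contain at least one selected location.
from z3 import Int, Optimize, Sum, sat

def feedback_vertex_set(locations, loops):
    opt = Optimize()
    x = {l: Int("x_" + l) for l in locations}
    for v in x.values():
        opt.add(v >= 0, v <= 1)                  # 0/1 indicator variables
    for loop in loops:
        opt.add(Sum([x[l] for l in loop]) >= 1)  # every loop is hit
    opt.minimize(Sum(list(x.values())))          # select as few as possible
    assert opt.check() == sat
    m = opt.model()
    return {l for l in locations if m[x[l]].as_long() == 1}

# Two loops sharing l1: the minimal hitting set is {l1}.
print(feedback_vertex_set(["l0", "l1", "l2"], [["l0", "l1"], ["l1", "l2"]]))
\end{verbatim}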
Computing the feedback vertex set of a program is independent of the partial
evaluation and can be done a-priori. It is thus considered to be an offline
method. It shall be noted that the locations inside the feedback vertex set are
not necessarily the closest locations to the start location of a loop. For the
determination of properties it might be necessary to rotate the loop so that
the marked location is the first location of the loop.
\todo{prove upper bound on program size}
\subsection{$k$-encounters}
We have seen two offline strategies to select locations for abstraction. In the
following we will introduce a technique that decides on abstraction during the
partial evaluation, using the path evaluated so far. The technique counts the
number of occurrences on the path for every location in the program. Instead of
abstracting on the first re-occurrence, one can decide on a parameter $k \in
\N$ so that the first $k$ re-occurrences are ignored before abstracting.
The idea is to unroll loops up to a constant number of times, so that loops
with constant bounds $b \leq k$ are perfectly unrolled without any abstraction.
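A minimal Python sketch of such an oracle is given below (the class and method
names are illustrative only): it counts how often a location has already
occurred on the evaluated path and demands abstraction once this count exceeds
$k$.
\begin{verbatim}
# Sketch: an abstraction oracle that tolerates up to k re-occurrences
# of a location on the evaluated path before demanding abstraction.
from collections import Counter

class KEncounterOracle:
    def __init__(self, k):
        self.k = k

    def should_abstract(self, path, location):
        return Counter(path)[location] > self.k

oracle = KEncounterOracle(k=2)
path = ["l0", "l1", "l2", "l1", "l2"]
print(oracle.should_abstract(path, "l1"))           # False: unroll further
print(oracle.should_abstract(path + ["l1"], "l1"))  # True: seen 3 > k times
\end{verbatim}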
\begin{example}
\todo{example with constantly bounded loop}
\end{example}
\todo{formalize, prove upper bound on program size}
\begin{comment}
technique:
- online
- daikstra during traversal
pro:
- takes unreachable loops into account
- allows for some constantly bounded loops to be perfectly unrolled.
contra:
- hard to implement
- hard to reason on
\end{comment}
\section{Choosing properties}
\section{Correctness}
\begin{definition}[Equivalent \glspl{pip}]
Let $\Prog_1,\Prog_2$ be \glspl{pip} with $\Prog_1 = (\PV, \Loc_1, \GT_1,
l_0)$, $\Prog_2 = (\PV, \Loc_2, \GT_2, l_0)$ and their respective
probability spaces $(\runs_1, \F_1, \PrSs^1)$ and $(\runs_2, \F_2,
\PrSs^2)$. Besides, $\T_1$ and $\T_2$ denote the sets of transitions of
$\Prog_1$ and $\Prog_2$; $\fpath_1$ and $\fpath_2$ the sets of finite paths,
etc.
The two programs are equivalent if the two random variables $\Rt_{\Prog_1}$
and $\Rt_{\Prog_2}$ have the same probability distributions $\mu_1$ and
$\mu_2$ with $\mu_1(k) = \mu_2(k)$ for all $k\in\overline{\N}$.
\end{definition}
\begin{theorem}
If $\Prog_1$ and $\Prog_2$ are equivalent then:
\begin{enumerate}
\item A total runtime-complexity bound for $\Prog_1$ is a total
runtime-complexity bound for $\Prog_2$ and vice-versa.
\item $\Prog_1$ and $\Prog_2$ have the same expected runtime-complexity.
\end{enumerate}
\proof{
Let $\RB_P$
}
\end{theorem}
\begin{rem}
The probability distribution does not depend on the underlying probability
space, which makes it the perfect tool to compare the semantics of two
different \glspl{pip}.
\end{rem}
We will show that the partial evaluation $\Prog'$ is equivalent to the original
\gls{pip} $\Prog$. In a first step we will show that for every admissible finite
prefix in $\Prog$ there exists an admissible finite prefix in $\Prog'$ with the
same probability and intermediate states, and vice versa. This in turn will be
expanded to all admissible runs, and finally we will show that this implies an
equal probability distribution of the total runtime complexity.
\subsection{Considering probability}
During this section the fact that programs are probabilistic has been completely
ignored. One could wonder whether loops with zero probability should be removed
in order to further shrink the problem. We will assume that all transitions with
zero probability have been removed before the analysis. All other transitions
with a non-zero probability are walkable in theory and will be unfolded by the
partial evaluation. Hence they must be considered for the set of abstracted
locations in order to guarantee termination of the algorithm.
If a path results in zero probability, it can be filtered out during partial
evaluation.
\section{Partial evaluation of \gls{scc}s}
During \gls{koat}'s analysis the program is split into \glspl{scc}. Bounds are
found for every \gls{scc} separately and then composed into an overall bound at
the end. Control-flow refinement is only needed for those \glspl{scc} where the
analysis fails to find tight bounds. In practice, linear bounds are considered
tight enough, whereas polynomial and exponential complexity bounds warrant
refinement in the hope of finding tighter bounds.
The previously presented control-flow refinement starts at the start location
and evaluates the whole program. The size of the refined program is limited by
the number of locations and transitions as well as by the number of properties
at each abstracted location. The smaller the initial program, the faster the
control-flow refinement and the smaller the refined program and the subsequent
analysis. Instead of evaluating the whole program, one can adapt the partial
evaluation to only evaluate a single \gls{scc} of the program. This section
presents the partial evaluation of an \gls{scc}.
The result is again an entire program, where only the transitions of the
\gls{scc} and the incoming and outgoing transitions of this \gls{scc} are
changed.
In a first step the program is copied as a whole and lifted to a trivial
evaluation graph, and the \gls{scc} as well as all its incoming and outgoing
transitions are removed from the copy.
Then the evaluation algorithm is executed from every incoming transition, but
with the additional condition to backtrack after unfolding an exiting
transition. The new transitions are added to the copied evaluation graph, and
therefore progress made by earlier evaluations of entry transitions can be
reused, shortening the evaluation of subsequent incoming transitions.
Finally, the evaluation graph is converted back to a \gls{pip} and returned as
the result of the partial evaluation of the \gls{scc}.
\begin{figure}[h]
\centering
\begin{subcaptionblock}[t]{0.45\textwidth}
\centering
\input{figures/ch4_scc_prog}
\caption{The original program}
\end{subcaptionblock}
\begin{subcaptionblock}[t]{0.45\textwidth}
\centering
\input{figures/ch4_scc_lift}
\caption{Copy and lift}
\end{subcaptionblock}
\begin{subcaptionblock}[t]{0.45\textwidth}
\centering
\input{figures/ch4_scc_remove}
\caption{Remove SCC}
\end{subcaptionblock}
\begin{subcaptionblock}[t]{0.45\textwidth}
\centering
\input{figures/ch4_scc_evaluate}
\caption{Evaluate entry transitions}
\end{subcaptionblock}
\caption{Visualization of partial evaluation for a single
SCC\label{fig:evaluate_scc}}
\end{figure}
\begin{algorithm}
\caption{$\text{evaluate}_\text{SCC}(P, A)$}
\KwData{A \gls{pip} $P$, an \gls{scc} $A\subset\T$}
\KwResult{A graph $G$}
$\Tin \leftarrow \Tin(A)$\;
$\Tout \leftarrow \Tout(A)$\;
\SetKwFunction{FEvaluate}{evaluate}
\SetKwProg{Fn}{Function}{:}{end}
\Fn{\FEvaluate{$G$, $v$}}{
    $N \leftarrow \unfold^*(v)$\;
    \For{$t = (v, \tau, \eta, p, v') \in N $}{
        \uIf {$(l_v, \tau, \eta, p, l_{v'}) \in \Tout$}{
            $G \leftarrow G + (v, \tau, \eta, p, \langle l_{v'},
            \textit{true}\rangle)$\;
        }
        \uElseIf {$v' \not\in V(G)$}{
            $G \leftarrow G + t$\;
            $G \leftarrow \FEvaluate{G, v'}$\;
        }
        \Else {
            $G \leftarrow G + t$\;
        }
    }
    \Return {$G$}
}
$G \leftarrow \text{lift}(P - A - \Tin - \Tout)$\;
\For{$t_i = (l_i, \_, \_ ,\_, l_i') \in \Tin $}{
    $G \leftarrow \FEvaluate{G, \langle l_i, \textit{true}\rangle}$\;
}
\Return {$G$}\;
\end{algorithm}
\begin{comment}
- probability is not important for finding loops.
- all transitions that exist have a non-zero probability and hence are
walkable in theory
\end{comment}
\begin{comment}
termination:
- assume the abstraction method cuts every cycle
- only finitely many abstractions exist
-> the partial evaluation graph is finite
-> the algorithm terminates
correctness:
- the abstraction is entailed by the state
- every walkable path in the program/scc exists in the partial evaluation
graph
\end{comment}
\section{Refining invariants}
\begin{comment}
because we use polyhedra instead of boxes, our invariants are probably
tighter than the original ones. When this is the case replace them in the
program.
\end{comment}
\todo{continue here}
% Let $\mathfrak{I} = ()$ be the
% structure of
% \begin{definition}[Satisfiability\cite{barwise1977}]\label{def:satisfiability}
% For a domain $A$ and a structure $\mathfrak{A} = (A, +, \cdot, 0, 1, \leq,
% <, =)$ we say a variable assignment $\beta: \V \rightarrow A$
% \textit{satisfies} the formula $\phi \in \Phi$ when $\mathfrak{A}, \beta
% \models \psi$.
% $\phi $ is \textit{satisfiable} in when an assignment $\beta: \V \rightarrow
% A$ exists such that $\mathfrak{A},\beta\models \phi$. When $\phi$ is
% satisfiable for any assignments, one writes $\mathfrak{A} \models \phi$.
% \end{definition}
\gls{fo} logic~\cite{barwise1977}. In this thesis we will only consider integer
arithmetic. Let $\Zs := (\Z, +, \cdot, 0, 1, \leq, <, =)$ be the structure of
standard integer arithmetic. For an assignment $\beta : \V \rightarrow \Z$ and a
formula $\varphi \in \Phi$ we write $\beta \models \varphi$ instead of $\Zs,
\beta \models \varphi$. Finally, we define two trivial formulas $\texttt{true}
= (0 = 0)$ and $\texttt{false} = (1 < 0)$.
valid variable assignments for a given formula is part of \gls{smt} and is a
well-studied field in computer science~\cite{walsh2009smt,abraham17smt}. The
theories relevant for this thesis are quantifier-free linear integer
arithmetic (usually called \texttt{QF\_LIA}) and quantifier-free non-linear
integer arithmetic (\texttt{QF\_NIA}).
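For example, the satisfiability of a \texttt{QF\_LIA} formula can be checked
with an off-the-shelf \gls{smt} solver such as Z3; the following sketch uses
its Python bindings on a made-up formula:
\begin{verbatim}
# Sketch: a QF_LIA satisfiability query with Z3.
from z3 import Int, Solver, sat

x, y = Int("x"), Int("y")
s = Solver()
s.add(x > 0, x + y <= 3, y >= 1)
if s.check() == sat:
    print(s.model())   # a satisfying assignment, e.g. [x = 1, y = 1]
\end{verbatim}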
We use a graphical representation of programs, as shown in \fref{fig:ex_pip_nd}.
Updates assign a value to every program variable; however, for better
readability, the updates that preserve the value of a program variable are
omitted.
transition $t_2$ contains an update with a temporary variable $u$.
Furthermore, the temporary variable is constrained to $1 \leq u \leq 3$. The
value of the temporary variable $u$ in the state $s: \PV \rightarrow \Z$ is
ignored. It can have an arbitrary value for the purpose of fulfilling the
guard and when being used in the update. Whenever a temporary variable is
sampled to a value not satisfying the guard of $t_2$ and $x$ is larger than
zero, the program terminates.
For a \gls{pip} one can define a probability space $(\runs_\Prog, \F, \PrSs)$,
where the outcomes of the probabilistic experiment are the runs of the program.
Every distinct run is measurable, hence the $\sigma$-algebra $\F$ is defined to
contain every set $\{r\}$ for $r \in \runs_\Prog$. The probability
measure $\PrSs$ describes the probability of a given run for a scheduler
$\scheduler$ and the input given to the program as the initial state $s_0$.
Formally, the probability space is defined by a cylindrical construction
expanding the length of finite prefixes up to infinity. For the detailed
construction we refer to \citeauthor{meyer2021tacas}~\cite{meyer2021tacas}.
variable $x\in\PV$ is updated to the value $s'(x)$ by the probabilistic update
-- recall Definition \ref{def:prob_update} -- and third, that the temporary
variables are sampled by the scheduler to match the target state. The result is
the
% \begin{definition}[Reachability]\def{def:reachability}
% Let $l, l' \in \Loc_\Prog$, $t,t' \in \T_\Prog$, and $s,s' \in \Sigma$. A
% configuration $c' = (l', t', s')$ is \textit{reachable} from $c = (l, t, s)$
% if and only if there exists a scheduler $\scheduler \in \MDS$ and starting
% state $s_0 \in \Sigma$ for which $\prSs(c \rightarrow c') > 0$.
% \end{definition}
% Let $l, l' \in \Loc_\Prog$, $t,t' \in \T_\Prog$, $s,s' \in \Sigma$, and $c'
% = (l', t', s')$ as well as $c = (l, t, s)$. Let $t = (\_, \_, \tau, \eta,
% \_)$. When $c'$ is reachable from $c$ this implies:
% \begin{enumerate}
% \item $s \models \tau$
% \end{enumerate}
% \begin{proof}
% \end{proof}
% \begin{lemma}
% If a
% \end{lemma}
The primary goal of \gls{koat}'s analysis is to find bounds for the runtime
complexity.
$\Prog$ with locations $V(\S)= \set{\ell, \ell'}{(\ell, \_, \_, \_,\ell')
\in \S}$. A component $\S \subseteq \T_\Prog$ is \emph{strongly connected}
when for every two locations $\ell, \ell' \in V(\S)$ there exists a path
between the two using only the transitions from $\S$.
strongly related to the search for upper and lower runtime complexity bounds,
which is equally undecidable. Nevertheless, the answers to those questions are
very important in practice. For example, a compiler might want to warn the
programmer about faulty code sections that were not explicitly marked to run
indefinitely, or a developer could be required to prove the efficiency of their
program in critical scenarios. In an ever more complex world, with ever more
complex algorithms, the need for automatic tools arose. Even though the
question of runtime-complexity cannot be answered in the general case, many
tools were developed to automatically analyze the complexity of various
programming paradigms as tightly and as quickly as
possible~\cite{giesl2017aprove, montoya2014aplas, alarcon2017muterm,
irankfinder2018wst}.