.. _experimental-fused-rime-api-anchor:

-----------------------------------------------
Fused Radio Interferometer Measurement Equation
-----------------------------------------------

Radio Interferometer Measurement Equation
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

The Radio Interferometer Measurement Equation (RIME)
describes the response of an interferometer to a sky model.
As described in `A full-sky Jones formalism <rime_paper_i_>`_,
a RIME could be written as follows:

.. _fused-rime-equation-anchor:

.. math::

    V_{pq} = G_{p} \left(
        \sum_{s} E_{ps} L_{p} K_{ps}
        B_{s}
        K_{qs}^H L_{q}^H E_{qs}^H
        \right) G_{q}^H

where for antenna :math:`p` and :math:`q`, and source :math:`s`:

* :math:`G_{p}` represents direction-independent effects.
* :math:`E_{ps}` represents direction-dependent effects.
* :math:`L_{p}` represents the feed rotation.
* :math:`K_{ps}` represents the phase delay term.
* :math:`B_{s}` represents the brightness matrix.

The RIME is more formally described in the following four papers:

* `I. A full-sky Jones formalism <rime_paper_i_>`_
* `II. Calibration and direction-dependent effects <rime_paper_ii_>`_
* `III. Addressing direction-dependent effects in 21cm WSRT observations of 3C147 <rime_paper_iii_>`_
* `IV. A generalized tensor formalism <rime_paper_iv_>`_

.. _rime_paper_i: https://arxiv.org/abs/1101.1764
.. _rime_paper_ii: https://arxiv.org/abs/1101.1765
.. _rime_paper_iii: https://arxiv.org/abs/1101.1768
.. _rime_paper_iv: https://arxiv.org/abs/1106.0579


The Fused RIME
~~~~~~~~~~~~~~

The RIME poses a number of implementation challenges which
focus on flexibility, speed and ease of use.

Firstly, the RIME can be composed of many terms representing
various physical effects.
It is useful for scientist to be able to specify many different
terms in the above :ref:`Equation <fused-rime-equation-anchor>`,
for example.

Secondly, the computational complexity of the RIME `O(S x V)` where S
is the number of source and V is the number of visibilities.
This is computionationally expensive relative to degridding
strategies.

Thirdly, it should be as easy as possible to define the RIME,
but not at the cost of the previous two constraints.

The Fused RIME therefore implements a "RIME Compiler" using
`Numba <https://numba.pydata.org/_>`_ for speed, which compiles
a RIME Specification defined by a number of `Terms` into
a single, optimal unit of execution.

.. _experimental-fused-rime-example-anchor:

A Simple Example
~~~~~~~~~~~~~~~~

In the following example, we will define a simple RIME using the
Fused RIME API to define terms for computing:

1. the Phase Delay.
2. the Brightness Matrix.

The RIME Specification
++++++++++++++++++++++

The specification for this RIME is as follows:

.. code-block:: python

    rime_spec = RimeSpecification("(Kpq, Bpq): [I,Q,U,V] -> [XX,XY,YX,YY]",
                                  terms={"K": Phase},
                                  transformers=[LMTransformer])

``(Kpq, Bpq)`` specifies the onion including the Phase Delay and
Brightness more formally defined
:ref:`here <experimental-fused-rime-api-anchor>`, while the
the ``pq`` in both terms signifies that they are calculated per-baseline.
``[I,Q,U,V] -> [XX,XY,YX,YY]`` defines the stokes to correlation conversion
within the RIME and also identifies whether the RIME is handling linear
or circular feeds. :code:`terms={"K": Phase}` indicates that the
K term is implemented as a custom Phase term, described in the next section.
Finally, :code:`LMTransformer` is a Transformer that precomputes lm coordinates
for use by all terms.

Custom Phase Term
+++++++++++++++++

Within the RIME, each term is *sampled* at an individual
source, row and channel.

Therefore each term must provide a sampling function that will
provide the necessary data for multiplication within the RIME.
Consider the following Phase Term:

.. code-block:: python

    from africanus.experimental.rime.fused.terms.core import Term

    class Phase(Term):
        def sampler(self):
            def phase_sample(state, s, r, t, f1, f2, a1, a2, c):
                p = state.real_phase[s, r] * state.chan_freq[c]
                return np.cos(p) + np.sin(p)*1j

            return phase_sample

This may look simple: we compute the complex phase by multiplying
the real phase at each source and row by the channel frequency
and return the complex exponential of this value.

However, questions remain: What is the `state` object and how
do we know that the `real_phase` and `chan_freq` are members?
To answer this, we must define (and understand) a second method
defined on the `Phase` term, called `init_fields`.

.. code-block:: python

    import numba
    from africanus.experimental.rime.fused.terms.core import Term

    class Phase(Term)
        def init_fields(self, typingctx, init_state, lm, uvw, chan_freq):
            # Given the numba types of the lm, uvw and chan_freq
            # arrays, derive a unified output numba type
            numba_type = typingctx.unify_types(lm.dtype,
                                               uvw.dtype,
                                               chan_freq.dtype)

            # Define the type of new fields on the state object
            # in this case a 2D Numba array with dtype numba_type
            fields = [("real_phase", numba_type[:, :])]

            def real_phase(lm, uvw, chan_freq):
                """Compute the real_phase upfront, instead of in
                the sampling function"""
                real_phase = np.empty((lm.shape[0], uvw.shape[0]), numba_type)

                for s in range(lm.shape[0]):
                    l, m = lm[s]
                    n = 1.0 - l**2 - m**2
                    n = np.sqrt(0.0 if n <= 0.0 else n) - 1.0

                    for r in range(uvw.shape[0]):
                        u, v, w = uvw[r]
                        real_phase[s, r] = -2.0*np.pi*(l*u + m*v + n*w)/3e8

                return real_phase

            # Return the new field definition and
            # the function for creating it
            return fields, real_phase

``init_fields`` serves multiple purposes:

1. It requests input for the Phase term.
   The above definition of ``init_fields`` signifies
   that the Phase term desires the ``lm``, ``uvw`` and
   ``chan_freq`` arrays.
   Additionally, these arrays will be stored on the ``state``
   object provided to the sampling function.

2. It supports reasoning about Numba types.
   The ``lm``, ``uvw`` and ``chan_freq``
   arguments contain the Numba types of the variables supplied
   to the RIME, while the ``typingctx`` argument contains a Numba
   Typing Context which can be useful for reasoning about
   these types.
   For example
   :code:`typingctx.unify_types(lm.dtype, uvw.dtype, chan_freq.dtype)`
   returns a type with sufficient precision, given the input types,
   similar to :func:`numpy.result_type`.

3. It allows the user to define new fields, as
   well as a function for defining those fields
   on the ``state`` object.
   The above definition of ``init_fields`` returns
   a list of :code:`(name, type)` tuples defining
   the new field names and their types, while
   :code:`real_phase` defines the creation of
   this new field.

   This is useful for optimising the sampling function
   by pre-computing values. For example, it is wasteful to
   compute the real phase for each source, row and
   channel.

Returning to our definition of the Phase Term sampling function,
we can see that it uses the new field ``real_phase`` defined in
``init_fields``, as well as the ``chan_freq`` array requested
in ``init_fields`` to compute a complex exponential.

.. code-block:: python

    class Phase(Term):
        def sampler(self):
            def phase_sample(state, s, r, t, f1, f2, a1, a2, c):
                p = state.real_phase[s, r] * state.chan_freq[c]
                return np.cos(p) + np.sin(p)*1j

            return phase_sample

Transformers
++++++++++++

Using :meth:`Term.init_fields`, we can precompute data for use in
sampling functions, within a single Term.
However, sometimes we wish to precompute data for use by multiple
Terms.
This can be achieved through the use of ``Transformers``.
A good example of data that it is useful to precompute for multiple
Terms are ``lm`` coordinates, which are in turn, derived from
``phase_dir`` and ``radec`` which are the phase centre of an
observation and the position of a source, respectively.
In the following code snippet, ``LMTransformer.init_fields``

.. code-block:: python

    from africanus.experimental.rime.fused.transformers import Transformer

    class LMTransformer(Transformer):
        # Must specify list of outputs produced by this transformer on the
        # OUTPUTS class attribute
        OUTPUTS = ["lm"]

        def init_fields(self, typingctx, init_state, radec, phase_dir):
            # Type and provide method for initialising the lm output
            dt = typingctx.unify_types(radec.dtype, phase_dir.dtype)
            fields = [("lm", dt[:, :])]

            def lm(radec, phase_dir):
                lm = np.empty_like(radec)
                pc_ra = phase_dir[0]
                pc_dec = phase_dir[1]

                sin_pc_dec = np.sin(pc_dec)
                cos_pc_dec = np.cos(pc_dec)

                for s in range(radec.shape[0]):
                    da = radec[s, 0] - pc_ra
                    sin_ra_delta = np.sin(da)
                    cos_ra_delta = np.cos(da)

                    sin_dec = np.sin(radec[s, 1])
                    cos_dec = np.cos(radec[s, 1])

                    lm[s, 0] = cos_dec*sin_ra_delta
                    lm[s, 1] = sin_dec*cos_pc_dec - cos_dec*sin_pc_dec*cos_ra_delta

                return lm

            return fields, lm

The ``lm`` array will be available on the ``state`` object and as a valid input
for :meth:`Term.init_fields`.

Indexing arrays
+++++++++++++++

The ``init_state`` and ``state`` objects contains NumPy arrays storing
Measurement Set v2.0 indexing information.

.. code-block:: python

    class State:
        utime             # Unique times
        uantenna          # Unique antenna indices
        ufeed             # Unique feed indices
        time_inverse      # Maps the time at a row into utime
        antenna1_inverse  # Maps the antenna1 index at a row into uantenna
        antenna2_inverse  # Maps the antenna2 index at a row into uantenna
        feed1_inverse     # Maps the feed1 index at a row into ufeed
        feed2_inverse     # Maps the feed2 index at a row into ufeed
        ...

These arrays are useful in cases where the developer wishes to avoid
recomputing values multiple times for each row in the sampling function.
Instead they can be pre-computed for unique times, antennas and feeds
in :meth:`Term.init_fields` and then looked up in :meth:`Term.sampler`.

.. code-block:: python

    class MyTerm(Term):
        def init_fields(self, typingctx, init_state, ...):
            fields = [("precomputed", numba.float64[:, :, :])]

            def precompute(init_state, ...):
                ntime = init_state.utime.shape[0]
                nfeed = init_state.ufeed.shape[0]
                nant = init_state.uantenna.shape[0]
                precomputed = np.empty((ntime, nfeed, nant), np.float64)

                for t in range(ntime):
                    for f in range(nfeed):
                        for a in range(nant):
                            precomputed[t, f, a] = ...

                return precomputed

            return fields, precompute

        def sampler(self, state, s, r, t, f1, f2, a1, a2, c):
            left = self.configuration == "left"

            def sample_precomputed(state, s, r, t, f1, f2, a1, a2, c):
                f = state.feed1_inverse[r] if left else state.feed2_inverse[r]
                a = state.antenna1_inverse[r] if left else state.antenna2_inverse[r]
                return state.precomputed[t, f, a]

            return sample_precomputed


Invoking the RIME
+++++++++++++++++

We then invoke the RIME by passing in the :class:`RimeSpecification`, as
well as a dataset containing the required arguments:

.. code-block:: python

    from africanus.experimental.rime.fused.core import rime
    import numpy as np

    dataset = {
        "radec": np.random.random((10, 2))*1e-5,
        "phase_dir": np.random.random((2,))*1e-5,
        "uvw": np.random.random((100, 3))*1e5,
        "chan_freq:" np.linspace(.856e9, 2*.856e9, 16),
        ...,
        "stokes": np.random.random((10, 4)),
        # other required data
    }

    rime_spec = RimeSpecification("(Kpq, Bpq)",
                                  terms={"K": Phase},
                                  transformers=LMTransformer)
    model_visibilities = rime(rime_spec, dataset)


Dask Support
++++++++++++

Dask wrappers are provided for the
:func:`africanus.experimental.rime.fused.core.rime` function.
In order to support this, both :class:`Term` and :class:`Transformer`
classes need to supply a ``dask_schema`` function which is used to
define the ``schema`` for each supplied argument, which in turn
is supplied to a :func:`dask.array.blockwise` call.

The ``schema`` should be a tuple of dimension string names.
In particular, the ``rime`` function assigns special meaning to
``source``, ``row``, ``chan`` and ``corr`` -- These names are
are associated with individual sources (fields) and Measurement Set
rows, channels and correlations, respectively.
Dask Array chunking is supported along these dimensions in the sense
that the ``rime`` will be computed for each chunk along these dimensions.

.. note::

    Chunks in dimensions other than ``source``, ``row``, ``chan`` and
    ``corr`` will be contracted into a single array within the
    ``rime`` function.
    It is recommended that other dimensions contain a single chunk,
    or contain small quantities of data relative to the special dimensions.


Therefore, :code:`Phase.dask_schema` could be implemented as follows:

.. code-block:: python

    class Phase(Term):
        def dask_schema(self, lm, uvw, chan_freq):
            assert lm.ndim == 2
            assert uvw.ndim == 2
            assert chan_freq.ndim == 1

            return {
                "lm": ("source", "lm-component"),
                "uvw": ("row", "uvw-component"),
                "chan_freq": ("chan",),
            }

The :code:`dask_schema` for a :code:`Transformer` is slightly different as,
in addition a schema for the inputs, it must also provide an ``array_like``
variable describing the number of dimensions and data type of the output
arrays.
The ``array_like`` variables are in turn passed into :class:`Term.dask_schema`.
Thus, :code:`LMTransformer.dask_schema` could be implemented as follows;

.. code-block:: python

    class LMTransformer(Transformer):
        OUTPUTS = ["lm"]

        def dask_schema(self, phase_dir, radec):
            dt = np.result_type(phase_dir.dtype, radec.dtype)
            return ({
                "phase_dir": ("radec-component",),
                "radec": ("source", "radec-component",),
            },
            {
                "lm": np.empty((0,0), dtype=dt)
            })


Then, in a paradigm very similar to the non-dask case, we create
a :class:`RimeSpecification` and supply it,
along with a dictionary or dataset of dask arrays, to the
:func:`rime` function.
This will produce a dask array representing the model
visibilities.

.. code-block:: python

    from africanus.experimental.rime.fused.dask import rime
    import dask.array as da
    import numpy as np

    dataset = {
        "radec": da.random.random((10, 2), chunks=(2, 2))*1e-5,
        "phase_dir": da.random.random((2,), chunks=(2,))*1e-5,
        "uvw": da.random.random((100, 3), chunks=(10, 3))*1e5,
        "chan_freq:" da.linspace(.856e9, 2*.856e9, 16, chunks=(4,)),
        ...,
        "stokes": da.random.random((10, 4), chunks=(2, 4)),
        # other required data
    }

    rime_spec = RimeSpecification("(Kpq, Bpq)",
                                  terms={"K": Phase},
                                  transformers=LMTransformer)
    model_visibilities = rime(rime_spec, dataset)
    model_visibilities.compute()


API
~~~

.. currentmodule:: africanus.experimental.rime.fused.specification

.. autoclass:: RimeSpecification
    :exclude-members: equation_bits, flatten_eqn


.. currentmodule:: africanus.experimental.rime.fused.terms.core

.. py:class:: Term

    Base class for Terms which describe parts of the Fused RIME.
    Implementors of a RIME Term should inherit from it.

    A Term is an object that defines how a term in the RIME should
    be sampled to produces the Jones Terms that make up the RIME.
    It therefore defines a sampling function, which in turn
    depends on arbitrary inputs for performing the sampling.

    A high degree of flexibility and leeway is afforded when
    implementing a Term. It might be implemented by merely indexing
    an array of Jones Matrices, or by implementing some computational
    model describing the Jones Terms.

    .. code-block:: python

        class Phase(Term):
            def __init__(self, configuration):
                super().__init__(configuration)

    .. py:method:: Term.init_fields(self, typing_ctx, init_state, \
                                    arg1, ..., argn, \
                                    kwarg1=None, ..., kwargn=None)

        Requests inputs to the RIME term, ensuring that they are
        stored on a ``state`` object supplied to the sampling function
        and allows for new fields to be initialised and stored on the
        ``state`` object.

        Requested inputs :code:`arg1...argn` are required to be passed
        to the Fused RIME by the caller and are supplied to ``init_fields``
        as Numba types. :code:`kwarg1...kwargn` are optional -- if omitted
        by the caller, their default types (and values)  will be supplied.

        ``init_fields`` should return a :code:`(fields, function)` tuple.
        ``fields`` should be a list of the form :code:`[(name, numba_type)]`, while
        ``function`` should be a function of the form
        :code:`fn(init_state, arg1, ..., argn, kwarg1=None, .., kwargn=None)`
        and should return the variables of the type defined
        in ``fields``. Note that it's signature therefore matches
        that of ``init_fields`` from after the ``typingctx``
        argument. See the
        :ref:`Simple Example <experimental-fused-rime-example-anchor>`.

        :param typingctx: A Numba typing context.
        :param init_state: State object holding index information.
        :param arg1...argn: Required RIME inputs for this Term.
        :param kwarg1...kwargn: Optional RIME inputs for this Term. \
            Types here should be simple: ints, floats, complex numbers
            and strings are ideal.

        :rtype: tuple
        :returns: A :code:`(fields, function)` tuple.

        .. warning::

            The ``function`` returned by ``init_fields`` must be compileable
            in Numba's
            `nopython <https://numba.pydata.org/numba-doc/latest/user/jit.html#nopython_>`_ mode.


    .. py:method:: Term.sampler(self)

        Return a sampling function of the following form:

        .. code-block:: python

            def sampler(self):
                def sample(state, s, r, t, f1, f2, a1, a2, c):
                    ...

            return sample

        :param state: A state object containing the inputs requested by
                      all ``Term`` objects in the RIME, as well as any
                      fields created by ``Term.init_fields``.
        :param s: Source index.
        :param r: Row index.
        :param t: Time index.
        :param f1: Feed 1 index.
        :param f2: Feed 2 index.
        :param a1: Antenna 1 index.
        :param a2: Antenna2 index.
        :param c: Channel index.

        :rtype: scalar or a tuple
        :returns: a scalar or a tuple of two scalars or a tuple of four scalars.

        .. warning::

            The sampling function returned by ``sampler`` must be compileable
            in Numba's
            `nopython <https://numba.pydata.org/numba-doc/latest/user/jit.html#nopython_>`_ mode.

    .. py:method:: dask_schema(self, arg1, ..., argn, \
                kwargs1=None, ..., kwargn=None)

        :param arg1...argn: Required RIME inputs for this Transformer.
        :param kwarg1...kwargn: Optional RIME inputs for this Transformer. \
            Types here should be simple: ints, floats, complex numbers
            and strings are ideal.

        :rtype: dict
        :returns: A dictionary of the form :code:`{name: schema}` defining
                  the :func:`~dask.array.blockwise` dimension schema of each
                  supplied argument and keyword argument.

.. currentmodule:: africanus.experimental.rime.fused.transformers.core

.. py:class:: Transformer

    Base class for precomputing data for consumption by
    :class:`~africanus.experimental.rime.fused.terms.core.Term`'s.

    .. py:attribute:: OUTPUTS

        This class attributes should contain names of the outputs produced
        by the Transformer class.
        This should correspond to the fields produced by
        :meth:`Transformer.init_fields`.

    .. py:method:: Transformer.init_fields(self, typing_ctx, init_state, \
                                           arg1, ..., argn, \
                                           kwarg1=None, ..., kwargn=None)

        Requests inputs to the Transformer, and specifies new fields and
        the function for creating them on the ``state`` object.
        Functionally, this method behaves exactly the same as the
        :meth:`~africanus.experimental.rime.fused.terms.core.Term.init_fields`
        method, the difference being that the outputs are available to all
        Terms.

        :rtype: tuple
        :returns: A :code:`(fields, function)` tuple.

        .. warning::

            The ``function`` returned by ``init_fields`` must be compileable
            in Numba's
            `nopython <https://numba.pydata.org/numba-doc/latest/user/jit.html#nopython_>`_ mode.

    .. py:method:: dask_schema(self, init_state, \
                               arg1, ..., argn, \
                               kwargs1=None, ..., kwargn=None)


        :rtype: tuple
        :returns: A :code:`(inputs, outputs)` tuple.

                ``inputs`` should
                be a dictionary of the form :code:`{name: schema}`
                where ``schema`` is a dimension schema suitable for use
                in :func:`dask.array.blockwise`. A suitable schema for
                visibility data would be :code:`(row, chan, corr)`,
                while a uvw coordinate schema could be
                :code:`(row, uvw-component)`.

                ``outputs`` should be a dictionary of the form
                :code:`{name: array_like}`, where ``array_like``
                is an object with ``dtype`` and ``ndim`` attributes.
                A suitable array_like for lm data could be
                :code:`np.empty((0,0), dtype=np.float64)`.


Predefined Terms
++++++++++++++++

.. autoclass:: africanus.experimental.rime.fused.terms.phase.Phase
    :exclude-members: init_fields, dask_schema, sampler, validate_constructor, validate_sampler
.. autoclass:: africanus.experimental.rime.fused.terms.brightness.Brightness
    :exclude-members: init_fields, dask_schema, sampler, validate_constructor, validate_sampler
.. autoclass:: africanus.experimental.rime.fused.terms.gaussian.Gaussian
    :exclude-members: init_fields, dask_schema, sampler, validate_constructor, validate_sampler
.. autoclass:: africanus.experimental.rime.fused.terms.feed_rotation.FeedRotation
    :exclude-members: init_fields, dask_schema, sampler, validate_constructor, validate_sampler
.. autoclass:: africanus.experimental.rime.fused.terms.cube_dde.BeamCubeDDE
    :exclude-members: init_fields, dask_schema, sampler, validate_constructor, validate_sampler

Predefined Transformers
+++++++++++++++++++++++

.. autoclass:: africanus.experimental.rime.fused.transformers.lm.LMTransformer
    :exclude-members: init_fields, dask_schema, sampler, validate_constructor, transform_validator
.. autoclass:: africanus.experimental.rime.fused.transformers.parangle.ParallacticTransformer
    :exclude-members: init_fields, dask_schema, sampler, validate_constructor, transform_validator

Numpy
~~~~~

.. currentmodule:: africanus.experimental.rime.fused.core

.. autosummary::
    rime

.. autofunction:: rime

Dask
~~~~

.. currentmodule:: africanus.experimental.rime.fused.dask

.. autosummary::
    rime

.. autofunction:: rime