diff --git a/CIP-plutus-modules/README.md b/CIP-plutus-modules/README.md
new file mode 100644
index 000000000..b91df2e74
--- /dev/null
+++ b/CIP-plutus-modules/README.md
@@ -0,0 +1,1656 @@
+---
+CIP: "?"
+Title: Modules in UPLC
+Status: Proposed
+Category: Plutus
+Authors:
+  - John Hughes <john.hughes@quviq.com>
+Implementors: []
+Discussions: []
+Created: 2024-11-12
+License: CC-BY-4.0
+---
+## Abstract
+
+Cardano scripts are limited in complexity by the fact that each script
+must be supplied in one transaction, whether the script is supplied in
+the same transaction in which it is used, or pre-loaded onto the chain
+for use as a reference script. This limits script code size, which in
+turn limits the use of libraries in scripts, and ultimately limits the
+sophistication of Cardano apps, compared to competing blockchains. The
+script size limit is an aspect of Cardano that script developers
+commonly complain about.
+
+This CIP addresses this problem directly, by allowing reference inputs
+to supply 'modules', which can be used from other scripts (including
+other modules), thus allowing the code of a script to be spread across
+many reference inputs. The 'main specification' requires *no* changes to
+UPLC, PTLC, PIR or Plinth; only a 'dependency resolution' step before
+scripts are run. Many variations are described for better performance,
+including some requiring changes to the CEK machine itself.
+
+Higher performance variations will be more expensive to implement; the
+final choice of variations should take implementation cost into
+account, and (in some cases) may require extensive benchmarking.
+
+## Motivation: why is this CIP necessary?
+
+Cardano scripts are currently subject to a fairly tight size limit;
+even when they are supplied as a reference input, that UTxO must be
+created by a single transaction, which is subject to the overall
+transaction size limit. Competing blockchains suffer from no such
+limit: on the Ethereum chain, for example, contracts can call one
+another, and so the code executed in one transaction may come from
+many different contracts, created independently on the
+blockchain--each subject to a contract size limit, but together
+potentially many times that size. This enables more sophisticated
+contracts to be implemented; conversely, on Cardano, it is rather
+impractical to implement higher-level abstractions as libraries,
+because doing so will likely exceed the script size limit. This is not
+just a theoretical problem: complaints about the script size limit are
+commonly made by Cardano contract developers.
+
+Thus the primary goal of this CIP is to lift the limit on the total
+amount of code run during a script execution, by allowing part of the
+code to be provided in external modules. By storing these modules on
+the blockchain and providing them as reference UTxOs, it will be
+possible to keep transactions small even though they may invoke a large
+volume of code.
+
+Once scripts can be split into separate modules, then the question
+immediately arises of whether or not the script and the modules it
+imports need to be in the same language. Today there are many
+languages that compile to UPLC, and run on the Cardano
+blockchain. Ideally it should be possible to define a useful library
+in any of these languages, and then use it from all of them. A
+secondary goal is thus to define a module system which permits this,
+by supporting cross-language calls.
+
+Note that many languages targetting UPLC already support modules. In
+particular, Plinth already enjoys a module system, namely the Haskell
+module system. This already enables code to be distributed across
+several modules, or put into libraries and shared. Indeed this is
+already heavily used: the DJED code base distributes Plinth code
+across 24 files, of which only 4 contain top-level contracts, and the
+others provide supporting code of one sort or another. Thus the
+software engineering benefits of a module system are already
+available. The *disadvantage* of this approach is that all the code is
+combined into one script, which can easily exceed the size limit as a
+result. Indeed, the DJED code base also contains an implementation of
+insertion sort in Plinth, with a comment that a quadratic algorithm is
+used because its code is smaller than, for example, QuickSort. There
+is no clearer way to indicate why the overall limit on code size must be
+lifted.
+
+### The Situation on Ethereum
+
+Ethereum contracts are not directly comparable to Cardano scripts;
+they correspond to both the *on-chain* and the *off-chain* parts of
+Cardano contracts, so one should expect Ethereum contracts to require
+more code for the same task, since in Cardano only the verifiers need
+to run on the chain itself. Nevertheless, it is interesting to ask
+whether, and how, the need for modules has been met in the Ethereum
+context.
+
+Solidity does provide a notion of 'library', which collects a number
+of reusable functions together. Libraries can be 'internal' or
+'external'--the former are just compiled into the code of client
+contracts (and so count towards its size limit), while the latter are
+stored separately on the blockchain.
+
+There is a problem of trust in using code supplied by someone else:
+the documentation for the `ethpm` package manager for Ethereum warns
+sternly
+
+**you should NEVER import a package from a registry with an unknown or
+  untrusted owner**
+
+It seems there *is* only one trusted registry, and it is the one
+supplied as an example by the developers of `ethpm`. In other words,
+while there is a package manager for Ethereum, it does not appear to
+be in use.
+
+This is not to say that code is never shared. On the contrary, there
+is an open source repo on `github` called `OpenZeppelin` which appears
+to be heavily used. It provides 264 Solidity files, in which 43
+libraries are declared (almost all internal). It seems, thus, that
+libraries are not the main way of reusing code in Solidity; rather it
+is by calling, or inheriting from, another contract, that code reuse
+primarily occurs.
+
+A small study of 20 'verified' contracts running on the Ethereum chain
+(verified in the sense that their source code was provided) showed that
+
+* 55% of contracts consisted of more than one module
+* 40% of contracts contained more than one 'application' module
+* 55% of contracts imported `OpenZeppelin` modules
+* 10-15% of contracts imported modules from other sources
+* 5% of contracts were simply copies of `OpenZeppelin` contracts
+
+Some of the 'other' modules were provided to support specific
+protocols; for example Layr Labs provide modules to support their
+Eigenlayer protocol for re-staking.
+
+A sample of 20 is too small to draw very strong statistical conclusions,
+but we can say that the 95% confidence interval for contracts to
+consist of multiple modules is 34-74%.
+Thus code sharing is clearly going on, and a significant number of
+transactions exploit multiple modules. We may conclude that there is a
+significant demand for modules in the context of smart contracts, even
+if the total contract code still remains relatively small.
+
+
+## Specification
+
+### Adding modules to UPLC
+
+This CIP provides the simplest possible way to split scripts across
+multiple UTxOs; essentially, it allows any closed subterm to be
+replaced by its hash, whereupon the term can be supplied either as a
+witness in the invoking transaction, or via a [reference script](https://cips.cardano.org/cip/CIP-0033) in that
+transaction. To avoid any change to the syntax of UPLC, hashes are
+allowed only at the top-level (so to replace a deeply nested subterm
+by its hash, we need to first lambda-abstract it). This also places
+all references to external terms in one place, where they can easily
+be found and resolved. Thus we need only change the definition of a
+`Script`; instead of simply some code, it becomes the application of
+code to zero or more arguments, given by hashes.
+
+Currently, the definition of “script” used by the ledger is (approximately):
+```
+newtype Script = Script ShortByteString
+```
+We change this to:
+```
+newtype CompleteScript = CompleteScript ShortByteString
+
+newtype Arg = ScriptArg ScriptHash
+
+data Script =
+  ScriptWithArgs { head :: CompleteScript, args :: [Arg] }
+
+-- hash of a Script, not a CompleteScript
+type ScriptHash = ByteString
+```
+
+Scripts in transactions, and on the chain, are represented in this
+way, with dependencies that must be supplied in a transaction using
+the script. During phase 2 verification we need to resolve the
+arguments of each script before running it:
+```
+resolveScriptDependencies
+  :: Map ScriptHash Script
+  -> Script
+  -> Maybe CompleteScript
+resolveScriptDependencies preimages = go
+  where
+    go (ScriptWithArgs head args) = do
+      argScripts <- traverse lookupArg args
+      pure $ applyScript head argScripts
+      where
+        lookupArg :: Arg -> Maybe CompleteScript
+        lookupArg (ScriptArg hash) = do
+          script <- lookup hash preimages
+          go script
+```
+The `preimages` map is the usual witness map constructed by the ledger,
+so in order for a script hash argument to be resolved, the transaction
+must provide the pre-image in the usual way. Note that arguments are
+mapped to a `Script`, not a `CompleteScript`, so the result of looking
+up a hash may contain further dependencies, which need to be resolved
+recursively. A transaction must provide witnesses for *all* the
+recursive dependencies of the scripts it invokes.
+
+The only scripts that can be run are complete scripts, so the type of
+`runScript` changes to take a `CompleteScript` instead of a `Script`.
+
+#### Variation: Lazy Loading
+
+With this design, if any script hash is missing from the `preimages`,
+then the entire resolution fails. As an alternative, we might replace
+missing subterms by a dummy value, such as `builtin unit`, thus:
+```
+resolveScriptDependencies
+  :: Map ScriptHash Script
+  -> Script
+  -> CompleteScript
+resolveScriptDependencies preimages = go
+  where
+    go (ScriptWithArgs head args) =
+      applyScript head (map lookupArg args)
+      where
+        lookupArg :: Arg -> CompleteScript
+        lookupArg (ScriptArg hash) = do
+          case lookup hash preimages of
+	    Nothing     -> builtin unit
+	    Just script -> go script
+```
+This would allow transactions to provide witnesses only for script
+arguments which are actually *used* in the calls that the transaction
+makes. This may sometimes lead to a significant reduction in the
+amount of code that must be loaded; for example, imagine a spending
+verifier which offers a choice of two encryption methods, provided as
+separate script arguments. In any call of the verifier, only one
+encryption method will be required, allowing the other (and all its
+dependencies) to be omitted from the spending transaction.
+
+#### Variation: Value Scripts
+
+The goal of this variation is to eliminate the cost of evaluating
+scripts, by converting them directly to values. Since UPLC runs on the
+CEK machine, this means converting them directly into the `CekValue` type,
+*without* any CEK machine execution. To make this possible, the syntax
+of scripts is restricted so that those parts that would be evaluated
+during an application to the script arguments are already (UPLC)
+values. That is, script code is syntactically restricted to explicit
+λ-expressions with one λ per `ScriptArg`, followed by a syntactic
+value. (Values are constants, variables, built-ins, λ-abstractions,
+delayed terms, and SoP constructors whose fields are also values).
+
+This means that every script must take the form
+`λA1.λA2....λAn.<value>`, where `n` is the number of `ScriptArg`s
+supplied. Now, since variables in `CompiledCode` are de Bruijn indices
+then the `n` λs can be omitted from the representation--we know how
+many there must be from the number of `ScriptArg`s, and the names
+themselves can be reconstructed.
+
+There must be a dynamic check that the code of each script really is
+of this form, but this check can be built into deserialization, and
+thus need cost very little.
+
+`Script`s in this restricted form can be mapped directly into CEK
+values, without any CEK-machine evaluation steps. In pseudocode:
+```
+scriptCekValue
+  :: Map ScriptHash CekValue
+  -> Script
+  -> CekValue
+scriptCekValue scriptValues (ScriptWithArgs head args) =
+  cekValue (Env.fromList [case lookup h scriptValues of
+	   		    Just v -> v
+			    Nothing -> vbuiltin unit
+	   		 | ScriptArg h <- args])
+	   (deserialize (getPlc head))
+
+```
+That is, a script is turned into a value by creating a CEK machine
+environment from the values of the `ScriptArg`s, and converting the
+body of the script (a syntactic value) in a CekValue in that
+environment.
+
+This pseudocode follows the 'lazy loading' variation; an easy
+variation treats not finding a script hash as an error.
+
+Syntactic values are turned into `CekValue`s by the following
+function, which is derived by simplifying `computeCek` in
+UntypedPlutusCore.Evaluation.Machine.Cek.Internal, and restricting it
+to syntactic values.
+```
+cekValue
+  :: Env
+  -> NTerm
+  -> CEKValue
+cekValue env t = case t of
+  Var _ varname      -> lookupVarName varName env
+  Constant _ val     -> VCon val
+  LamAbs _ name body -> VLamAbs name body env
+  Delay _ body       -> VDelay body env
+  Builtin _ bn       ->
+    let meaning = lookupBuiltin bn ?cekRuntime in
+    VBuiltin bn (Builtin () bn) meaning
+  Constr _ i es      ->
+    VConstr i (foldr ConsStack EmptyStack (map (cekValue env) es)
+  _                  -> error
+```
+Converting a syntactic value to a CekValue does require traversing it,
+but the traversal stops at λs and delays, so will normally traverse
+only the top levels of a term.
+
+Finally, if `preimages` is the `Map ScriptHash Script` constructed from
+a transaction, then we may define
+```
+scriptValues = Map.map (scriptCekValue scriptValues) preimages
+```
+to compute the CekValue of each script.
+
+Scripts are then applied to their arguments by building an initial CEK
+machine configuration applying the script value to its argument value.
+
+
+
+Note that this recursive definition of `scriptValues` could potentially allow an
+attacker to cause a black-hole exception in the transaction validator,
+by submitting a transaction containing scripts with a dependency
+cycle. However, since scripts are referred to by hashes, then
+constructing such a transaction would require an attack on the hash
+function itself... for example a script hash `h` and values for `head`
+and `args` such that
+```
+h = hash (Script head (h:args))
+```
+We assume that finding such an `h` is impossible in practice; should
+this not be the case, or if we should wish to defend against an
+attacker with the resources to find such an attack on the hash
+function, then we must build a script dependency graph for each
+transaction and check that it is acyclic before evaluating the scripts
+in this way.
+
+##### Cost
+
+Converting `Script`s to `CekValue`s does require a traversal of all
+`Script`s, and the top level of each `Script` value. This is linear
+time in the total size of the scripts, though, and should be
+considerably faster than doing the same evaluation using CEK machine
+transitions. The conversion can be done *once* for a whole
+transaction, sharing the cost between several scripts if they share
+modules (such as frequently used libraries). So costs should be
+charged *for the whole transaction*, not per script. The most accurate
+cost would be proportional to the total size of values at the
+top-level of scripts. A simpler approach would be to charge a cost
+proportional to the aggregated size of all scripts, including
+reference scripts--although this risks penalizing complex scripts with
+a simple API.
+
+##### Implementation concerns
+The CEK implementation does not, today, expose an API for starting
+evaluation from a given configuration, or constructing `CekValue`s
+directly, so this variation does involve significant changes to the
+CEK machine itself.
+
+##### Subvariation: Module-level recursion
+
+Many modules define recursive functions at the top-level. In this
+variation, the innermost body of a script is further restricted to the
+form `λSelf.<value>`, and `resolveScriptDependencies` applies an
+implicit `fix` to the script body, after supplying the script
+arguments.  Like the other λs binding script arguments, the `λSelf.`
+need not appear in the actual representation; we know it has to be
+there so we can just store the body of the `λ`. When a script is
+evaluated, the value of the script is just added to the environment in
+the same way as the script arguments. The script can then refer to
+its own value using `Self`.
+
+#### Variation: Explicit lambdas
+
+This variation is a less-restrictive version of 'value scripts'. As in
+the former case, we restrict scripts syntactically to explicit
+λ-expressions binding the script arguments, but we do not restrict the
+script body proper to be a syntactic value. As in the former case, the
+λs need not be present in the `Script` representation, because their
+number is known from the number of script arguments, and the bound
+variables are deBruijn indices.
+
+In this variation, script bodies cannot be converted to `CekValue`s
+using `cekValue`; we actually have to run the CEK machine to evaluate
+them. This requires extending the API of the CEK machine, to support
+evaluating a UPLC term *in a given environment*, and returning a
+`CekValue`, rather than a discharged `NTerm`, because discharging a
+`CekValue` loses sharing. Losing sharing is unacceptable because it
+introduces a potentially exponential space cost for acyclic
+structures, and leads to non-termination in the case of cyclic
+structures (created by 'Module-level recursion').
+
+The implementation of the CEK machine currently always discharges
+values before returning them; the core loop of the machine will need
+to be modified to change this.
+
+Since script bodies must be evaluated by running the CEK machine, then
+it is possible to exceed the execution unit budget at any point during
+the script evaluation. The budget must be checked during these
+evaluations, and the budget for evaluating each script will depend on
+the actual costs of evaluating all the previous ones.
+
+To avoid circular dependencies, the scripts must be topologically
+sorted before evaluation, so that no earlier script depends on a later
+one. Topological sorting is linear time in the total number of scripts
+and script arguments.
+
+It is still possible to write a recursive definition of the
+`scriptValues`, so that each script can depend on the *same* map, but
+care is needed to avoid circular dependencies for the reasons
+explained above.
+
+#### A Note on Tuples
+
+The following variations make heavy use of tuples which in practice
+could grow quite large--tuples of modules, and modules as tuples of
+exports. These variations only make sense if projection of a component
+from a tuple is *efficient*, and in particular, constant time,
+independent of the tuple size. At present, tuples are represented
+using the SoP extension (CIP-85) as `constr 0 x1...xn`, but the only
+way to select the `i`th component is using
+```
+  case t of (constr 0 x1...xi...xn) -> xi
+```
+which takes time linear in the size of the tuple to execute, because
+all `n` components need to be extracted from the tuple and passed to
+the case branch (represented by a function).
+
+We assume below that there is an expression `proj i t` in UPLC, where
+`i` is a constant, which efficiently extracts the `i`th component from
+tuple `t`. There are several ways this could be implemented:
+
+* `proj i t` could be added as a new construct to UPLC, together with
+  extensions to the CEK machine to evaluate it.
+* `proj` could be added as a new built-in to UPLC--probably a smaller
+  change to the implementation, but less efficient (because `i` would
+  be an argument needing evaluation, rather than a constant integer in
+  the AST), and problematic to add to TPLC (because typing it requires
+  dependent types).
+* represent 'tuples' in this context as functions from indices to
+  components, so `(x,y,z)` would be represented as
+  ```
+  λi. case i of 0 -> x
+      	     	1 -> y
+		2 -> z
+  ```
+  This requires support in UPLC for pattern-matching on integers in
+  constant time, which is not implemented right now, but is on the
+  horizon. It would also need dependent types to be typed, and so
+  cannot be added to Plinth, PIR or PTLC.
+
+In the sections below we just use tuples and the notation `proj i t`,
+on the assumption that an implementation is chosen and deployed.
+
+#### Variation: Tuples of modules
+
+In the main specification in this CIP, script code is a curried
+function of the script arguments; that is, imported modules are
+supplied to scripts as individual arguments. In this variation, the
+script code is instead an *uncurried* function of the script
+arguments, which are tupled together to be passed to the script code.
+
+This variation only makes sense if the 'value scripts' variation is
+also adopted, and places an additional syntactic restriction on script
+code: it must be of the form `λMods.e`, and all occurrences of `Mods`
+in `e` must be of the form `proj i Mods` for some `i`. That is, it is
+impossible to refer to the whole tuple of modules; scripts can refer
+to only one module at a time.
+
+To avoid additional overheads for scripts without arguments, we
+redefine the `Script` type as follows:
+```
+data Script =
+    CompleteScript CompleteScript
+  | ScriptWithArgs { head :: CompleteScript, args :: [Arg] }
+```
+Here the `CompleteScript` alternative is used for scripts without
+script arguments; such scripts are not applied to a tuple of modules
+before use, and so need not be of the form `λMods.e`.
+
+##### Subvariation: Global module environment
+
+In the 'tuples of modules' variation, each script is paremeterised on
+a tuple of modules, and fetches the modules when needed by projecting
+out a component of the tuple. In the 'global module environment'
+subvariation, *all* the modules are placed in *one* tuple, from which
+scripts fetch the modules they need.
+
+The global module environment is constructed for the transaction as a
+whole, containing all the scripts provided by the transaction. It
+follows that the *same* module may end up in *different* components in
+different transactions. Scripts refer to other modules via references
+of the form `proj i Mods`, where `Mods` is the variable bound to the
+tuple of modules. Before scripts are run, these references must be
+replaced by `proj j Mods`, where `j` is the index of the corresponding
+module in the global module environment. Thus it is necessary to
+traverse the code of all the scripts, relocating module references to
+refer to the global module environment instead. One this is done, then all
+the script values can refer to the *same* tuple of modules.
+
+###### Subsubvariation: Module environment built into the CEK machine
+
+In this subsubvariation, the (single) tuple of modules is passed (as a
+new implicit parameter) directly to the CEK machine, instead of being
+passed as a parameter in UPLC. Consequently it cannot be accessed as a
+UPLC variable; new UPLC constructs are needed instead. Since
+references to the global tuple of modules always refer to a
+*particular* module, then it suffices to add a construct of the form
+```
+data Term name uni fun ann = ..  | ModuleRef Int
+```
+such that `ModuleRef i` evaluates to the `i`th component of the global
+module tuple.
+
+Once again, the scripts provided in a transaction must refer to script
+arguments using an index into *the script's own* script arguments;
+before execution these indices must be replaced by the corresponding
+indices in the global module environment, necessitating a traversal of
+the script code to prepare it for execution.
+
+##### Subvariation: Unboxed modules
+
+In this subvariation, we distinguish between validation scripts and
+scripts representing modules; the latter are subject to an additional
+syntactic restriction that the script body must be a tuple. We change
+the `Script` type accordingly
+```
+data Script = ValidatorScript         CompiledCode [ScriptArg]
+            | ModuleScript            CompiledCode [ScriptArg]
+```
+so that the deserializer can easily check the new syntactic
+restriction. `Script`s used as `ScriptArg`s may only be of the
+`ModuleScript` form (this requires a dynamic check). The idea is that
+a module provides a number of exports, which are the components of the
+tuple. (Again, special cases for an empty list of script arguments
+can be included in this type if desired).
+
+In addition, expressions `M` referring to modules (of the form `proj j
+Mods`) may only appear in contexts of the form `proj i M`, projecting
+out one of the module exports. We call these terms 'export
+references'.
+
+With this restriction, a tuple of modules is now a tuple of tuples,
+and the effect of the subvariation is to flatten that into a tuple of
+exports instead. Every module export is assigned an index in the
+resulting tuple, and the scripts must be preprocessed before execution
+to replace the indexes in every export reference by the corresponding
+index in the tuple--so `proj i (proj j Mods)` becomes `proj k Mods`
+for `k` the index of the `i`th export of the `j`th module. Since the
+placement of modules in a global tuple depends on *all* the modules
+used in a transaction, and since some of the scripts used by a
+transaction are taken from pre-existing reference UTxOs, then this
+preprocessing cannot be done in advance; it must be done during script
+verification of the transaction.
+
+In the case of modules which are omitted from the transaction (see
+'lazy loading'), the export references `proj i (proj j Mods)` should
+be replaced by `builtin unit`. This is either the correct value, or
+will cause a run-time type error (and thus verification failure) if
+the value is used.
+
+This subvariation can be combined with 'module environment built into
+the CEK machine', in which case the export references are replaced by
+suitable `ModuleRef k` expressions as before.
+
+This subvariation does not change the `CompiledCode` stored in
+scripts; it only affects the way that code is prepared for execution.
+
+
+##### Script traversal costs
+
+The last two subvariations above both require a traversal of all the
+script code in a transaction (including the code fetched from
+reference scripts) to adjust module or export references. If they are
+adopted, transaction fees should be increased by an amount linear in
+the total script size to pay for this traversal.
+
+### Modules in TPLC
+
+No change is needed in TPLC.
+
+### Modules in PIR
+
+No change is needed in PIR.
+
+### Modules in Plinth
+
+The Plinth modules introduced in this CIP bear no relation to Haskell
+modules; their purpose is simply to support the module mechanism added
+to UPLC. They are first-class values in Haskell.
+
+Just as we introduced a distinction in UPLC between `CompleteScript`
+and `Script`, so we introduce a distinction in Plinth between
+`CompiledCode a` (returned by the Plinth compiler when compiling a
+term of type `a`), and `Module a` representing a top-level `Script`
+with a value of type `a`.
+```
+newtype Module a = Module {unModule :: Mod}
+
+newtype ModArg = ModArg ScriptHash
+
+data Mod = forall b. Mod{ modCode :: Maybe (CompiledCode b),
+     	   	     	  modArgs :: Maybe ([ModArg]),
+			  modHash :: ScriptHash }
+```
+
+Here the `modArgs` correspond to the `ScriptArg`s in the UPLC case,
+and the `modHash` is the hash of the underlying `Script`.  The type
+parameter of `Module a` is a phantom parameter, just like the type
+parameter of `CompiledCode a`, which tells us the type of value which
+the application of the `modCode` to the `modArgs` represents.
+
+We can convert any `ScriptHash` into a module:
+```
+fromScriptHash :: ScriptHash -> Module a
+fromScriptHash hash = Module Nothing Nothing hash
+```
+and we can convert any `CompiledCode` into a module:
+```
+makeModule :: CompiledCode a -> Module a
+makeModule code = Module (Just (Mod code)) (Just []) ...compute the script hash...
+```
+
+We also need a way to supply an imported module to a `Module`:
+```
+applyModule :: Module (a->b) -> Module a -> Module b
+applyModule (Module (Mod (Just code) (Just args) _)) m =
+  Module (Mod (Just code) (Just (args++[modHash m])) ...compute the script hash...)
+```
+As in UPLC, the intention is that scripts that import modules be
+written as lambda-expressions, and the imported module is then
+supplied using `applyModule`. No change is needed in the Plinth
+compiler to support this mechanism.
+
+Note that only a `Module` containing code and an argument list can
+have the argument list extended by `applyModule`; this is because the
+`ScriptHash` of the result depends on the code and the entire list of
+arguments, so it cannot be computed for a module that lacks either of
+these.
+
+It is `Module` values that would then be serialised to produce scripts
+for inclusion in transactions.
+
+In the 'unboxed modules' variation we need to distinguish two kinds of
+scripts, scripts which define modules, and scripts which define
+validators. In Plinth, this distinction can be made in the types, by
+turning the `Module` type into a GADT with an extra parameter, of type
+```
+data ScriptType = ModuleScript | ValidatorScript
+```
+`applyModule` would be given a more restrictive type:
+```
+applyModule :: Module s (a->b) -> Module ModuleScript a -> Module s b
+```
+thus ensuring that only scripts representing modules are passed as
+script arguments.
+
+### Plutus Ledger Language Versions
+
+Plutus ledger language version is what "Plutus V1", "Plutus V2", "Plutus V3" refer to.
+These are not distinct programming languages; the primary difference lies in the arguments the script receives from the ledger, and the value it returns[^1].
+Plutus V1, V2 and V3 can therefore be understood as type signatures, in the sense that they each represent a subset of UPLC programs with specific types. Any UPLC program that matches the expected argument and return types can be considered and used as a Plutus V1, V2 or V3 script.
+A new ledger era is the primary reason for introducing a new ledger language version, though technically there can be cases where a new ledger language version is necessary without a new ledger era.
+
+Currently each script on-chain is tagged with a specific ledger language version - V1, V2, V3 or native script - and this version tag is a component of the script hash.
+A logical approach, therefore, is to continue doing so for module scripts, and require that a validator script and all modules it references must use the same ledger language version; failure to do so leads to a phase-1 error.
+
+A different approach is to distinguish between validator scripts and module scripts by applying version tags only to validator scripts.
+Module scripts are untagged and can be linked to any validator script.
+This makes module scripts more reusable, which is advantageous because in most cases, a UPLC program has the same semantics regardless of the ledger language version.
+
+This is, however, not always the case because a few builtin functions have multiple semantic variants, and the variant used may differ depending on the ledger language version.
+Nonetheless, if a module script depends on a particular ledger language version to work correctly, this requirement can be communicated through alternative means, e.g., as a piece of metadata in a module script registry.
+
+Another drawback of untagged modules is that untagged modules will be a new concept that doesn't currently exist, and as a result, modules will not be usable in Plutus V1 through V3, and can only be used from Plutus V4 onwards.
+
+### Plutus Core Versions
+
+Plutus Core version is the usual sense of version pertaining to programming languages - in this instance the Plutus Core language.
+So far there have been two Plutus Core versions: 1.0.0 and 1.1.0. 1.1.0 adds sums-of-products to the language by introducing two new AST node types: Case and Constr.
+See [CIP-85](https://cips.cardano.org/cip/CIP-0085) for more details.
+Each UPLC program is tagged with a Plutus Core version (where as for ledger language versions, only _scripts_ that exist on-chain, i.e., stored in UTXOs, are tagged with ledger language versions).
+
+UPLC programs with different Plutus Core versions are incompatible and cannot be combined, and therefore, a validator script and all modules it references must share the same Plutus Core version; otherwise it is a phase-1 error.
+
+## Rationale: how does this CIP achieve its goals?
+
+This CIP provides a minimal mechanism to split scripts across several
+transactions. 'Imported' modules are provided in the calling
+transaction and passed as arguments to the top-level script, and their
+identity is checked using their hash. The representation of modules is
+left entirely up to compiler-writers to choose--a module may be any
+value at all. For example, one compiler might choose to represent
+modules as a tuple of functions, while another might map function
+names to tags, as Solidity does, and represent a module as a function
+from tags to functions. Each language will need to define its own
+conventions for module representations, and implement them on top of
+this low-level mechanism. For example, a typed language might
+represent a module as a tuple of exported values, and store the names
+and types of the values in an (off-chain) interface file. Clients
+could use the interface file to refer to exported values by name, and
+to perform type-checking across module boundaries.
+
+### Recursive modules
+
+This design does not support mutually recursive modules. Module
+recursion is sometimes used in languages such as Haskell, but it is a
+rarely-used feature that will not be much missed.
+
+### Cross-language calls
+
+There is no a priori reason why script arguments need be written in
+the same high-level language as the script itself; thus this CIP
+supports cross-language calls. However, since different languages may
+adopt different conventions for how modules are represented, then some
+'glue code' is likely to be needed for modules in different languages
+to work together. In the longer term, it might be worthwhile defining
+an IDL (Interface Definition Language) for UPLC, to generate this glue
+code, and enable scripts to call code in other languages more
+seamlessly. This is beyond the scope of this CIP; however this basic
+mechanism will not constrain the design of such an IDL in the future.
+
+In Plinth, because the `Module` type is a phantom type, it is easy to
+take code from elsewhere and turn it into a `Module t` for arbitrary
+choice of `t`; this can be used to import modules compiled from other
+languages into Plinth (provided a sensible Plinth type can be given to
+them).
+
+
+### Static vs Dynamic Linking
+
+With the introduction of modules, scripts are no longer
+self-contained--they may depend on imported modules. This applies both
+to scripts for direct use, such as spending verifiers, and to scripts
+representing modules stored on the chain.  A module may depend on
+imported modules, and so on transitively. An important question is
+when the identity of those modules is decided. In particular, if a
+module is replaced by a new version, perhaps fixing a bug, can
+*existing* code on the chain use the new version instead of the old?
+
+The design in this CIP supports both alternatives. Suppose a module
+`A` imports modules `B` and `C`. Then module `A` will be represented
+as the lambda-expression `λB.λC.A`. This can be compiled into a
+`CompleteScript` and placed on the chain, with en empty list of
+`ScriptArg`s, as a reference script in a UTxO, allowing it to be used
+with any implementations of `B` and `C`--the calling script must pass
+implementations of `B` and `C` to the lambda expression, and can
+choose them freely. We call this 'dynamic linking', because the
+implementation of dependencies may vary from use to use. On the other
+hand, if we want to *fix* the versions of `B` and `C` then we can
+create a `Script` that applies the same `CompleteScript` to two
+`ScriptArg`s, containing the hashes of the intended versions of `B`
+and `C`, which will then be supplied by
+`resolveScriptDependencies`. We call this 'static linking', because
+the version choice for the dependency is fixed by the script. It is up
+to script developers (or compiler writers) to decide between static
+and dynamic linking in this sense.
+
+On the other hand, when a script is used directly as a validator then
+there is no opportunity to supply additional arguments; all modules
+used must be supplied as `ScriptArg`s, which means they are
+fixed. This makes sense: it would be perverse if a transaction trying
+to spend a UTxO protected by a spending validator were allowed to
+replace some of the validation code--that would open a real can of
+worms, permitting many attacks whenever a script was split over
+several modules. With the design in the CIP, it is the script in the
+UTxO being spent that determines the module versions to be used, not
+the spending transaction. That transaction does need to *supply* all
+the modules actually used--including all of their dependencies--but it
+cannot choose to supply alternative implementations of them.
+
+### In-service upgrade
+
+Long-lived contracts may need upgrades and bug fixes during their
+lifetimes. This need is met on the Ethereum blockchain using the
+'proxy pattern'--a 'proxy' contract which delegates calls to the
+current implementation contract, whose identity is stored in the proxy
+contract's mutable state. Proxy contracts can provide a 'code upgrade'
+method which modifies the mutable state to store a new implementation
+contract.
+
+
+The Cardano chain does not offer mutable state. Instead, a changing
+state is represented by a succession of UTxOs, each holding the
+current state, usually with the currently-valid UTxO identified by
+holding a particular NFT. In the absence of mutable state a dependency
+cannot be updated just by changing a pointer, but scripts can still be
+upgraded by creating new values on the chain. The exact mechanism
+depends on the kind of script--and, often, on the original script
+developer preparing the ground for a later code change.
+
+Note that, on Ethereum, a proxy contract can be updated without
+changing its contract address---thanks to mutable state. On Cardano, a
+script address *is* the hash of its code; of course, changing the code
+will change the script address. It is very hard to see how that could
+possibly be changed without a fundamental redesign of Cardano. So the
+methods discussed below are different in nature from the Ethereum one:
+they upgrade dependencies in something by replacing it with a new one,
+with different dependencies. This is really just functional
+programming at work: data is always 'updated' by creating a new
+version with possibly different content. This does mean that script
+addresses are going to change when their dependencies do; there's no
+way around it.
+
+First consider shared modules, stored as reference scripts in
+UTxOs. The hash of a module depends on the hash of all its
+dependencies, so when a dependency changes, then a new version of the
+UTxO needs to be created with the new dependency, and its hash needs
+to be distributed (by off-chain means). To prevent accidental use of
+the old UTxO, it could be spent.
+
+UTxOs whose *spending verifier* needs upgrading can be spent and
+recreated with a new verifier, if the need has been anticipated by the
+script author. The verifier would need to accept a 'code change'
+redeemer, and then check that the transaction created a new UTxO
+protected by the new spending verifier. For example, the code change
+redeemer might provide the script hash of the new verifier, and the
+old verifier would then check that the new UTxO, with the same
+contents, was protected by that hash. This mechanism permits an
+arbitrary code change; of course this opens for attacks, so in
+practice such a verifier would need to check that the proposed code
+change was correctly authorised. How this is done is up to the
+contract concerned.
+
+Note that *currency symbols* in Cardano are just the hash of the
+minting policy--a script. Thus, updating a dependency of a minting
+policy means changing the currency symbol. We need to be able to
+convert tokens with the old currency to the new one. To allow this,
+the minting script must allow *burning* the old currency when the new
+one is being minted, and the new minting script must allow minting
+when the old currency is being burned, provided the code upgrade is
+correctly authorized. This is enough to allow wallets holding the old
+currency to replace it by the new one--a wallet can just submit a
+transaction that burns the old tokens and mints the new. If the
+currency is also to be stored in UTxOs protected by spending
+verifiers, then those verifiers must also accept 'currency upgrade'
+redeemers, and check that the UTxO is just being recreated with old
+tokens replaced by new ones--or alternatively, continue to use the
+old coins and exchange them when they reach a wallet. To facilitate
+this, transactions that require an input with these tokens should also
+accept old versions, along with authentication of the code upgrade.
+
+Staking validators are a much simpler case: they can be upgraded just by
+deregistering the state key registration certificate that refers to
+them, and then reregistering the same state key with a new staking
+validator.
+
+### Lazy loading
+
+The 'lazy loading' variation in the specification section above
+permits fewer modules to be supplied in a transaction.  Dependency
+trees have a tendency to grow very large; when one function in a
+module uses another module, it becomes a dependency of the entire
+module and not just of that function. It is easy to imagine situations
+in which a script depends on many modules, but a particular call
+requires only a few of them. For example, if a script offers a choice
+of protocols for redemption, only one of which is used in a particular
+call, then many modules may not actually be needed. The variation
+allows a transaction to omit the unused modules in such cases. This
+reduces the size of the transaction, which need provide fewer
+witnesses, but more importantly it reduces the amount of code which
+must be loaded from reference UTxOs.
+
+If a script execution *does* try to use a module which was not
+provided, it will encounter a run-time type error and fail (unless the
+module value was `builtin unit`, in which case the script will behave
+as though the module had been provided).
+
+To take advantage of this variation, it is necessary, when a
+transaction is constructed, to *observe* which script arguments are
+actually used by the script invocations needed to validate the
+transaction. The transaction balancer runs the scripts anyway, and so
+can in principle observe the uses of script arguments, and include
+witnesses in the transaction for just those arguments that are used.
+
+#### Balancer modifications
+
+To take advantage of 'lazy loading', it's necessary to identify
+reference scripts that are *dynamically* unused, when the scripts in a
+transaction run. The best place to do that is in a transaction
+balancer, which needs to run the scripts anyway, both to check that
+script validation succeeds, and to determine the number of execution
+units needed to run the scripts. We adopt the view that
+
+**A transaction balancer may drop reference inputs from a
+   transaction, if the resulting transaction still validates**
+
+We call reference scripts which are not actually invoked during script
+verification 'redundant'; these are the reference scripts that can be
+removed by the balancer.
+
+##### First approach: search
+
+The simplest way for a balancer to identify redundant reference inputs
+is to try rerunning the scripts with an input removed. If script
+validation still succeeds, then that input may safely be removed. The
+advantages of this approach are its simplicity, and lack of a need for
+changes anywhere else in the code. The disadvantage is that
+transaction balancing may become much more expensive--quadratic in the
+number of scripts, in the worst case.
+
+The reason for this is that removing one script may make others
+redundant too; for example if script A depends on script B, then
+script B may become redundant only after script A has been
+removed--simply evaluating script A may use the value of B, and
+scripts are evaluated when they are passed to other scripts, whether
+they are redundant or not. So if the balancer tries to remove B first,
+then script verification will fail--and so the balancer must try again
+to remove B after A has been shown to be redundant. Unless we exploit
+information on script dependencies, after one successful script
+removal then all the others must be revisited. Hence a quadratic
+complexity.
+
+In the case of 'value scripts' this argument does not apply:
+evaluating a script will never fail just because a different script is not
+present. In this case it would be sufficient to traverse all the
+scripts once, resulting in a linear number of transaction
+verifications.
+
+##### Second approach: garbage collection
+
+First the balancer analyses all the scripts and reference scripts in a
+transaction, and builds a script dependency dag (where a script
+depends on its `ScriptArg`s). Call the scripts which are invoked
+directly in the transaction (as validators of one sort or another) the
+*root* scripts.
+
+Topologically sort the scripts according to the dependency relation;
+scripts may depend on scripts later in the order, but not
+earlier. Now, traverse the topologically sorted scripts in order. This
+guarantees that removing a *later* script in the order does not cause
+an *earlier* one to become redundant.
+
+For each script, construct a modified dependency graph by removing the
+script concerned, and then 'garbage collecting'... removing all the
+scripts that are no longer reachable from a root. Construct a transaction
+including only the reference scripts remaining in the graph, and run
+script validation. If validation fails, restore the dependency graph
+before the modification. If validation succeeds, the script considered
+and all the 'garbage' scripts are redundant; continue using the now
+smaller dependency graph.
+
+When all scripts have been considered in this way, then the remaining
+dependency graph contains all the scripts which are dynamically needed
+in this transaction. These are the ones that should be included in the
+transaction, either directly or as reference scripts.
+
+The advantage of this approach is that only the code in the balancer
+needs to be changed. The disadvantage is that transaction balancing
+becomes more expensive: script verification may need to be rerun up to
+once per script or reference script. In comparison to the first
+approach above, this one is more complex to implement, but replaces a
+quadratic algorithm by a linear one.
+
+##### Third approach: modified CEK machine
+
+The most direct way to determine that a script is not redundant is to
+observe it being executed during script verification. Unfortunately,
+the CEK machine, in its present form, does not make that
+possible. Thus an alternative is to *modify the CEK machine* so that a
+balancer can observe scripts being executed, and declare all the other
+scripts redundant. In comparison to the first two approaches, this is
+likely to be much more efficient, because it only requires running
+script verification once.
+
+The modifications needed to the CEK machine are as follows:
+
+`CekValue`s are extended with *tagged values*, whose use can be
+observed in the result of a run of the machine.
+```
+data CekValue uni fun ann =
+  ...
+  | VTag ScriptHash (CekValue uni fun ann)
+```
+In the 'value script' variation, no expression resulting in a `VTag`
+value is needed, because `VTag`s will be inserted only by
+`resolveScriptDependencies`. In other variations, a `Tag` constructor
+must also be added to the `NTerm` type, to be added by
+`resolveScriptDependencies`. In either case the version of
+`runScriptDependencies` *used in the balancer* tags each value or
+subterm derived from a `ScriptHash` `h` as `VTag h ...` (or `Tag h
+...` in variations other than 'value scripts').
+
+The CEK machine is parameterized over an emitter function, used for
+logging. We can make use of this to emit `ScriptHash`es as they are
+used. This allows the balancer to observe which `ScriptHash`es *were*
+used.
+
+Simply evaluating a tagged value, or building it into a
+data-structure, does not *use* it in the sense we mean here: replacing
+such a value with `builtin unit` will not cause a validation
+failure. Only when such a value is actually *used* should we strip the
+tag, emit the `ScriptHash` in the `CekM` monad, and continue with the
+untagged value. This should be done in `returnCek`, on encountering a
+`FrameCases` context for a tagged value, and in `applyEvaluate` when
+the function to be applied turns out to be tagged, or when the
+argument to a `builtin` turns out to be tagged.
+
+Adding and removing tags must be assigned a zero cost *in the
+balancer*, since the intention is that they should not appear in
+transactions when they are verified on the chain. Thus a zero cost is
+required for the balancer to return accurate costs for script
+verification on the chain. On the other hand, if these operations *do*
+reach the chain, then they should have a *high* cost, to deter attacks
+in which a large number of tagging operations are used to keep a
+transaction verifier busy. This can be achieved by adding a `BTag`
+step kind to the CEK machine, a `cekTagCost` to the
+`CekMachineCostsBase` type, and modifying the balancer to set this
+cost to zero during script verification.
+
+The advantage of this approach is that it only requires running each
+script once in the balancer, thus reducing the cost of balancing a
+transaction, perhaps considerably. The disadvantage is that it
+requires extensive modifications to the CEK machine itself, a very
+critical part of the Plutus infrastructure.
+
+##### Fourth approach: lazy scripts
+
+Another way to observe script uses *without* modifying the CEK machine
+is to wrap them in `Delay` and force them at the point of use. The
+balancer can then insert trace output of the script hash just inside
+the `Delay`, and so observe which scripts are actually forced during
+script execution.
+
+The difficulties with this approach arise from the fact that delayed
+closures must be *explicitly* forced in UPLC; this does not 'just
+happen' when a delayed value is used. This means that corresponding
+`Force` operations must also be added to scripts, and the question is:
+who does this, and if it is to be done automatically, then how?
+
+One possibility is that it is the developer's responsibility to force
+script arguments at the point of use--that is, that the `Force`
+operations needed would be written by the human programmer. It follows
+that they would *always* be part of the script, even when running on
+the chain, and so even on the chain script arguments would need to be
+delayed (even if no trace output would be needed). This would increase
+code size a little, and impose a force-delay overhead on every
+cross-module reference, which is probably not acceptable.
+
+The alternative is to have the balancer insert corresponding `Force`
+operations, as well as the `Delay`s. A simple way to do so would be
+to add a `Force` around every use of a variable corresponding to a
+script argument--under the 'value scripts' syntactic restriction these
+variables are easy to identify. These modifications would not be made
+during normal script verification, which might therefore cost less--or
+more--than the modified balancer run. The balancer would thus need to
+perform script verification twice: once with `Delay` and
+`Force`inserted to determine redundant scripts, and then a second time
+(with redundant scripts removed) to determine the actual cost on the
+chain.
+
+The bigger problem with this approach, though, is that it will
+*overestimate* the set of used scripts, leading to more scripts being
+used in a transaction, and thus potentially exponentially more
+expensive transactions. The reason for the overestimation is that
+*all* occurrences of variables bound to script arguments are wrapped
+in `Force`, even those that would not lead to untagging the
+corresponding tagged value in the third approach above. For example,
+suppose a variable bound to a script argument is passed as a parameter
+to another function. With the simple `Force`-placement strategy
+described above, the script argument would be forced *at that call*,
+making the corresponding script appear to be used, even though the
+function it is passed to might not actually use it in all cases. Hence
+the set of scripts used would be overestimated.
+
+One might use a more sophisticated strategy to insert `Force`
+operations. For example, in the case described above one might pass
+the script argument *unforced* to the function, and modify the
+function to force it when it is used. This would require the balancer
+to perform a flow analysis, to identify the functions that might be
+passed a delayed script argument. Moreover, such functions might be
+called *sometimes* with a delayed script argument, and sometimes
+not. The code could be replicated to create two versions of such
+functions. But with *n* script arguments, this might require up to
+*2^n* versions of each function, leading to an exponential increase in
+code size. An attacker could exploit this to craft a transaction that
+would cause the balancer to run out of memory. This is really not
+attractive.
+
+Finally, one might finesse these problems by modifying the CEK machine
+to force delayed closures automatically where the value is required,
+thus enabling explicit `Force` operations to be omitted. This would
+effectively turn UPLC into a lazy programming language. That would
+enable this problem to be solved more easily, but  at the cost of
+reversing a rather fundamental design decision in UPLC--and probably
+making the CEK machine a little bit slower, for all programs.
+
+Thus it appears that there is no good way of using UPLC's existing
+lazy evaluation to observe use of script arguments.
+
+### Value Scripts
+
+This section discusses the 'value scripts' variation.
+
+The main specification in this CIP represents a `Script` that imports
+modules as compiled code applied to a list of `ScriptHash`es. To
+prepare such a script for running, `resolveScriptDependencies`
+replaces each hash by the term it refers to, and builds nested
+applications of the compiled code to the arguments. These applications
+must be evaluated by the CEK machine *before* the script proper begins
+to run. Moreover, each imported module is itself translated into such
+a nested application, which must be evaluated before the module is
+passed to the client script. In a large module hierarchy this might
+cause a considerable overhead before the script proper began to
+run. Worst of all, if a module is used *several times* in a module
+dependency tree, then it must be evaluated *each time* it is
+used. Resolving module dependencies traverses the entire dependency
+*tree*, which may be exponentially larger than the dependency *dag*.
+
+The value script variation addresses this problem head on. Scripts are
+converted directly into CEK-machine values that can be invoked at low
+cost. Each script is converted into a value only once, no matter how
+many times it is referred to, saving time and memory when modules
+appear several times in a module hierarchy.
+
+On the other hand it does restrict the syntactic form of
+scripts. Scripts are restricted to be syntactic lambda expressions,
+binding their script arguments at the top-level. This is not so
+onerous. But inside those λs, there must also be a syntactic
+value. For example, consider a module represented by a tuple, whose
+components represent the exports of the module. Then all of those exports
+need to be syntactic values--an exported value could not be computed
+at run-time, for example using an API exported by another
+module. While many module exports are functions, and so naturally
+written as λ-expressions (which are values), this restriction will be
+onerous at times.
+
+This method does require opening up the API of the CEK machine, so
+that CEK values can be constructed in other modules, and introducing a
+way to run the machine starting from a given configuration. So it
+requires more invasive changes to the code than the main
+specification.
+
+#### `ScriptHash` allowed in terms?
+
+An alternative design would allow UPLC terms to contain `ScriptHash`es
+directly, rather than as λ-abstracted variables, to be looked up in a
+global environment at run-time. This would address this same problem:
+the cost of performing many applications before script evaluation
+proper begins. It would also require changes to the CEK machine, and
+is not really likely to perform better than the 'value scripts'
+variation (in practice, the main difference is the use of a global
+environment to look up script hashes, as opposed to many per-module
+ones). However, this approach is less flexible because it does not
+support dynamic linking (see Static vs Dynamic Linking above). Once a
+`ScriptHash` is embedded in a term, then a different version of the
+script cannot readily be used instead.
+
+#### Module-level recursion
+
+This section discusses the `module-level recursion` subvariation of
+the `value scripts` variation.
+
+UPLC provides a fixpoint combinator, and this is how recursion is
+compiled. For the sake of argument, consider the well-known fixpoint
+combinator `Y` (in reality, `Y` is not suitable for use in a strict
+programming language, so the UPLC version is slightly different). We
+can imagine that a recursive function `f` is compiled as `Y h`, for
+some suitable `h`.
+
+The difficulty that arises is that `Y h` *is not a value*, and thus
+cannot appear at the top-level of a module, under the 'value script'
+restriction. It can be *normalised* into a value, of course, using
+```
+Y h ---> h (Y h)
+```
+and then reducing the application of `h`; this would need to be done
+by a compiler generating UPLC with the `value script`
+restriction. But reducing `h (Y h)` may well duplicate `Y h`. When
+this happens at CEK runtime it is not a problem, because all the
+occurrences of `Y h` are represented by the same pointer. But when the
+reductions are applied by a compiler, and the resulting term is
+serialized to UPLC code for inclusion in a script, then each
+occurrence of `Y h` will be serialized separately, losing sharing and
+causing code duplication in the resulting script. The result could be
+*larger* code, the opposite of what we are trying to achieve. Thus
+this method of compiling recursion fits badly with the 'value scripts'
+variation.
+
+Hence module-level recursion, which allows recursive occurrences of
+script values to be referred to via the `Self` variable instead of
+using a fixpoint combinator implemented in UPLC. To take advantage of
+this feature, the compiler will need to float occurrences of `fix`
+upwards, to the top-level of a module. This can be done using suitable
+analogues of the rules
+```
+(..,fix (λx.e),...) ---> fix (λx.(..,e[proj i x/x],..))
+```
+where `i` is the index in the tuple at which `fix (λx.e)` appears,
+`proj i x` selects the `i`th component from `x`, and `x` does not
+occur free elsewhere in the tuple; a corresponding rule for
+constructor applications; and
+```
+fix (λx. fix (λy.e)) ---> fix (λx. e[x/y])
+```
+Both these rules require adjusting deBruin numbers in the UPLC
+implementation.
+
+The intention here is to implement module-level recursion using a
+cyclic data-structure--the value restriction guarantees that the
+module value `Self` is not needed to compute the top-level value of
+the module, and thus there is no risk of falling into an infinite loop
+at this point. (Of course, a recursive function can loop *when it is
+called*, but constructing the function itself cannot loop because it
+must be a syntactic λ-expression). This is a *more efficient* way to
+implement recursion than the fixpoint combinators currently used in
+UPLC, and so will probably become the preferred way to implement
+recursion.
+
+#### Variation: Explicit lambdas
+
+This variation lifts some of the restrictions of the 'value scripts'
+approach, at the cost of running the CEK machine to evaluate each
+module, and taking care to compute and check costs correctly for the
+new CEK machine runs. This requires topological sorting of the scripts
+in a transaction before evaluation, to guarantee that we do not
+encounter a situation where script A depends on script B, but the
+budget for computing script B depends on the cost of script A--such a
+situation would lead to a blackhole error during script verification.
+
+Because script bodies may now be arbitrary terms, 'module-level
+recursion' is no longer essential--it is possible to use fixpoint
+combinators in script bodies as at present. It would still improve
+efficiency, of course.
+
+Note that if modules *do* meet the syntactic restrictions of 'value
+scripts', then this variation will be less efficient than 'value
+scripts'--sometimes considerably so. This is because even evaluating,
+say, a large tuple whose components are λ-expressions, leads the CEK
+machine to descend into, evaluate, and return out of, each component,
+thus performing several CEK transitions per element. The `cekValue`
+function must also visit each component, of course, doing the same
+work, but because this is done directly in Haskell then it will be
+considerably more efficient.
+
+This variation is compatible with the various tuple-based variations,
+but when the script body is constrained to return a tuple then this
+must be checked dynamically when CEK-evaluation is complete; the check
+cannot be built into deserialization any more because it is no longer
+syntactic.
+
+#### Variation: Tuples of modules
+
+This variation changes the way modules are referenced in scripts: in
+the main specification, each imported module is bound to a name in the
+environment, and referenced using the associated variable; in this
+variation *all* imported modules are bound to a single name, and
+modules are referenced by projecting the corresponding component from
+the tuple bound to this name.
+
+Thus: in the main specification, a module reference costs one name
+lookup; in this variation, a module reference costs a name lookup plus
+projection of a component from a tuple. However, because projecting a
+component from a tuple is constant time, while the cost of a
+name lookup is logarithmic in the number of names in the environment,
+then this variation may reduce the cost of module references--since
+scripts which import many modules will run with significantly fewer
+names in the environment.
+
+Note that the uncurried script form can be generated from the curried
+one, by
+* introducing a `λMods.` outermost,
+* removing the `λ`s binding names to script arguments,
+* substituting `proj i Mods` for the `i`th script argument name in the
+script body
+
+Thus there is no need for any change to earlier parts of the compiler,
+or to the languages Plutus, PIR, or TPLC. Tuples of modules can be
+introduced as a last step in the generation of UPLC.
+
+##### Subvariation: Global module environment
+
+The advantage of using a global module environment instead of one
+tuple of modules per script is that only one, big, tuple of modules
+per transaction need be constructed, instead of one per script. The
+cost is an additional traversal of the script code, needed to adjust
+module indices to refer to the correct index in the global tuple of
+modules. By itself, this is unlikely to improve performance.
+
+However, using a global module environment is a prerequisite to
+building the module environment into the CEK machine. Doing the latter
+transforms a module reference from a projection from the
+tuple-of-modules variable, to a custom construction `ModuleRef i` that
+directly accesses the module in the `i`th component of the global
+module environment. This reduces the cost from a variable lookup plus
+a projection, to just a projection; this can be expected to speed up every
+reference to an external module.
+
+##### Subvariation: Unboxed modules
+
+This subvariation makes every reference to a module export cheaper, by
+replacing two projections from a tuple by one. It does require
+preprocessing script code before it is run, updating export references
+to refer to the correct element of the large tuple combining several
+modules. This requires a traversal of all the script code in a
+transaction, which must be performed every time script verification is
+run, including on the chain. Because of this, it makes most sense to
+use this subvariation in combination with 'global module environment',
+which also requires such a traversal. In both cases, the purpose is to
+adjust references to refer to the correct index in the new, merged
+data structure; a single traversal suffices to achieve both ends.
+
+The syntactic restriction, requiring a module body to be a tuple of
+exports, is not onerous. While some compilers might wish to represent
+a module as built-in data, or as a function from a tag (as Solidity
+does), this can be achieved by placing the intended module value as
+the first component of a one-tuple. The implementation described here
+optimises away the selection of an element from such a tuple, so the
+restriction introduces no extra overhead in this case.
+
+### Transaction fees
+
+Imported modules are provided using reference scripts, an existing
+mechanism (see CIP-33), or in the transaction itself. Provided the
+cost of loading reference scripts is correctly accounted for, this CIP
+introduces no new problems.
+
+Note that there is now (since August 2024) a hard limit on the total
+size of reference scripts used in a transaction, and the transaction
+fee is exponential in the total script size (see
+[here](https://github.com/IntersectMBO/cardano-ledger/blob/master/docs/adr/2024-08-14_009-refscripts-fee-change.md)).The
+exponential fees provide a strong motivation to prefer the 'lazy
+loading' variation in this CIP: even a small reduction in the number
+of reference scripts that need to be provided may lead to a large
+reduction in transaction fees.
+
+The motivation for these fees is to deter DDoS attacks based on
+supplying very large Plutus scripts that are costly to deserialize,
+but run fast and so incur low execution unit fees. While these fees
+are likely to be reasonable for moderate use of the module system, in
+the longer term they could become prohibitive for more complex
+applications. It may be necessary to revisit this design decision in
+the future. To be successful, the DDoS defence just needs fees to
+become *sufficiently* expensive per byte as the total size of
+reference scripts grows; they do not need to grow without bound. So
+there is scope for rethinking here.
+
+Some of the variations in this CIP require a traversal of all the
+script code in a transaction to adjust module references before
+execution. This should be reflected by a component in the transaction
+fee linear in the total size of scripts.
+
+### Verification
+
+Since scripts invoked by a transaction specify all their dependencies
+as hashes, then the running code is completely known, and testing or
+formal verification is no harder than usual. Standalone verification
+of modules using 'dynamic linking' poses a problem, however, in that
+the code of the dependencies is unknown. This makes testing
+difficult--one would have to test with mock implementations of the
+dependencies--and formal verification would require formulating
+assumptions about the dependencies that the module can rely on, and
+later checking that the actual implementations used fulfill those
+assumptions.
+
+### Impact on optimisation and script performance
+
+Splitting a script into separately compiled parts risks losing
+optimisation opportunities that whole-program compilation gives. Note
+that script arguments are known in advance, and so potentially some
+cross-module optimisation may be possible, but imported modules are
+shared subterms between many scripts, and they cannot be modified when
+the client script is compiled. Moreover, unrestrained inlining across
+module boundaries could result in larger script sizes, and defeat the
+purpose of breaking the code into modules in the first place.
+
+On the other hand, since the size limit on scripts will be less of a
+problem, then compilers may be able to optimize *more*
+aggressively. For example, today the Plinth inliner is very careful
+not to increase script size, but once modules are available it may be
+able to inline more often, which can enable further optimizations.
+
+Moreover, today we see examples of deliberate choice of worse
+algorithms, because their code is smaller, and easier to fit within
+the script size limit. Removing the need to make such choices can
+potentially improve performance considerably.
+
+### Example: Defining and using a Set module
+
+As an example of how the module system might be used in a high-level
+language, consider the following code, which defines and uses a module
+implementing set insertion and membership testing, using an ordered
+binary tree.
+```
+data Tree a = Leaf | Branch (Tree a) a (Tree a)
+
+empTree = Leaf
+
+insTree a Leaf = Branch Leaf a Leaf
+insTree a (Branch l b r)
+  | a < b = Branch (insTree a l) b r
+  | a > b = Branch l b (insTree a r)
+  | a== b = Branch l b r
+
+memTree a Leaf = False
+memTree a (Branch l b r)
+  | a < b = memTree a l
+  | a > b = memTree a r
+  | a== b = True
+
+data Set = Set {emptySet :: forall a. Tree a,
+     	        insertSet :: forall a. Ord a => a -> Tree a -> Tree a,
+		memberSet :: forall a. Ord a => a -> Tree a -> Bool}
+
+setMod = Set empTree insTree memTree
+
+setModule :: Module Set
+setModule = makeModule ($$(PlutusTx.compile [| setMod |]))
+
+client set redeemer _ = memberSet set redeemer s
+  where s = insertSet set 1 (insertSet set 2 (emptySet set))
+
+clientModule = makeModule ($$(PlutusTx.compile [| client |]))
+ `applyModule` setModule
+```
+Here the module signature is represented by a Haskell record type;
+Haskell records are compiled into tuples in UPLC, and the record
+fields are all values (once fixpoints are floated upwards to the
+module level), so the `setModule` in this example fits the 'unboxed
+modules' syntactic restrictions. The client script takes the record as
+an argument, and uses the module exports via record field selectors,
+which compile to projections from the tuple. Thus the client also
+meets the syntactic restrictions for 'unboxed modules'. To make use
+of these modules, the off-chain code must construct a UTxO
+containing `setModule` as a reference script, and include it as a
+reference UTxO in transactions that use the client.
+
+### Related work
+
+#### Merkelized Validators
+
+Philip DiSarro describes ["Merkelized
+validators"](https://github.com/Anastasia-Labs/design-patterns/blob/main/merkelized-validators/merkelized-validators.md),
+which offer a way of offloading individual function calls to stake
+validators: the client script just checks that the appropriate stake
+validator is invoked with a particular function-argument and result
+pair, checks that the argument equals the argument it wants to call
+the function with, and then uses the result as the result of the
+function. The stake validator inspects the argument-result pair,
+computes the function for the given argument, and checks that the
+result equals the result in the pair. This design pattern enables the
+logic of a script to be split between the client script and the stake
+validator, thus circumventing the limits on script size. But the main
+point is that the function call, whose result may be needed by several
+validators, can be computed just *once* per transaction. More details
+can be found
+[here](https://github.com/Anastasia-Labs/design-patterns/blob/main/stake-validator/STAKE-VALIDATOR-TRICK.md).
+
+Factoring out a shared part of the validation in this way is a
+generally useful technique which is largely independent of the
+existence of modules--this CIP does not remove the need for sharing
+work between validators, and indeed this trick will work equally well
+once modules are added. But as a way of *implementing* modules, it is
+rather intricate and unsatisfactory.
+
+#### The WebAssembly Component Model
+
+The [Web Assembly Component
+Model](https://component-model.bytecodealliance.org/) defines a
+high-level IDL to enable components written in different programming
+languages (such as C/C++, C#, Go, Python and Rust), to work together
+in one WebAssembly system. WASM already has a module system, and a
+WASM component may consist of a number of modules (written in the same
+language). The focus here is on interaction between *different*
+programming languages in a well-typed way. Defining such an IDL for
+Cardano might be useful in the future, but it is too early to do so
+now.
+
+### Preferred Options
+
+Allowing script code to be spread across many transactions lifts a
+commonly complained-about restriction faced by Cardano script
+developers. It permits more complex applications, and a much heavier
+use of libraries to raise the level of abstraction for script
+developers. Modules are already available on the Ethereum blockchain,
+and quite heavily used. Adopting this CIP, in one of its variations,
+will make Cardano considerably more competitive against other smart
+contract platforms.
+
+The *main alternative* in this CIP is the simplest design, is easiest
+to implement, but suffers from several inefficiencies.
+
+The *lazy loading* variation allows redundant scripts to be omitted
+from transactions, potentially making transactions exponentially
+cheaper. To take full advantage of it requires a balancer that can
+drop redundant scripts from transactions. Three alternative methods
+are described: *search*, the simplest, which must run script
+verification a quadratic number of times in the number of scripts in
+the worst case; *garbage collection*, a self-contained change to the
+balancer which analyses script dependencies and thus needs to run script
+verification only a linear number of times; a *modified CEK machine*
+which adds tagged values to the machine, which the balancer can use to
+identify redundant scripts in *one* run of script verification,
+possibly requiring one more run to make accurate exunit cost estimates.
+
+The *value scripts* variation restricts scripts to be explicit
+λ-expressions binding the script arguments, with an innermost script
+body which is a syntactic value. Such scripts can be converted to CEK
+values in a single traversal; each script can be converted to a value
+*once per transaction*, rather than at every use. *Module-level
+recursion* enables recursive definitions to recurse via the module
+itself, rather than locally, and makes the syntactic-value restriction
+easier to satisfy. This variation is expected to reduce the start-up
+costs of running each script considerably; on the down-side the
+syntactic restriction would be a little annoying, and it requires CEK
+operations which are not currently part of the API, so it requires
+modifications to a critical component of the Plutus implementation.
+
+The *explicit λs* variation is a half-way house between the main
+variation and the 'value scripts' variation. It places less onerous
+syntactic restrictions on script bodies, and as such can be used with
+the existing implementation of recursion (although efficiency would
+still benefit from module-level recursion). Cost accounting during
+script evaluation is a little intricate. It requires modifications to
+the loop at the core of the CEK machine.
+
+The *tuples of modules* variation replaces parameters referring to
+individual modules with a single parameter bound to a tuple of
+modules, effectively uncurrying scripts wrt their module
+parameters. At the cost of a traversal of all the script code in a
+transaction to 'relocate' module references, it is possible to replace
+many tuples-of-modules, one per script, by a global tuple of modules
+for the entire transaction; a further improvement would then be to
+unbox modules, replacing the global tuple of modules with a global
+tuple of module exports. These variations reduce the cost of referring
+to a module export, at the cost of an additional traversal of the
+script code before execution. Extensive benchmarking would be needed
+to decide whether or not they improve performance overall.
+
+Performance can probably be improved further by building the module
+environment in to the CEK machine. However, as this involves a pervasive
+change to the monad underlying the CEK machine itself, it would
+require recalibration of all the execution unit costs for CEK machine
+steps.
+
+The simplest alternative to implement would be the main alternative
+without variations. A more efficient implementation would combine
+value scripts with lazy loading, using tagged values in the CEK
+machine to analyse dynamic script dependencies in the balancer, and so
+drop redundant scripts from each transaction. Further improvements to
+performance may be achievable using a global module environment, and
+unboxed modules; because there are performance costs as well as
+benefits to these approaches, extensive benchmarking would be required
+to make an informed choice.
+
+These latter variations all require modifications to the CEK machine
+and to the balancer, as well as resolving dependencies in scripts;
+that is, they are considerable more expensive to implement.
+
+## Path to Active
+
+### Acceptance criteria
+
+- [ ] determine which approach outlined in this CIP will be selected
+- [ ] `plutus` changes
+- [ ] `cardano-ledger` changes
+- [ ] `cardano-api` changes
+- [ ] benchmarking and testing
+- [ ] integrate the feature into `cardano-node`
+- [ ] end-to-end testing
+- [ ] release at the hard fork introducing the Dijkstra era
+
+### Implementation Plan
+
+Here we use the term "script" to refer to either a validator script (which needs to be run to validate a transaction) and a module script (which serves as a dependency for other scripts).
+Both validators and modules can reference other modules.
+
+The feature proposed in this CIP can only be released in a new ledger era.
+As such, it is anticipated that it will be released alongside the next ledger era - the Dijkstra era.
+
+Whether this feature can be used in existing Plutus ledger language versions (V1 through V3) depends on which of the options described in subsection _Plutus Ledger Language Versions_ (i.e., tagged or untagged modules) is chosen.
+If tagged modules are adopted, the feature will be available across all Plutus language versions (V1 through V4) starting at the hard fork that introduces the Dijkstra era.
+If untagged modules are adopted, it will only be usable in Plutus V4, as explained in the subsection.
+
+The bulk of the implementation effort lies on the Plutus side, including updates to `plutus-ledger-api`, updates to the CEK machine, costing and benchmarking, among others.
+The specifics will depend on which of the various alternatives outlined in this CIP is selected.
+The Plutus team aims to complete the implementation of the selected approach according to its specification, in time for the Dijkstra era.
+
+On the ledger and cardano-api side, the effort required to support this feature is not as substantial as it may appear to be.
+This is because the ledger already supports reference inputs and reference scripts since the Babbage era, and this existing mechanism can largely be reused to accommodate module scripts.
+The processes of storing a module script in a UTXO and using it in a transaction are similar to storing and using a reference script.
+
+The main difference between reference scripts and module scripts is that a module script is, like an object file, not directly runnable but must be linked with a validator to form a runnable script.
+To support this, the ledger and cardano-api will need to implement some changes.
+The specifics will slightly vary depending on which of the alternative approaches is chosen, but it will generally involve the following.
+
+Currently, deserialising a script returns a `ScriptForEvaluation`, which contains a deserialised script, along with the original serialised script. The ledger has a `PlutusRunnable` newtype that wraps `ScriptForEvaluation`.
+With the introduction of modules, deserialising a script no longer produces a runnable script unless it is a self-contained validator that doesn't use modules.
+Otherwise, the module hashes it references must be resolved and the modules linked before the validator can be executed.
+
+To do so, the `plutus-ledger-api` package can implement one of two options, depending on which is more suitable for the ledger:
+- Script deserialisation will be modified to return a new data type, `ScriptForLinking`.
+  It is similar to `ScriptForEvaluation` except that the deserialized script is not necessarily a self-contained script and may be accompanied by a list of module hashes it needs.
+
+  Then, a function `linkValidator :: Map ScriptHash ScriptForLinking -> ScriptHash -> LinkedScript` is provided that performs linking for a particular validator identified by `ScriptHash`, where `LinkedScript ~ UPLC.Program DeBruijn DefaultUni DefaultFun ()` is a fully linked script.
+- Alternatively, the following function can be provided: `linkScripts :: Map ScriptHash SerialisedScript -> Map ScriptHash LinkedScript`, which performs deserialisation and linking for all (validator and module) scripts in one go.
+
+In either case, the ledger should ensure that each script (including validator script and module script) is deserialised and processed no more than once.
+
+Moreover, for the transaction builder to decide which modules a validator refers to are used at runtime, `plutus-ledger-api` will also expose the following function:
+
+```haskell
+getUsedModules ::
+  MajorProtocolVersion ->
+  EvaluationContext ->
+  -- | All scripts provided in a transaction
+  Map ScriptHash SerialisedScript ->
+  -- | Hash of the validator
+  ScriptHash ->
+  -- | Script arguments
+  [Data] ->
+  -- | Hashes of used module scripts
+  Set ScriptHash
+```
+
+The value type of the `Map` could instead be `ScriptForLinking` (i.e., deserialised script) rather than `SerialisedScript`.
+
+This function is to be called by the code building transactions (e.g., `Cardano.Api.Fees.makeTransactionBodyAutoBalance`) to determine which modules are necessary to include in a transaction.
+
+## Categories
+
+This is a Plutus CIP.
+
+As a Plutus CIP, it leaves UPLC, TPLC, PIR and Plinth *unchanged*
+except for the addition of an API for building and using modules in
+Plinth. In the case of the 'modified CEK machine' alternative for
+'balancer modifications', without 'value scripts', it requires *adding
+a construct* to UPLC to tag values so that their use can be observed;
+this is a *minor* change which is backwards-compatible, but it would
+require a new PlutusCore language version.
+
+Building the module environment in to the CEK machine requires *adding
+a construct* to UPLC to refer to components of the module environment;
+this is a *minor* change which is backwards-compatible, but it would
+also require a new PlutusCore language version.
+
+Because this CIP changes the representation of scripts, it requires a
+new Plutus Core ledger language, and can only be introduced at a hard
+fork.
+
+As far as the Ledger is concerned, the representation of a script is
+just `bytes`. This does not change. Therefore there are no changes to
+the Ledger, and thus this is not a Ledger CIP,
+
+## Acknowledgements
+
+This CIP draws heavily on a design by Michael Peyton Jones, and has
+benefitted greatly from discussion with Ziyang Liu, Roman Kireev, and
+Phil Wadler.
+
+## Copyright
+This CIP is licensed under [CC-BY-4.0]](https://creativecommons.org/licenses/by/4.0/legalcode).
+
+---
+
+[^1]: At present, a newer ledger language version may have access to more builtin functions and more Plutus Core versions than an older ledger language version, but this difference is going away.