Skip to content
Snippets Groups Projects
guix.texi 288 KiB
Newer Older
  • Learn to ignore specific revisions
  • package that needs to be patched, and then to ``graft'' it onto packages
    explicitly installed by the user and that were previously referring to
    the original package.  The cost of grafting is typically very low, and
    order of magnitudes lower than a full rebuild of the dependency chain.
    
    @cindex replacements of packages, for grafts
    For instance, suppose a security update needs to be applied to Bash.
    Guix developers will provide a package definition for the ``fixed''
    Bash, say @var{bash-fixed}, in the usual way (@pxref{Defining
    Packages}).  Then, the original package definition is augmented with a
    @code{replacement} field pointing to the package containing the bug fix:
    
    @example
    (define bash
      (package
        (name "bash")
        ;; @dots{}
        (replacement bash-fixed)))
    @end example
    
    From there on, any package depending directly or indirectly on Bash that
    is installed will automatically be ``rewritten'' to refer to
    @var{bash-fixed} instead of @var{bash}.  This grafting process takes
    time proportional to the size of the package, but expect less than a
    minute for an ``average'' package on a recent machine.
    
    Currently, the graft and the package it replaces (@var{bash-fixed} and
    @var{bash} in the example above) must have the exact same @code{name}
    and @code{version} fields.  This restriction mostly comes from the fact
    that grafting works by patching files, including binary files, directly.
    Other restrictions may apply: for instance, when adding a graft to a
    package providing a shared library, the original shared library and its
    replacement must have the same @code{SONAME} and be binary-compatible.
    
    
    
    @node Package Modules
    @section Package Modules
    
    From a programming viewpoint, the package definitions of the
    GNU distribution are provided by Guile modules in the @code{(gnu packages
    @dots{})} name space@footnote{Note that packages under the @code{(gnu
    packages @dots{})} module name space are not necessarily ``GNU
    packages''.  This module naming scheme follows the usual Guile module
    naming convention: @code{gnu} means that these modules are distributed
    as part of the GNU system, and @code{packages} identifies modules that
    define packages.}  (@pxref{Modules, Guile modules,, guile, GNU Guile
    Reference Manual}).  For instance, the @code{(gnu packages emacs)}
    module exports a variable named @code{emacs}, which is bound to a
    @code{<package>} object (@pxref{Defining Packages}).
    
    The @code{(gnu packages @dots{})} module name space is
    
    automatically scanned for packages by the command-line tools.  For
    instance, when running @code{guix package -i emacs}, all the @code{(gnu
    packages @dots{})} modules are scanned until one that exports a package
    object whose name is @code{emacs} is found.  This package search
    facility is implemented in the @code{(gnu packages)} module.
    
    @cindex customization, of packages
    
    @cindex package module search path
    
    Users can store package definitions in modules with different
    
    names---e.g., @code{(my-packages emacs)}@footnote{Note that the file
    
    name and module name must match.  For instance, the @code{(my-packages
    emacs)} module must be stored in a @file{my-packages/emacs.scm} file
    relative to the load path specified with @option{--load-path} or
    @code{GUIX_PACKAGE_PATH}.  @xref{Modules and the File System,,,
    guile, GNU Guile Reference Manual}, for details.}.  These package definitions
    
    will not be visible by default.  Thus, users can invoke commands such as
    @command{guix package} and @command{guix build} have to be used with the
    
    @code{-e} option so that they know where to find the package.  Better
    yet, they can use the
    
    @code{-L} option of these commands to make those modules visible
    
    (@pxref{Invoking guix build, @code{--load-path}}), or define the
    @code{GUIX_PACKAGE_PATH} environment variable.  This environment
    variable makes it easy to extend or customize the distribution and is
    honored by all the user interfaces.
    
    @defvr {Environment Variable} GUIX_PACKAGE_PATH
    This is a colon-separated list of directories to search for package
    modules.  Directories listed in this variable take precedence over the
    distribution's own modules.
    @end defvr
    
    The distribution is fully @dfn{bootstrapped} and @dfn{self-contained}:
    each package is built based solely on other packages in the
    distribution.  The root of this dependency graph is a small set of
    @dfn{bootstrap binaries}, provided by the @code{(gnu packages
    bootstrap)} module.  For more information on bootstrapping,
    
    @pxref{Bootstrapping}.
    
    @node Packaging Guidelines
    @section Packaging Guidelines
    
    The GNU distribution is nascent and may well lack some of your favorite
    packages.  This section describes how you can help make the distribution
    grow.  @xref{Contributing}, for additional information on how you can
    help.
    
    Free software packages are usually distributed in the form of
    @dfn{source code tarballs}---typically @file{tar.gz} files that contain
    all the source files.  Adding a package to the distribution means
    essentially two things: adding a @dfn{recipe} that describes how to
    build the package, including a list of other packages required to build
    it, and adding @dfn{package meta-data} along with that recipe, such as a
    description and licensing information.
    
    In Guix all this information is embodied in @dfn{package definitions}.
    Package definitions provide a high-level view of the package.  They are
    written using the syntax of the Scheme programming language; in fact,
    for each package we define a variable bound to the package definition,
    and export that variable from a module (@pxref{Package Modules}).
    However, in-depth Scheme knowledge is @emph{not} a prerequisite for
    creating packages.  For more information on package definitions,
    
    @pxref{Defining Packages}.
    
    Once a package definition is in place, stored in a file in the Guix
    source tree, it can be tested using the @command{guix build} command
    (@pxref{Invoking guix build}).  For example, assuming the new package is
    
    called @code{gnew}, you may run this command from the Guix build tree
    (@pxref{Running Guix Before It Is Installed}):
    
    ./pre-inst-env guix build gnew --keep-failed
    
    Using @code{--keep-failed} makes it easier to debug build failures since
    it provides access to the failed build tree.  Another useful
    command-line option when debugging is @code{--log-file}, to access the
    build log.
    
    If the package is unknown to the @command{guix} command, it may be that
    the source file contains a syntax error, or lacks a @code{define-public}
    clause to export the package variable.  To figure it out, you may load
    the module from Guile to get more information about the actual error:
    
    @example
    ./pre-inst-env guile -c '(use-modules (gnu packages gnew))'
    @end example
    
    Once your package builds correctly, please send us a patch
    (@pxref{Contributing}).  Well, if you need help, we will be happy to
    help you too.  Once the patch is committed in the Guix repository, the
    new package automatically gets built on the supported platforms by
    
    Ludovic Courtès's avatar
    Ludovic Courtès committed
    @url{http://hydra.gnu.org/jobset/gnu/master, our continuous integration
    
    @cindex substituter
    Users can obtain the new package definition simply by running
    @command{guix pull} (@pxref{Invoking guix pull}).  When
    @code{hydra.gnu.org} is done building the package, installing the
    package automatically downloads binaries from there
    (@pxref{Substitutes}).  The only place where human intervention is
    needed is to review and apply the patch.
    
    * Software Freedom::            What may go into the distribution.
    * Package Naming::              What's in a name?
    * Version Numbers::             When the name is not enough.
    * Python Modules::              Taming the snake.
    * Perl Modules::                Little pearls.
    * Fonts::                       Fond of fonts.
    
    @node Software Freedom
    @subsection Software Freedom
    
    @c Adapted from http://www.gnu.org/philosophy/philosophy.html.
    
    The GNU operating system has been developed so that users can have
    freedom in their computing.  GNU is @dfn{free software}, meaning that
    users have the @url{http://www.gnu.org/philosophy/free-sw.html,four
    essential freedoms}: to run the program, to study and change the program
    in source code form, to redistribute exact copies, and to distribute
    modified versions.  Packages found in the GNU distribution provide only
    software that conveys these four freedoms.
    
    In addition, the GNU distribution follow the
    @url{http://www.gnu.org/distros/free-system-distribution-guidelines.html,free
    software distribution guidelines}.  Among other things, these guidelines
    reject non-free firmware, recommendations of non-free software, and
    discuss ways to deal with trademarks and patents.
    
    Some packages contain a small and optional subset that violates the
    above guidelines, for instance because this subset is itself non-free
    code.  When that happens, the offending items are removed with
    appropriate patches or code snippets in the package definition's
    @code{origin} form (@pxref{Defining Packages}).  That way, @code{guix
    build --source} returns the ``freed'' source rather than the unmodified
    upstream source.
    
    @node Package Naming
    @subsection Package Naming
    
    A package has actually two names associated with it:
    First, there is the name of the @emph{Scheme variable}, the one following
    @code{define-public}.  By this name, the package can be made known in the
    Scheme code, for instance as input to another package.  Second, there is
    the string in the @code{name} field of a package definition.  This name
    is used by package management commands such as
    @command{guix package} and @command{guix build}.
    
    Both are usually the same and correspond to the lowercase conversion of
    the project name chosen upstream, with underscores replaced with
    hyphens.  For instance, GNUnet is available as @code{gnunet}, and
    SDL_net as @code{sdl-net}.
    
    We do not add @code{lib} prefixes for library packages, unless these are
    
    already part of the official project name.  But @pxref{Python
    
    Modules} and @ref{Perl Modules} for special rules concerning modules for
    the Python and Perl languages.
    
    Font package names are handled differently, @pxref{Fonts}.
    
    @node Version Numbers
    @subsection Version Numbers
    
    We usually package only the latest version of a given free software
    project.  But sometimes, for instance for incompatible library versions,
    two (or more) versions of the same package are needed.  These require
    different Scheme variable names.  We use the name as defined
    in @ref{Package Naming}
    for the most recent version; previous versions use the same name, suffixed
    by @code{-} and the smallest prefix of the version number that may
    distinguish the two versions.
    
    The name inside the package definition is the same for all versions of a
    package and does not contain any version number.
    
    For instance, the versions 2.24.20 and 3.9.12 of GTK+ may be packaged as follows:
    
    @example
    (define-public gtk+
      (package
    
        (name "gtk+")
        (version "3.9.12")
        ...))
    
    (define-public gtk+-2
      (package
    
        (name "gtk+")
        (version "2.24.20")
        ...))
    
    @end example
    If we also wanted GTK+ 3.8.2, this would be packaged as
    @example
    (define-public gtk+-3.8
      (package
    
        (name "gtk+")
        (version "3.8.2")
        ...))
    
    @node Python Modules
    @subsection Python Modules
    
    We currently package Python 2 and Python 3, under the Scheme variable names
    @code{python-2} and @code{python} as explained in @ref{Version Numbers}.
    To avoid confusion and naming clashes with other programming languages, it
    seems desirable that the name of a package for a Python module contains
    the word @code{python}.
    
    Some modules are compatible with only one version of Python, others with both.
    If the package Foo compiles only with Python 3, we name it
    @code{python-foo}; if it compiles only with Python 2, we name it
    @code{python2-foo}. If it is compatible with both versions, we create two
    packages with the corresponding names.
    
    If a project already contains the word @code{python}, we drop this;
    for instance, the module python-dateutil is packaged under the names
    @code{python-dateutil} and @code{python2-dateutil}.
    
    @node Perl Modules
    @subsection Perl Modules
    
    Perl programs standing for themselves are named as any other package,
    using the lowercase upstream name.
    For Perl packages containing a single class, we use the lowercase class name,
    replace all occurrences of @code{::} by dashes and prepend the prefix
    @code{perl-}.
    So the class @code{XML::Parser} becomes @code{perl-xml-parser}.
    Modules containing several classes keep their lowercase upstream name and
    are also prepended by @code{perl-}.  Such modules tend to have the word
    @code{perl} somewhere in their name, which gets dropped in favor of the
    prefix.  For instance, @code{libwww-perl} becomes @code{perl-libwww}.
    
    @node Fonts
    @subsection Fonts
    
    For fonts that are in general not installed by a user for typesetting
    purposes, or that are distributed as part of a larger software package,
    we rely on the general packaging rules for software; for instance, this
    applies to the fonts delivered as part of the X.Org system or fonts that
    are part of TeX Live.
    
    To make it easier for a user to search for fonts, names for other packages
    containing only fonts are constructed as follows, independently of the
    upstream package name.
    
    The name of a package containing only one font family starts with
    @code{font-}; it is followed by the foundry name and a dash @code{-}
    if the foundry is known, and the font family name, in which spaces are
    replaced by dashes (and as usual, all upper case letters are transformed
    to lower case).
    For example, the Gentium font family by SIL is packaged under the name
    @code{font-sil-gentium}.
    
    For a package containing several font families, the name of the collection
    is used in the place of the font family name.
    For instance, the Liberation fonts consist of three families,
    Liberation Sans, Liberation Serif and Liberation Mono.
    These could be packaged separately under the names
    @code{font-liberation-sans} and so on; but as they are distributed together
    under a common name, we prefer to package them together as
    @code{font-liberation}.
    
    In the case where several formats of the same font family or font collection
    are packaged separately, a short form of the format, prepended by a dash,
    is added to the package name.  We use @code{-ttf} for TrueType fonts,
    
    @code{-otf} for OpenType fonts and @code{-type1} for PostScript Type 1
    
    @node Bootstrapping
    @section Bootstrapping
    
    @c Adapted from the ELS 2013 paper.
    
    @cindex bootstrapping
    
    Bootstrapping in our context refers to how the distribution gets built
    ``from nothing''.  Remember that the build environment of a derivation
    contains nothing but its declared inputs (@pxref{Introduction}).  So
    there's an obvious chicken-and-egg problem: how does the first package
    get built?  How does the first compiler get compiled?  Note that this is
    a question of interest only to the curious hacker, not to the regular
    user, so you can shamelessly skip this section if you consider yourself
    a ``regular user''.
    
    @cindex bootstrap binaries
    The GNU system is primarily made of C code, with libc at its core.  The
    GNU build system itself assumes the availability of a Bourne shell and
    command-line tools provided by GNU Coreutils, Awk, Findutils, `sed', and
    `grep'.  Furthermore, build programs---programs that run
    @code{./configure}, @code{make}, etc.---are written in Guile Scheme
    (@pxref{Derivations}).  Consequently, to be able to build anything at
    all, from scratch, Guix relies on pre-built binaries of Guile, GCC,
    Binutils, libc, and the other packages mentioned above---the
    @dfn{bootstrap binaries}.
    
    These bootstrap binaries are ``taken for granted'', though we can also
    re-create them if needed (more on that later).
    
    @unnumberedsubsec Preparing to Use the Bootstrap Binaries
    
    @c As of Emacs 24.3, Info-mode displays the image, but since it's a
    @c large image, it's hard to scroll.  Oh well.
    @image{images/bootstrap-graph,6in,,Dependency graph of the early bootstrap derivations}
    
    The figure above shows the very beginning of the dependency graph of the
    distribution, corresponding to the package definitions of the @code{(gnu
    packages bootstrap)} module.  At this level of detail, things are
    slightly complex.  First, Guile itself consists of an ELF executable,
    along with many source and compiled Scheme files that are dynamically
    loaded when it runs.  This gets stored in the @file{guile-2.0.7.tar.xz}
    tarball shown in this graph.  This tarball is part of Guix's ``source''
    distribution, and gets inserted into the store with @code{add-to-store}
    (@pxref{The Store}).
    
    But how do we write a derivation that unpacks this tarball and adds it
    to the store?  To solve this problem, the @code{guile-bootstrap-2.0.drv}
    derivation---the first one that gets built---uses @code{bash} as its
    builder, which runs @code{build-bootstrap-guile.sh}, which in turn calls
    @code{tar} to unpack the tarball.  Thus, @file{bash}, @file{tar},
    @file{xz}, and @file{mkdir} are statically-linked binaries, also part of
    the Guix source distribution, whose sole purpose is to allow the Guile
    tarball to be unpacked.
    
    Once @code{guile-bootstrap-2.0.drv} is built, we have a functioning
    Guile that can be used to run subsequent build programs.  Its first task
    is to download tarballs containing the other pre-built binaries---this
    is what the @code{.tar.xz.drv} derivations do.  Guix modules such as
    @code{ftp-client.scm} are used for this purpose.  The
    @code{module-import.drv} derivations import those modules in a directory
    in the store, using the original layout.  The
    @code{module-import-compiled.drv} derivations compile those modules, and
    write them in an output directory with the right layout.  This
    corresponds to the @code{#:modules} argument of
    @code{build-expression->derivation} (@pxref{Derivations}).
    
    Finally, the various tarballs are unpacked by the
    derivations @code{gcc-bootstrap-0.drv}, @code{glibc-bootstrap-0.drv},
    etc., at which point we have a working C tool chain.
    
    @unnumberedsubsec Building the Build Tools
    
    @c TODO: Add a package-level dependency graph generated from (gnu
    @c packages base).
    
    Bootstrapping is complete when we have a full tool chain that does not
    depend on the pre-built bootstrap tools discussed above.  This
    no-dependency requirement is verified by checking whether the files of
    the final tool chain contain references to the @file{/gnu/store}
    directories of the bootstrap inputs.  The process that leads to this
    ``final'' tool chain is described by the package definitions found in
    
    the @code{(gnu packages commencement)} module.
    
    @c See <http://lists.gnu.org/archive/html/gnu-system-discuss/2012-10/msg00000.html>.
    The first tool that gets built with the bootstrap binaries is
    GNU Make, which is a prerequisite for all the following packages.
    From there Findutils and Diffutils get built.
    
    Then come the first-stage Binutils and GCC, built as pseudo cross
    tools---i.e., with @code{--target} equal to @code{--host}.  They are
    used to build libc.  Thanks to this cross-build trick, this libc is
    guaranteed not to hold any reference to the initial tool chain.
    
    From there the final Binutils and GCC are built.  GCC uses @code{ld}
    from the final Binutils, and links programs against the just-built libc.
    This tool chain is used to build the other packages used by Guix and by
    the GNU Build System: Guile, Bash, Coreutils, etc.
    
    And voilà!  At this point we have the complete set of build tools that
    the GNU Build System expects.  These are in the @code{%final-inputs}
    
    variable of the @code{(gnu packages commencement)} module, and are
    implicitly used by any package that uses @code{gnu-build-system}
    
    (@pxref{Build Systems, @code{gnu-build-system}}).
    
    @unnumberedsubsec Building the Bootstrap Binaries
    
    Because the final tool chain does not depend on the bootstrap binaries,
    those rarely need to be updated.  Nevertheless, it is useful to have an
    automated way to produce them, should an update occur, and this is what
    the @code{(gnu packages make-bootstrap)} module provides.
    
    The following command builds the tarballs containing the bootstrap
    binaries (Guile, Binutils, GCC, libc, and a tarball containing a mixture
    of Coreutils and other basic command-line tools):
    
    @example
    guix build bootstrap-tarballs
    @end example
    
    The generated tarballs are those that should be referred to in the
    @code{(gnu packages bootstrap)} module mentioned at the beginning of
    this section.
    
    Still here?  Then perhaps by now you've started to wonder: when do we
    reach a fixed point?  That is an interesting question!  The answer is
    unknown, but if you would like to investigate further (and have
    significant computational and storage resources to do so), then let us
    know.
    
    @node Porting
    @section Porting to a New Platform
    
    As discussed above, the GNU distribution is self-contained, and
    self-containment is achieved by relying on pre-built ``bootstrap
    binaries'' (@pxref{Bootstrapping}).  These binaries are specific to an
    operating system kernel, CPU architecture, and application binary
    interface (ABI).  Thus, to port the distribution to a platform that is
    not yet supported, one must build those bootstrap binaries, and update
    the @code{(gnu packages bootstrap)} module to use them on that platform.
    
    Fortunately, Guix can @emph{cross compile} those bootstrap binaries.
    When everything goes well, and assuming the GNU tool chain supports the
    target platform, this can be as simple as running a command like this
    one:
    
    @example
    guix build --target=armv5tel-linux-gnueabi bootstrap-tarballs
    @end example
    
    
    For this to work, the @code{glibc-dynamic-linker} procedure in
    @code{(gnu packages bootstrap)} must be augmented to return the right
    file name for libc's dynamic linker on that platform; likewise,
    @code{system->linux-architecture} in @code{(gnu packages linux)} must be
    taught about the new platform.
    
    
    Once these are built, the @code{(gnu packages bootstrap)} module needs
    
    to be updated to refer to these binaries on the target platform.  That
    is, the hashes and URLs of the bootstrap tarballs for the new platform
    must be added alongside those of the currently supported platforms.  The
    bootstrap Guile tarball is treated specially: it is expected to be
    available locally, and @file{gnu-system.am} has rules do download it for
    the supported architectures; a rule for the new platform must be added
    as well.
    
    
    In practice, there may be some complications.  First, it may be that the
    extended GNU triplet that specifies an ABI (like the @code{eabi} suffix
    above) is not recognized by all the GNU tools.  Typically, glibc
    recognizes some of these, whereas GCC uses an extra @code{--with-abi}
    configure flag (see @code{gcc.scm} for examples of how to handle this).
    Second, some of the required packages could fail to build for that
    platform.  Lastly, the generated binaries could be broken for some
    reason.
    
    @c *********************************************************************
    
    @include contributing.texi
    
    @c *********************************************************************
    @node Acknowledgments
    @chapter Acknowledgments
    
    Guix is based on the Nix package manager, which was designed and
    
    implemented by Eelco Dolstra, with contributions from other people (see
    the @file{nix/AUTHORS} file in Guix.)  Nix pioneered functional package
    
    management, and promoted unprecedented features, such as transactional
    package upgrades and rollbacks, per-user profiles, and referentially
    transparent build processes.  Without this work, Guix would not exist.
    
    The Nix-based software distributions, Nixpkgs and NixOS, have also been
    an inspiration for Guix.
    
    
    GNU@tie{}Guix itself is a collective work with contributions from a
    number of people.  See the @file{AUTHORS} file in Guix for more
    information on these fine people.  The @file{THANKS} file lists people
    who have helped by reporting bugs, taking care of the infrastructure,
    providing artwork and themes, making suggestions, and more---thank you!
    
    
    
    @c *********************************************************************
    @node GNU Free Documentation License
    @appendix GNU Free Documentation License
    
    @include fdl-1.3.texi
    
    @c *********************************************************************
    @node Concept Index
    @unnumbered Concept Index
    @printindex cp
    
    
    @node Programming Index
    @unnumbered Programming Index
    @syncodeindex tp fn
    @syncodeindex vr fn
    
    @printindex fn
    
    @bye
    
    @c Local Variables:
    @c ispell-local-dictionary: "american";
    @c End: