FITSHeaders is a pure Julia package for managing basic FITS
structures such as FITS headers. FITS (for Flexible Image Transport
System) is a data file format widely used
in astronomy. A FITS file is a concatenation of Header Data Units (HDUs) that consist in
a header part and a data part. The header of a HDU is a collection of so-called FITS
cards. Each such card is stored in textual form and associates a keyword with a value
and/or a comment.
The FITSHeaders package is intended to provide:
-
Methods for fast parsing of a FITS header or of a piece of a FITS header (that is a single FITS header card).
-
An expressive API for creating FITS cards and accessing their components (keyword, value, and comment), possibly, in a type-stable way.
-
Methods for easy and efficient access to the records of a FITS header.
A FITS header card associates a keyword (or a name) with a value and a comment (both
optional). A FITS header card can be efficiently stored as an instance of FitsCard built
by:
card = FitsCard(key => (val, com))with key the card name, val its value, and com its comment. The value
val may be:
- a boolean to yield a card of type
FITS_LOGICAL; - an integer to yield a card of type
FITS_INTEGER; - a non-integer real to yield a card of type
FITS_FLOAT; - a complex to yield a card of type
FITS_COMPLEX; - a string to yield a card of type
FITS_STRING; nothingto yield a card of typeFITS_COMMENT;undeformissingto yield a card of typeFITS_UNDEFINED.
The comment may be omitted for a normal FITS card and the value may be omitted for a commentary FITS card:
card = FitsCard(key => val::Number)
card = FitsCard(key => str::AbstractString)In the 1st case, the comment is assumed to be empty. In the 2nd case, the string str is
assumed to be the card comment if key is "COMMENT" or "HISTORY" and the card value
otherwise.
Conversely, Pair(card) yields the pair key => (val, com). The convert method is
extended by the FITSHeaders package to perform these conversions.
If the string value of a FITS card is too long, it shall be split across several
consecutive CONTINUE cards when writing a FITS file. Likewise, if the comment of a
commentary keyword ("COMMENT" or "HISTORY") is too long, it shall be split across
several consecutive cards with the same keyword when writing a FITS file.
FITS cards have the following properties (among others):
card.type # type of card: FITS_LOGICAL, FITS_INTEGER, etc.
card.key # quick key of card: Fits"BITPIX", Fits"HIERARCH", etc.
card.name # name of card
card.value # callable object representing the card value
card.comment # comment of card
card.units # units of card value
card.unitless # comment of card without the units part if anyAs the values of FITS keywords have different types, card.value does not yield a Julia
value but a callable object. Called without any argument, this object yields the actual
card value:
card.value() -> val::Union{Bool,Int64,Float64,ComplexF64,String,Nothing,UndefInitializer}but such a call is not type-stable as indicated by the union Union{...} in the above
type assertion. For a type-stable result, the card value can be converted to a given data
type T:
card.value(T)
convert(T, card.value)both yield the value of card converted to type T. For readability, T may be an
abstract type: card.value(Integer) yields the same result as card.value(Int64),
card.value(Real) or card.value(AbstractFloat) yield the same result as
card.value(Float64), card.value(Complex) yields the same result as
card.value(ComplexF64), and card.value(AbstractString) yields the same result as
card.value(String).
To make things easier, a few additional properties are aliases that yield the card value converted to a specific type:
card.logical :: Bool # alias for card.value(Bool)
card.integer :: Int64 # alias for card.value(Integer)
card.float :: Float64 # alias for card.value(Real)
card.complex :: ComplexF64 # alias for card.value(Complex)
card.string :: String # alias for card.value(String)When the actual card value is of a different type than the one requested, an error is thrown if the conversion is not possible or inexact.
valtype(card) yields the Julia type of the value of card while isassigned(card)
yields whether card has a value (that is whether it is neither a commentary card nor a
card with an undefined value).
There are two kinds of FITS keywords:
-
Short FITS keywords are words with at most 8 ASCII characters from the restricted set of upper case letters (bytes 0x41 to 0x5A), decimal digits (hexadecimal codes 0x30 to 0x39), hyphen (hexadecimal code 0x2D), or underscore (hexadecimal code 0x5F). In a FITS file, keywords shorter than 8 characters are right-padded with ordinary spaces (hexadecimal code 0x20).
-
HIERARCHFITS keywords start with the string"HIERARCH "(with a single trailing space) followed by one or more words composed from the same restricted set of ASCII characters as short keywords and separated by a single space.
Keywords longer than 8 characters or composed of several words can only be represented as
HIERARCH FITS keywords. To simplify the representation of FITS cards as pairs, the
FitsCard constructor automatically converts long keywords or multi-word keywords into a
HIERARCH FITS keyword by prefixing the keyword with the string "HIERARCH " for
example:
julia> card = FitsCard("VERY-LONG-NAME" => (2, "keyword is longer than 8 characters"))
FitsCard: HIERARCH VERY-LONG-NAME = 2 / keyword is longer than 8 characters
julia> card.name
"HIERARCH VERY-LONG-NAME"
julia> FitsCard("SOME KEY" => (3, "keyword has 8 characters but 2 words"))
FitsCard: HIERARCH SOME KEY = 3 / keyword has 8 characters but 2 words
julia> card.name
"HIERARCH SOME KEY"This rule is only applied to the construction of FITS cards from pairs. When parsing a
FITS header card from a file, the "HIERARCH " prefix must be present.
The non-exported method FITSHeaders.keyword may be used to apply this rule:
julia> FITSHeaders.keyword("VERY-LONG-NAME")
"HIERARCH VERY-LONG-NAME"
julia> FITSHeaders.keyword("SOME KEY")
"HIERARCH SOME KEY"
julia> FITSHeaders.keyword("NAME")
"NAME"
julia> FITSHeaders.keyword("HIERARCH NAME")
"HIERARCH NAME"In FITSHeaders, a key of type FitsKey is a 64-bit value computed from a FITS keyword.
The key of a short FITS keyword is unique and exactly matches the first 8 bytes of the
keyword as it is stored in a FITS file. Thus quick keys provide fast means to compare and
search FITS keywords. The constructor FitsKey(name) yields the quick key of the string
name. A quick key may be literally expressed by using the @Fits_str macro in Julia
code. For example:
card.key == Fits"NAXIS"is faster than, say card.name == "NAXIS", to check whether the name of the FITS header
card card is "NAXIS". This is because, the comparison is performed on a single integer
(not on several characters) and expression Fits"...." is a constant computed at compile
time with no run-time penalty. Compared to FitsKey(name), Fits"...." checks the
validity of the characters composing the literal short keyword (again this is done at
compile time so without run-time penalty) and, for readability, does not allow for
trailing spaces.
For a HIERARCH keyword, the quick key is equal to the constant Fits"HIERARCH" whatever
the other part of the keyword.
Each FITS header card is stored in a FITS file as 80 consecutive bytes from the restricted
set of ASCII characters from ' ' to '~' (hexadecimal codes 0x20 to 0x7E). Hence Julia
strings (whether they are encoded in ASCII or in UTF8) can be treated as vectors of bytes.
The parsing methods provided by the FITSHeaders package exploit this to deal with FITS
headers and cards stored as either vectors of bytes (of type AbstractVector{UInt8}) or
as Julia strings (of type String or SubString{String}).
A FitsCard object can be built by parsing a FITS header card as it is stored in a FITS
file:
card = FitsCard(buf; offset=0)where buf is either a string or a vector of bytes. Keyword offset can be used to
specify the number of bytes to skip at the beginning of buf, so that it is possible to
extract a specific FITS header card, not just the first one. At most, the 80 first bytes
after the offset are scanned to build the FitsCard object. The next FITS card to parse
is then at offset + 80 and so on.
The considered card may be shorter than 80 bytes, the result being exactly the same as if
the missing bytes were spaces. If there are no bytes left, a FitsCard object equivalent
to the final END card of a FITS header is returned.
The FITSHeaders package provides objects of type FitsHeader to store, possibly
partial, FITS headers.
To build a FITS header initialized with records args.., call:
hdr = FitsHeader(args...)where args... is a variable number of records in any form allowed by the FitsCard
constructor, it can also be a vector or a tuple of records. For example:
dims = (384, 288)
hdr = FitsHeader("SIMPLE" => true,
"BITPIX" => (-32, "32-bit floats"),
"NAXIS" => (length(dims), "number of dimensions"),
ntuple(i -> "NAXIS$i" => dims[i], length(dims))...,
"COMMENT" => "A comment.",
"COMMENT" => "Another comment.",
"DATE" => ("2023-02-01", "1st of February, 2023"),
"COMMENT" => "Yet another comment.")Method keys can be applied to get the list of keywords in a FITS header:
julia> keys(hdr)
KeySet for a Dict{String, Int64} with 7 entries. Keys:
"COMMENT"
"BITPIX"
"SIMPLE"
"NAXIS2"
"NAXIS1"
"NAXIS"
"DATE"A FITS header object behaves as a vector of FitsCard elements with integer or keyword
(string) indices. When indexed by keywords, a FITS header object is similar to a
dictionary except that the order of records is preserved and that commentary and
continuation records (with keywords "COMMENT", "HISTORY", "", or "CONTINUE") may
appear more than once.
An integer (linear) index i or a string index key can be used to retrieve a given
record:
hdr[i] # i-th record
hdr[key] # first record whose name matches `key`For example (with hdr as built above):
julia> hdr[2]
FitsCard: BITPIX = -32 / 32-bit floats
julia> hdr["NAXIS"]
FitsCard: NAXIS = 2 / number of dimensions
julia> hdr["COMMENT"]
FitsCard: COMMENT A comment.Note that, when indexing by name, the first matching record is returned. This may be a
concern for non-unique keywords as in the last above example. All matching records can be
collected into a vector of FitsCard elements by:
collect(key, hdr) # all records whose name matches `key`For example:
julia> collect("COMMENT", hdr)
3-element Vector{FitsCard}:
FitsCard: COMMENT A comment.
FitsCard: COMMENT Another comment.
FitsCard: COMMENT Yet another comment.
julia> collect(rec -> startswith(rec.name, "NAXIS"), hdr)
3-element Vector{FitsCard}:
FitsCard: NAXIS = 2 / number of dimensions
FitsCard: NAXIS1 = 384
FitsCard: NAXIS2 = 288
julia> collect(r"^NAXIS[0-9]+$", hdr)
2-element Array{FitsCard,1}:
FitsCard("NAXIS1" => 384)
FitsCard("NAXIS2" => 288)This behavior is different from that of filter which yields another FITS header
instance:
julia> filter(rec -> startswith(rec.name, "NAXIS"), hdr)
3-element FitsHeader:
FitsCard: NAXIS = 2 / number of dimensions
FitsCard: NAXIS1 = 384
FitsCard: NAXIS2 = 288For more control, searching for the index i of an existing record in FITS header object
hdr can be done by the usual methods:
findfirst(what, hdr)
findlast(what, hdr)
findnext(what, hdr, start)
findprev(what, hdr, start)which all return a valid integer index if a record matching what is found and nothing
otherwise. The matching pattern what can be a keyword (string), a FITS card (an instance
of FitsCard whose name is used as a matching pattern), a regular expression, or a
predicate function which takes a FITS card argument and shall return whether it matches.
The find methods just yield nothing for any unsupported kind of pattern.
The eachmatch method is a simple mean to iterate over matching records:
eachmatch(what, hdr)yields an iterator over the records of hdr matching what. For example:
@inbounds for rec in eachmatch(what, hdr)
... # do something
endis a shortcut for:
i = findfirst(what, hdr)
@inbounds while i !== nothing
rec = hdr[i]
... # do something
i = findnext(what, hdr, i+1)
endwhile:
@inbounds for rec in reverse(eachmatch(what, hdr))
... # do something
endis equivalent to:
i = findlast(what, hdr)
@inbounds while i !== nothing
rec = hdr[i]
... # do something
i = findprev(what, hdr, i-1)
endIf it is not certain that a record exists or to avoid throwing a KeyError exception, use
the get method. For example:
julia> get(hdr, "BITPIX", nothing)
FitsCard: BITPIX = -32 / 32-bit floats
julia> get(hdr, "GIZMO", missing)
missingA record rec may be pushed to a FITS header hdr to modify the header:
push!(hdr, rec)where rec may have any form allowed by the FitsCard constructor. If the keyword is
"COMMENT", "HISTORY", "", or "CONTINUE", rec is appended to the end of the list
of records stored by hdr. For other keywords which must be unique, if a record of the
same name exists in hdr, it is replaced by rec; otherwise, it is appended to the end
of the list of records stored by hdr.
The setindex! method may be used with a keyword (string) index. For example, the two
following statements are equivalent:
hdr[key] = (val, com)
push!(hdr, key => (val, com))The setindex! method may also be used with a linear (integer) index. For example:
hdr[i] = recreplaces the i-th record in hdr by rec. With an integer index, the rule for unique /
non-unique keywords is not applied, so this indexing should be restricted to editing the
value and/or comment of an existing entry. The following example illustrates how this can
be used to modify the comment of the BITPIX record:
julia> if (i = findfirst("BITPIX", hdr)) != nothing
hdr[i] = ("BITPIX" => (hdr[i].value(), "A better comment."))
end
"BITPIX" => (-32, "A better comment.")FITSHeaders is ought to be fast. Below are times and memory allocations for parsing
80-byte FITS cards measured with Julia 1.8.5 on a Linux laptop with an Intel Core i7-5500U
CPU:
- parsing logical FITS card: 114.588 ns (2 allocations: 64 bytes)
- parsing integer FITS card: 118.519 ns (2 allocations: 72 bytes)
- parsing HIERARCH FITS card: 142.462 ns (2 allocations: 88 bytes)
- parsing float FITS card: 274.119 ns (4 allocations: 152 bytes)
- parsing complex FITS card: 424.060 ns (6 allocations: 248 bytes)
- parsing string FITS card: 155.694 ns (4 allocations: 144 bytes)
- parsing string with quotes: 169.223 ns (4 allocations: 168 bytes)
- parsing COMMENT FITS card: 90.615 ns (2 allocations: 112 bytes)
- parsing HISTORY FITS card: 100.591 ns (2 allocations: 72 bytes)
- parsing blank FITS card: 78.261 ns (0 allocations: 0 bytes)
- parsing END FITS card: 82.286 ns (0 allocations: 0 bytes)
The benchmark code is in file test/benchmarks.jl. The HIERARCH
card has an integer value. The float and complex valued cards take more time to parse
because parsing a floating-point value is more complex than parsing, say, an integer and
because the string storing the floating-point value must be copied to replace letters d
and D, allowed in FITS standard to indicate the exponent, by an e.
For comparison, just extracting the keyword, value, and comment parts from a 80-characters
FITS card by calling the functions fits_get_keyname and fits_parse_value of CFITSIO
library takes about 150 ns on the same machine. This does not includes the allocation of
the buffers to store these 3 parts (about 120 ns for this) and the parsing of the value
which are all included in the timings of the FitsCard constructor above.