Conrad Parker

Iteratees at Tsuru Capital

noreply@blogger.com (Anonymous) — Sun, 18 Sep 2011 09:10:00 +0000

Tsuru Capital is a small company. We build our internal systems for live trading and offline analysis in Haskell, and we're proud to be sponsoring ICFP 2011. We use iteratees throughout our systems, and have actively encouraged all our staff to contribute changes upstream and participate in community design discussions. By being part of the open source community and taking part in peer-review, we all end up with better software.

Over time various Tsuru staff members have worked on tools using iteratees, including (grepping the CONTRIBUTORS files): Bryan Buecking, Michael Baikov, Elliott Pace, Conrad Parker, Akio Takano, and Maciej Wos. There have been some lively discussions and many small patches providing functions that we use in production every day.

Last year Conal Elliott provided some mentoring to Tsuru staff, during which we worked through a denotational semantics for iteratees. This resulted in discussions on both the iteratee project list and haskell-cafe about Semantics of iteratees, enumerators, enumeratees.

By using iteratees in production we've contributed various simple but practical functions, including:

enumFdFollow, an enumerator (data source) which allows you to process the growing tail of a log file as it is being written.
ioIter, an iteratee that uses an IO action to determine what to do. Typically this action involves some user interaction, such as a user issuing commands like play/pause/next/prev.
ListLike functions last (an iteratee that efficiently returns the last element of a stream), mapM_ and foldM.
mapChunksM_, a more efficient version of mapM_ that operates on the underlying chunks, eg. logger = mapChunksM_ (liftIO . print).
takeWhile, and its enumeratee variant takeWhileE

endianRead8, an iteratee for reading 64-bit values with a given endianness. I've used this in ght as well as an internal project.
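To make the chunked style concrete, here is a minimal sketch of an iteratee that returns the last element of a stream. It is a toy model, not the iteratee package: the names Iter, enumChunks, run and lastI are made up for illustration, and the real library's types are richer (monadic, with error and control messages).

```haskell
-- Toy model only; this is NOT the API of the iteratee package.
data Iter a r
  = Done r                        -- finished, with a result
  | More (Maybe [a] -> Iter a r)  -- wants a chunk; Nothing signals end of stream

-- Feed a list of chunks to an iteratee, then signal end of stream.
enumChunks :: [[a]] -> Iter a r -> Iter a r
enumChunks _      (Done r) = Done r
enumChunks []     (More k) = k Nothing
enumChunks (c:cs) (More k) = enumChunks cs (k (Just c))

-- Extract the result, if the iteratee has finished.
run :: Iter a r -> Maybe r
run (Done r) = Just r
run (More _) = Nothing

-- Return the last element of the stream, remembering only one element
-- per chunk instead of accumulating the whole stream.
lastI :: Iter a (Maybe a)
lastI = go Nothing
  where
    go acc = More step
      where
        step Nothing   = Done acc            -- end of stream: report what we have
        step (Just []) = go acc              -- empty chunk: nothing new
        step (Just c)  = go (Just (last c))  -- keep only the chunk's final element
```

Loaded into ghci, run (enumChunks [[1,2],[],[3,4]] lastI) evaluates to Just (Just 4). The point of the design is that the data source (enumChunks) and the processing function (lastI) know nothing about each other beyond the chunk protocol.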

Stream conversion

We've done quite a bit of work on stream conversion, as we use a few different layers of data processing. The iteratee architecture allows you to isolate the data source, conversion and processing functions; much of what we've worked on involves ensuring the converters (enumeratees) can control or translate control messages, so that commands like "seek" do not get lost. We've also built combinators to simplify the task of creating new stream converters.

convStateStream, which converts one stream into another while continually updating an internal state. Importantly for variable bitrate binary data, it can produce elements of the output stream from data that spans stream chunks.
(><>) and (<><). These allow stream converters to be composed without rewriting boilerplate. John Lato gives a good example using these in the StackOverflow answer to Attoparsec Iteratee.
zip, zip[345], sequence_ for using multiple iteratees to process a single stream instance, and (for zip*) collecting the results.
eneeCheckIfDone*: This family of functions (eneeCheckIfDoneHandle, eneeCheckIfDonePass, eneeCheckIfDoneIgnore) can be used with unfoldConvStreamCheck to make a version of unfoldConvStream which respects seek messages.

Parallel stream processing

We often want to do multiple unrelated analysis tasks on a data stream. Whereas sequence_ takes a list of iteratees to run simultaneously and handles each input chunk by mapM across that list, psequence_ runs each input iteratee in a separate forkIO thread. For a real-world example, see Michael Baikov's post about psequence, psequence_, parE, parI.
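The thread-per-task idea can be sketched without iteratees at all. The following is only an illustration under simplifying assumptions: parallelAnalyses is a made-up name, each analysis is a plain function over the whole input list rather than an iteratee fed chunk by chunk, and there is no error handling (if an analysis throws, the corresponding takeMVar never gets its result).

```haskell
import Control.Concurrent (forkIO)
import Control.Concurrent.MVar (newEmptyMVar, putMVar, takeMVar)

-- Run every analysis over the same input, each in its own forkIO thread,
-- and collect the results in the order the analyses were given.
-- Hypothetical helper for illustration; not part of the iteratee package.
parallelAnalyses :: [[a] -> r] -> [a] -> IO [r]
parallelAnalyses analyses input = do
  vars <- mapM spawn analyses
  mapM takeMVar vars
  where
    spawn analysis = do
      var <- newEmptyMVar
      -- ($!) forces the result to weak head normal form inside the worker
      -- thread, so the work is not deferred to whoever reads the MVar.
      _ <- forkIO (putMVar var $! analysis input)
      return var
```

For example, parallelAnalyses [sum, length, maximum] [3, 1, 2] runs three workers over the same list. To actually use several cores, the program would need to be built with GHC's threaded runtime.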

Thanks

Thanks to John Lato for consistently and reliably maintaining the iteratee package, providing thoughtful feedback and graciously suggesting improvements.

A Haskell template for GTK, Glade, Cairo apps

noreply@blogger.com (Anonymous) — Fri, 30 Jul 2010 08:36:00 +0000

I just uploaded cairo-appbase to Hackage. This is a template for building new GUI applications using GTK, Glade and Cairo.

To install it:


$ cabal update
$ cabal install gtk2hs-buildtools
$ cabal install cairo-appbase

Then, run cairo-appbase:

The GTK widget layout is done via a Glade XML file which can be edited visually using glade. This template includes working callbacks to handle the File and Help menus and File Save/Open dialogs, with dummy handlers for selecting filenames and the Edit menu's cut/copy/paste. The main canvas uses Cairo for graphics rendering, and includes example code from the cairo package.

To build your own application on top of this, first grab the code. You can either grab it from hackage with cabal unpack cairo-appbase, or clone the git repo:

git clone git://github.com/kfish/cairo-appbase.git

To add widgets, install glade from your distribution and run glade data/main.glade. Note that you must run cabal install to put the glade file in the correct place for your application to pick it up. To modify the code, edit src/cairo-appbase.hs. Hooking up functions to widgets is very simple: get a widget by name (which you set in the glade file), and hook one of its signals (which you can find in the Signals tab in glade) to an IO () action:


  cut1 <- get G.castToMenuItem "cut1"
  G.onActivateLeaf cut1 $ myCut

The template code includes a trivial definition of myCut:


  myCut :: IO ()
  myCut = putStrLn "Cut"

A real application will want to pass data to the callback. In C, this is fairly tedious as you only have a single void * to pass to callbacks as "user_data", and applications typically do lots of marshalling and unmarshalling to pass data around. In Haskell however, you can make yourself a more complex callback handler and use a curried version of it in each instance:


  cut1 <- get G.castToMenuItem "cut1"
  G.onActivateLeaf cut1 $ myComplexCut project phase 7

  ...

  myComplexCut :: Project -> MoonPhase -> LuckyNumber -> IO ()
  myComplexCut project phase num = do
      let selection = currentSelection project
      when (phase == Full) howl
      when (num /= 7) $ fail "unlucky number"
      doActualCut selection

Erik de Castro Lopo discussed currying at length in his April 2006 post, GTK+ Callbacks in OCaml. The Haskell GTK+ bindings have been around a long time, but were only recently cabalized and uploaded to Hackage. I put together cairo-appbase in August 2006 when I was playing with it, but now that I have more time for Haskell I've updated it and uploaded it to Hackage. Enjoy, and hack away!

Speeding up cross-compiling with ccache and distcc on Debian

noreply@blogger.com (Anonymous) — Tue, 15 Jun 2010 00:00:00 +0000

The conventional way of doing embedded development is to cross-compile everything then copy it onto the target, but working natively allows you to use "normal" tools and workflows. We want to issue commands directly to a shell on the development board or phone prototype, and speed up the compilation step by distributing it to a faster machine such as your workstation. This isn't the usual way to do things, but I like working this way, and here's how to make it work faster.

This article explains how to configure a Debian PC host and a Debian target system so that development done on the target invokes the cross-compiler on the host. The advantage offered by this approach is a speed-up of compile times. Note that this does not speed up other aspects of building, such as source configuration (which can be slow for packages using GNU autotools), linking or installation.

We assume that a full Debian system is available for development on the target: packages can be built natively using gcc and a full toolchain (binutils, ld etc.), and tools such as automake, autoconf, libtool, version control systems etc. are available.

The setup we work with uses Debian on both the host PC and the target. The examples use debian-sh4 on the target, with the sh4-linux-gnu-gcc cross compiler installed on the build host. For other target architectures, simply replace all instances of sh4-linux-gnu- with the arch prefix, eg. arm-linux-gnueabi-.

In this article, commands executed natively on the target device use the prompt target$ (target# when run as root), and commands executed on the x86 build host use the prompt host$ (host# when run as root).

The first step is to ensure you can build software natively on the target. For GCC:


target$ gcc hello.c -o hello

and for autotools projects:


target$ ./configure
target$ make

ccache

Next, install ccache:


target# apt-get install ccache

ccache keeps a cache of compiled object files, such that the same compilation does not need to be repeated. This cache exists outside of your source tree, so it persists across invocations of 'make clean'. It compares the pre-processed source files, so that compilation of a source file will happen if it or any of its included headers is changed. The usual way to use ccache is to simply set your C compiler to be "ccache gcc".


target$ ccache gcc hello.c -o hello

and for autotools projects:


target$ CC="ccache gcc" ./configure
target$ make

Debian also sets things up so that if you put /usr/lib/ccache ahead of /usr/bin in your PATH, it will get used for native builds whenever gcc is invoked. That is useful to set up, but not necessary for this setup with distcc.

An aside about compiler naming

Before we move on to cross compiling, it's important to realize that the native compiler is also available with its full architecture prefix:


target$ ls -l /usr/bin/sh4-linux-gnu-gcc
lrwxrwxrwx 1 root root 7 Mar 17 01:45 /usr/bin/sh4-linux-gnu-gcc -> gcc-4.4

The binary called sh4-linux-gnu-gcc does the same thing on both the host and target: you can simply think of it as a program that takes in a C file and produces an sh4 binary:


                +-------------------+ 
    C source -> | sh4-linux-gnu-gcc | -> sh4 binary
                +-------------------+

The distinction between "native" and "cross-" compiling is then just a matter of what machine you are running this compiler program on. If you run sh4-linux-gnu-gcc on an x86 machine, you are cross-compiling, but if you run sh4-linux-gnu-gcc on an sh4 machine then you are just compiling. Of course the compiler binaries are different; the point is that a shell script which calls the compiler by its full name would work without modification on either machine.

distcc

distcc allows you to use a compiler running on a different, faster machine. This involves running a server (distccd) there, and it is far easier to set up than it would seem.

First, ensure that we can cross-compile on the build host:


host$ sh4-linux-gnu-gcc hello.c -o hello
host$ file hello
hello: ELF 32-bit LSB executable, Renesas SH, version 1 (SYSV), dynamically linked (uses shared libs), for GNU/Linux 2.6.18, not stripped

Next, we install distcc on the build host:


host# apt-get install distcc

To activate the server and tell it what clients to allow, edit /etc/default/distcc:


STARTDISTCC="true"
ALLOWEDNETS="127.0.0.1 10.0.0.0/16"

and restart it:


host# /etc/init.d/distcc restart

You can check that it is running:


host# netstat -pant | grep distcc
tcp        0      0 10.0.0.1:3632           0.0.0.0:*               LISTEN      16142/distccd

So that we can ensure that compilation is running on the host, watch this log file in a separate window:


host# tail -f /var/log/distccd.log

Then, on the client (ie. the target system) we also install distcc:


target# apt-get install distcc

We do not need to modify the distcc configuration on the target as it will not be running the server, so Debian's defaults are fine. However, we do need to set an environment variable to specify which machine[s] to compile on.


target$ export DISTCC_HOSTS='host'

You run distcc in a similar manner to ccache, by simply setting your C compiler. Note that we are only distributing compilation, not linking, so we just run the compilation step:


target$ distcc sh4-linux-gnu-gcc -c hello.c

This should turn up in the host's distcc logs:


host# tail -f /var/log/distccd.log
distccd[16390] (dcc_job_summary) client: 10.0.1.103:45983 COMPILE_OK exit:0 sig:0 core:0 ret:0 time:46ms sh4-linux-gnu-gcc hello.c

And back on the target, we have the hello.o file which was generated by the sh4-linux-gnu-gcc cross-compiler on the build host:


target$ ls -l *.o
total 16
-rw-r--r-- 1 conrad conrad  884 Jun 11 07:28 hello.o
target$ file hello.o
hello.o: ELF 32-bit LSB relocatable, Renesas SH, version 1 MathCoPro/FPU/MAU Required (SYSV), not stripped

The C file was transferred over the network to the host, where distccd invoked the cross-compiler and then sent the results back to the target. The end result is the same as if sh4-linux-gnu-gcc had been run directly on the target, but we avoided using the slower CPU of the target system.

To fully take advantage of distcc, you can run distccd on multiple build hosts, and specify all their names in the DISTCC_HOSTS environment variable on the target. Then use eg. "make -j 10" to run multiple compiles in parallel, which will each then get farmed out to different build hosts.

Combining ccache and distcc

~~You can quite simply put these two tools together, by calling:~~


target$ ccache distcc sh4-linux-gnu-gcc -c hello.c

You can quite simply put these two tools together, by setting CCACHE_PREFIX to "distcc" before calling ccache:


target$ export CCACHE_PREFIX="distcc"
target$ ccache sh4-linux-gnu-gcc -c hello.c

(Thanks to Joel Rosdahl for the correction).

The first time we run this the code is cross-compiled on the build host and sent back to the target, and ccache keeps track of that. The second time we run this, ccache notices that it already has a stored copy of the output hello.o, and decides to use that rather than calling the compiler. (From ccache's point of view, the compiler is "distcc sh4-linux-gnu-gcc").

For autotools projects, you can simply do the following before calling ./configure:


target$ export CCACHE_PREFIX="distcc"
target$ export CC="ccache sh4-linux-gnu-gcc"

After which the ./configure step will write Makefiles which specify to compile with ccache, so the rest of your build (ie. make -j 10) just works as normal without any new settings or any other change to your workflow.

For more discussion of combining distcc with ccache, see the distcc(1) man page.

Summary

By combining both ccache and distcc we can:

avoid redundant compilations, and
distribute required compilations to a faster build host.

The result is faster build times, which speeds up your development cycle and allows you to work more efficiently on the target system itself.

Monday Music: Heyoo by Kobi

noreply@blogger.com (Anonymous) — Mon, 24 May 2010 00:04:00 +0000

Made with AUBE on Linux last November, this is Heyoo by Kobi:

AUBE/Metadecks Live is a music production tool designed for live use. A track like this is made by setting up a bunch of sample, rhythm and effects units, playing them for a while and recording the result.

This post uses the HTML5 <audio> tag. If the audio controls are not present then the problem may simply be that your browser does not support HTML5 <audio> with Ogg Vorbis (in which case upgrade to one that does). If you are reading this in a feed reader or via a planet aggregator, then the problem may be that the reader or aggregator strips the HTML5 <audio> tag -- in which case you might want to switch to a more modern reader, or upgrade your planet.

Monday Music: Deika by Kobi

noreply@blogger.com (Anonymous) — Mon, 17 May 2010 00:00:00 +0000

Made with AUBE on Linux a few years ago, this is Deika by Kobi:

Streaming Ogg Vorbis with sighttpd 1.1.0

noreply@blogger.com (Anonymous) — Tue, 11 May 2010 00:30:00 +0000

I just released Sighttpd version 1.1.0, which includes support for streaming Ogg Vorbis from standard input. In an earlier post introducing a new HTTP streaming server (sighttpd 1.0.0), I described how sighttpd could be used to stream raw data, such as plain text:

$ while true; do date; sleep 1; done | sighttpd

and H.264 elementary video streams but not Ogg, because an Ogg stream needs to have setup headers prepended for each codec stream. "Instead, we would need to do something like Icecast: buffering these headers and serving them first to each client that connects before continuing with live Ogg pages".

So, that's exactly what version 1.1.0 introduces with a new <OggStdin> module. The sighttpd.conf setup is similar to the normal <Stdin> configuration:

Listen 3000

# Streaming Ogg Vorbis from stdin, using the special
# OggStdin module that caches Ogg Vorbis headers
<OggStdin>
        Path "/stream.ogg"
        Type "audio/ogg"
</OggStdin>

You can run this with a shell pipeline like:

$ arecord -c 2 -r 44100 -f S16_LE -t wav | oggenc -o - - | sighttpd -f examples/sighttpd-oggstdin.conf

And you can connect to it as an Ogg stream, eg:

$ ogg123 http://localhost:3000/stream.ogg

At the start of an Ogg Vorbis file or stream are three mandatory header packets:

The Vorbis BOS (beginning of stream) header, which describes basic information like the number of channels and the samplerate of the audio.
Metadata in VorbisComment format, which basically consists of text values like "ARTIST=Richard Feynman".
The setup header, which includes "codec setup information as well as the complete VQ and Huffman codebooks needed for decode".

We can view the raw contents of these packets with oggz dump:

$ oggz dump Kobi-Birk_20011125.ogg |head -n 30
00:00:00.000: serialno 0639825516, granulepos 0, packetno 0 *** bos: 30 bytes
    0000: 0176 6f72 6269 7300 0000 0002 44ac 0000  .vorbis.....D...
    0010: 18fc ffff 00f4 0100 18fc ffff b801       ..............

00:00:00.000: serialno 0639825516, calc. gpos 0, packetno 1: 94 bytes
    0000: 0376 6f72 6269 7320 0000 0058 6970 686f  .vorbis ...Xipho
    0010: 7068 6f72 7573 206c 6962 566f 7262 6973  phorus libVorbis
    0020: 2049 2032 3030 3130 3831 3303 0000 000a   I 20010813.... 
    0030: 0000 0074 6974 6c65 0042 6972 6b0b 0000  ...title.Birk ..
    0040: 0061 7274 6973 7400 4b6f 6269 0d00 0000  .artist.Kobi ...
    0050: 6461 7465 0032 3030 3131 3132 3501       date.20011125.

00:00:00.000: serialno 0639825516, granulepos 0, packetno 2: 2.820 kB
    0000: 0576 6f72 6269 7325 4243 5601 0040 0000  .vorbis%BCV..@..
    0010: 8020 9a19 a7b1 945a 6bad 1d72 9a42 abb5  . .....Zk..r.B..
    0020: d65a 6bad 2594 5a5b adb5 d65a 6bad b5d6  .Zk.%.Z[...Zk...
    0030: 5a6b adb5 d65a 6b8d 81d0 9055 0000 1000  Zk...Zk....U....
    0040: 0021 0c55 0651 c99c d65a 6b44 1064 0649  .! U.Q...ZkD.d.I
    0050: e920 d65a 6be8 a0a5 105a 4cad d65a 6bad  . .Zk....ZL..Zk.
    0060: b5d6 5a6b adb5 d61a 6320 3464 1500 0004  ..Zk....c 4d....
    0070: 00c0 1863 8c31 0619 6410 5248 21a5 9452  ...c.1..d.RH!..R
    0080: 8c31 e618 74d2 5147 9d76 da71 6821 9594  .1..t.QG.v.qh!..
    0090: 5acc 2de7 9c73 ceb9 d61a 080d 5905 0024  Z.-..s..... Y..$
    00a0: 0000 a838 8664 5886 0584 86ac 0200 3200  ...8.dX.......2.
    00b0: 0004 1024 4353 34c7 d554 cf34 5d55 0542  ...$CS4..T.4]U.B
    00c0: 4356 0100 4000 0002 8000 0a18 4451 1445  CV..@..... .DQ.E
    00d0: 5114 4551 1445 d1f3 3ccf f33c cff3 3ccf  Q.EQ.E..<..<..<.
    00e0: f33c cff3 3ccf f33c cf03 4243 5601 0009  .<..<..<..BCV.. 
    00f0: 0000 1a8a a228 8ee2 00a1 21ab 0080 0c00  .....(....!... .
    0100: 0001 0cc7 9014 49d1 244d d22c cff2 80d0  .. ...I.$M.,....

When a client connects to a stream somewhere in the middle of a song, these headers from the beginning are required in order to decode the audio data. sighttpd writes the pages containing the 3 header packets to a temporary file (created with mkstemp(3)). When a new client connects, the contents of that file are sent to it with sendfile(2) before jumping into the current contents of the stream.
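As a rough model of that behaviour (not sighttpd's actual C implementation, which works on a temporary file and sendfile(2)), the following sketch treats the stream as a list of pages. The Page type and the function names are invented for illustration.

```haskell
-- Toy stand-in for Ogg pages: header pages versus pages of audio data.
data Page = Header String | Audio Int
  deriving (Eq, Show)

isHeader :: Page -> Bool
isHeader (Header _) = True
isHeader (Audio _)  = False

-- The header pages at the start of the stream; this plays the role of the
-- temporary file that sighttpd fills and later replays.
cachedHeaders :: [Page] -> [Page]
cachedHeaders = takeWhile isHeader

-- What a client receives if it connects after `seen` pages have already
-- gone by: the cached headers first, then the stream from that point on.
clientView :: Int -> [Page] -> [Page]
clientView seen stream = headers ++ drop (max seen (length headers)) stream
  where
    headers = cachedHeaders stream
```

A client that connects at the very start (seen = 0) simply gets the whole stream, while a late joiner gets the three header pages followed by whatever audio is current, which is all a decoder needs to start playing mid-song.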

I'm not trying to make a replacement for Icecast, but instead building a more general streaming server -- and of course I want it to have good Ogg support! So, please try it out, and leave some feedback in the comments or in email to me or ogg-dev :)

Monday Music: Birk by Kobi

noreply@blogger.com (Anonymous) — Mon, 10 May 2010 00:00:00 +0000

Made with AUBE on Linux a few years ago, this is Birk by Kobi:

The rhythms are made with a simple drum machine, which is basically a matrix of triggers tied up to sample players. These are fed through a cascade of delays to get the rolling effect -- I love feeding a short delay to provide echo into a longer delay which matches the beat, so that the individual sounds combine with each other to make a more complex rhythm.

The rhythm is sent through a resonant low-pass filter; as the track starts off, the cutoff of that filter is raised to give the effect of opening up the whole track. It's a pretty simple technique, used in tracks like Fatboy Slim's Right Here, Right Now.

The filtered version is called the "wet" part of the mix, and the unfiltered version is the "dry" part. Changing the amount of these is useful: the dry part provides definition (the attacks of each drum are clearly audible), and the wet part has a more interesting texture. In a sequencer you might program the "wetness" of the effect; I like to work with it more directly by feeding the two versions into a cross-fader and switching between them live. If you are quick enough with the controls then your other arm is free for doing handstands :)

A monoid for server parties

noreply@blogger.com (Anonymous) — Sat, 08 May 2010 07:00:00 +0000

Happstack is a Haskell web applications framework. I hadn't played with it in a while but Happstack 0.5.0 was recently released so I decided to try it out. You can get it with cabal:

$ cabal update
$ cabal install happstack

Happstack has a pretty detailed tutorial, which is actually a self-hosted happstack site that you can cabal install and dig around in. It takes a while though, so let's just get into it. The tutorial doesn't actually start showing any code until section 7, first shot at happstack. This shows you how to run a Hello World server from Haskell's REPL ghci:

$ ghci
Prelude> import Happstack.Server
Prelude Happstack.Server> simpleHTTP (Conf 8080 Nothing) (return "Hello World!")

Then your http://localhost:8080/ should show a Hello World message, ie. you can run this in another terminal:

$  curl -i http://localhost:8080/
HTTP/1.1 200 OK
Connection: Keep-Alive
Content-Length: 12
Content-Type: text/plain; charset=UTF-8
Date: Tue, 04 May 2010 01:02:31 GMT
Server: Happstack/0.5.0

Hello World!

A REPL is great for playing around, but for some real code to read, see the example server in ControllerBasic.hs.

At the top of that file we get hit with this:

mzero corresponds to a 404 and mzero `mappend` f = f, while if f is not mzero then f `mappend` g = f.

That's not even code, it's a comment. Like, omg why would anyone talk like that? lol

It's talking about a type called ServerPartT, which you can think of as an abstract part of your web server, like the part that handles "everything under /articles" or "all the images". If you connect a bunch of these together you get your whole web server. Anyway, it turns out that it's much more fun if you simply pronounce ServerPartT as "Server Party":

So what's all this about monoids? Mathematically speaking, a monoid is a simple party game that some data objects can play when they get together. This is a mathematical definition in the sense that mathematicians are fun at parties.

The rules of the game are just that you have some way of appending things together; the tricky Haskell name for this is mappend, named after the famous French mathematician M. Append. Whenever you mappend two things together you get another thing of the same type that can also be mappended. There's also an empty element called mempty, or here called mzero(*).

So a monoid is just a way of saying how you connect things up. In terms of ServerPartTs:

mzero corresponds to a 404: The empty part of your server is 404 Not Found; ie. if your server contained no application parts at all, it would just have to return 404 for any request. In general if a ServerPartT can't handle the current request (eg. the ServerPartT for images doesn't handle /articles), then it'll act like mzero for that request.
mzero `mappend` f = f: mappend is the way that you connect up two server parts. Basically you just try server parts one after another: when a request comes along, if the first ServerPartT can't handle it, ie. acts like mzero, then try the next ServerPartT (and hey let's call it f).
if f is not mzero then f `mappend` g = f: On the other hand, if the first server part can handle the request, ie. it does not return 404 and is not mzero, then use it and ignore all the other ServerPartTs (call them g). The whole server is acting just like f by itself!

The point is that because ServerPartT follows all the rules of the monoid party game, you can suddenly use all the functions available in Data.Monoid, like mconcat which takes a whole list of objects and works out what would happen if they were all mappended together. This allows you to simply make a list of ServerPartTs and use the first one that doesn't return 404: you don't even need to write a function for evaluating your whole server, you can just use the plain old boring mconcat from the base libraries!
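Here is a small stand-alone sketch of the same party game, using nothing from Happstack. Part, handle, under and site are made-up names: a Part is just a function from a request path to Maybe a response, with Nothing standing in for the 404 case, and it uses the Monoid class (mempty) rather than MonadPlus (mzero). It assumes a GHC recent enough that Semigroup is exported by the Prelude (8.4 or later).

```haskell
import Data.List (isPrefixOf)

-- A toy "server part": given a request path, maybe produce a response body.
-- This is an illustration only, not Happstack's ServerPartT.
newtype Part = Part { handle :: String -> Maybe String }

instance Semigroup Part where
  Part f <> Part g = Part $ \path ->
    case f path of
      Nothing -> g path   -- f acted like the empty part (404): try the next one
      found   -> found    -- f handled the request: ignore g

instance Monoid Part where
  mempty = Part (const Nothing)   -- the part that handles nothing: always 404

-- A part that answers every path under the given prefix with a fixed body.
under :: String -> String -> Part
under prefix body = Part $ \path ->
  if prefix `isPrefixOf` path then Just body else Nothing

-- The whole server is just the list of parts, mconcat'ed together.
site :: Part
site = mconcat [under "/images" "an image", under "/articles" "an article"]
```

With this loaded, handle site "/articles/1" gives Just "an article" and handle site "/nope" gives Nothing. No function for "evaluating the whole server" was written: mconcat comes for free once the two instances exist.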

The structure of monoids (stuff that can be appended) is pretty trivial, but very common. I highly recommend sigfpe's Haskell Monoids and their Uses to learn about some other more general uses.

As for Happstack: it's obviously a bit deeper than your average web framework. In this article I've only looked at the basic idea behind making a server; it has many more features for managing data, transactions and scaling. So what do you think? Is the monoidal mumbo-jumbo useful or does it just add a layer of confusion? Would servers really wear party hats to a ServerPartT?

(*) because ServerPartT is the awesome kind of monoid formed by the MonadPlus type class, obviously.

How oggz-validate works

noreply@blogger.com (Anonymous) — Wed, 05 May 2010 00:00:00 +0000

oggz-validate is a tool for checking the conformance of Ogg files against the Ogg logical bitstream framing specification and RFC3533. It is used by validator.xiph.org, an online conformance-checking service.

oggz-validate builds on the correctness checks imposed by liboggz when writing Ogg packets. Whereas the low-level libogg simply allows an application to construct arbitrary Ogg packets and push them into a stream, liboggz checks each packet against some basic constraints and reports the following violations:

Packet belongs to unknown serialno
Granulepos decreasing within track
Multiple bos pages
Multiple eos pages

oggz_write() fails if any of these constraints are violated.
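To give a feel for these basic checks, here is a toy validator. It is a sketch under loud assumptions: the Packet and Track records are invented, packets and pages are conflated, and liboggz itself is a C library whose data structures and logic differ from this.

```haskell
-- Hypothetical, simplified view of a packet; not liboggz's representation.
data Packet = Packet
  { serialno   :: Int      -- which logical track the packet belongs to
  , granulepos :: Integer  -- position (timestamp-like) within the track
  , bos        :: Bool     -- marked beginning-of-stream
  , eos        :: Bool     -- marked end-of-stream
  }

-- Per-track state: the last granulepos seen, and whether eos was seen.
data Track = Track { lastGranule :: Integer, ended :: Bool }

-- Replay the packets in order and list every basic constraint violated.
validate :: [Packet] -> [String]
validate = go []
  where
    go _ [] = []
    go tracks (p:ps) =
      case lookup (serialno p) tracks of
        Nothing
          | bos p     -> go (start p : tracks) ps
          | otherwise -> "Packet belongs to unknown serialno" : go tracks ps
        Just t ->
          [ "Multiple bos pages" | bos p ]
            ++ [ "Multiple eos pages" | eos p && ended t ]
            ++ [ "Granulepos decreasing within track"
               | granulepos p < lastGranule t ]
            ++ go (update p t : filter ((/= serialno p) . fst) tracks) ps
    start p    = (serialno p, Track (granulepos p) (eos p))
    update p t = (serialno p, Track (granulepos p) (ended t || eos p))
```

An empty result means the packet sequence passed these four checks; the messages reuse the wording of the list above so the correspondence is easy to follow.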

oggz-validate works by reading the input file and attempting to reproduce its sequence of packets. It creates both a reader and a writer and feeds the output of the reader into the writer; any errors in stream creation are reported as validation errors.

Additionally, oggz-validate checks for the following higher-level violations:

File contains no Ogg packets
Packets out of order
eos marked but no bos
Missing eos pages
eos marked on page with no completed packets
Granulepos on page with no completed packets
Theora video bos page after audio bos page
Terminal header page has non-zero granulepos
Terminal header page contains non-header packet
Terminal header page contains non-header segment

For example, the check for "packets out of order" uses liboggz's parsing of codec granulepos to interpret timestamps of many free codecs including Ogg Dirac, FLAC, Speex, Theora and Vorbis. Also, there is a simple constraint in the specification for Ogg Theora that the BOS (Beginning Of Stream) header packet for Theora must come before that for Vorbis (or another audio codec).

What oggz-validate does not do is check that the contents of the codec streams are valid for that codec. Such checking is left up to codec-specific tools such as vorbose, and flac --test.

Towards adaptive streaming for Ogg

noreply@blogger.com (Anonymous) — Tue, 04 May 2010 00:00:00 +0000

Video streaming must be reliable and glitch-free. It must be possible for video hosting sites to allow clients to adapt to the available bandwidth, and for clients to be able to take advantage of this.

Adaptive streaming refers to a system which allows a video streaming client to request different versions of a stream according to the bandwidth it has available, and to change this selection on the fly, during the course of streaming. Such a system of course requires the streaming server to have various versions of a stream available, each in different bitrates. In order to allow the client to switch streams on the fly the content must be produced in such a way that corresponding video frames in the different representations can be easily accessed and decoded.
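The client-side choice can be sketched as a single function. This is only one plausible policy, and the name chooseBitrate and the units are assumptions: pick the highest-bitrate representation that fits the measured bandwidth, and fall back to the lowest one if none fit.

```haskell
-- Choose which representation to request, given the measured bandwidth and
-- the bitrates the server offers (same units for both, eg. kbit/s).
-- Hypothetical policy for illustration; no specification mandates it.
chooseBitrate :: Int -> [Int] -> Maybe Int
chooseBitrate _ [] = Nothing                -- the server offers nothing
chooseBitrate bandwidth rates =
  case filter (<= bandwidth) rates of
    []   -> Just (minimum rates)            -- nothing fits: take the smallest
    fits -> Just (maximum fits)             -- best quality that still fits
```

A client would re-evaluate this as its bandwidth estimate changes, which is why the different representations need corresponding frames that can be switched between cleanly.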

The first stage in building an adaptive streaming system is making it work for static content, ie. files on disk. The second stage is making it work for live content, ie. streams coming from a video production system consisting of cameras, mixing desks and random people in black tshirts. The first is mainly a technical problem; the second requires developing both technology and production processes.

Microsoft have a proprietary technology for adaptive bitrate streaming called Smooth Streaming, and an extension for Live Smooth Streaming. Apple are following a more open path, pursuing standardization of their specifications through the IETF, in the current form of the HTTP Live Streaming Internet-Draft. This extends the m3u playlist format with durations, sequence numbering, caching and stream information hints.

Ogg does not yet have an adaptive streaming specification; this should be developed in a way that is compatible with open specifications, and also taking into account the various quirks of Ogg. For example, the client must have access to codec setup headers for each bitrate representation, and the system must accommodate chained Ogg resources (as commonly used for streaming Ogg). In the W3C Media Fragments working group we are developing specifications for addressing fragments of media resources. The ongoing development of Ogg Skeleton allows Ogg to take advantage of these, allowing faster seeking through OggIndex and gapless playback through hints on presentation time.

Encouraging use of these features requires tool support and demonstrations of novel applications for video mash-ups. Video on the web should be a means of creative expression, allowing new applications that mash up parts of many videos and present the result seamlessly to the user. This goal makes Ogg fun, and brings us beyond thinking about video on the Web as just a different way of watching pre-packaged TV-style content.

Monday Music: Wannago by Kobi

noreply@blogger.com (Anonymous) — Mon, 03 May 2010 00:00:00 +0000

A change of pace ... here's a little Ogg Vorbis track to test out the <audio> tag, seeing as your browser probably supports that now.

Made with AUBE on Linux, a few years ago, this is Wannago by Kobi:

AUBE/Metadecks Live is a music production tool designed for live use. A track like the above is made by setting up a bunch of sample, rhythm and effects units, playing them for a while and recording the result.

Stable release maintenance with git (liboggz 1.0.2 and 1.1.1)

noreply@blogger.com (Anonymous) — Fri, 30 Apr 2010 00:00:00 +0000

I recently released two new versions of liboggz, liboggz-1.0.2 and liboggz-1.1.1. These are unremarkable maintenance releases, fixing some bugs but adding no new functionality.

Last year I released liboggz-1.1.0, which introduced a new oggz_packet type. This changed some of the public API while remaining binary compatible. As this was a fairly insidious change, I decided to also keep maintaining the previous 1.0.x version so that any distributions shipping that could easily upgrade without risking breakage. I do general maintenance work and bugfixes on the 1.0.x version as much as possible, and then adapt those to 1.1.x. Luckily this is quite straightforward to keep track of in git.

After committing a change to the 1.0-stable branch I merge that into master:

$ git commit # on 1.0-stable
[1.0-stable ccd2a2f] Fix regression introduced in 8c2da1
 1 files changed, 19 insertions(+), 7 deletions(-)
$ git checkout master
Switched to branch 'master'
$ git merge 1.0-stable
Merge made by recursive.
 src/liboggz/oggz_read.c |   26 +++++++++++++++++++-------
 1 files changed, 19 insertions(+), 7 deletions(-)

As these were just maintenance releases, the commit graph produced by git lol is quite well woven:

Lightweight branching makes it easy to keep track of these changes so that simple maintenance work is isolated from other development. The upshot is that these branches are ready for release at any time; if a critical fix comes along that requires a new release, then no backporting or cherry-picking needs to be done to get the code into shape: there is always a branch in releasable state.

Of course on top of that I also have topic branches for new features under development, and I periodically merge master into those. When the new features are ready for release they can simply be merged back into the master branch and shipped, without ever getting in the way of general maintenance work.
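The branch layout described above can be tried out in a throwaway repository. This is only a sketch: the file names, commit messages and the new-feature branch name are made up for illustration, and only the 1.0-stable and master branch names come from the workflow described here.

```shell
#!/bin/sh
set -e

# Scratch repository; nothing here touches a real project.
repo=$(mktemp -d)
cd "$repo"
git init -q
git symbolic-ref HEAD refs/heads/master   # name the initial branch "master"
git config user.name "Example Hacker"     # placeholder identity
git config user.email "hacker@example.com"

# Shared history, then the stable branch forks off.
echo "base" > read.c
git add read.c
git commit -q -m "Initial import"
git branch 1.0-stable

# master moves on (hypothetical change for the next release series).
echo "packet type" > packet.c
git add packet.c
git commit -q -m "Introduce new packet type"

# A topic branch for a feature under development.
git checkout -q -b new-feature
echo "feature" > feature.c
git add feature.c
git commit -q -m "Start new feature"

# A maintenance fix lands on the stable branch first ...
git checkout -q 1.0-stable
echo "fix" >> read.c
git commit -q -a -m "Fix regression"

# ... is merged into master ...
git checkout -q master
git merge -q --no-edit 1.0-stable

# ... and master is periodically merged into the topic branch.
git checkout -q new-feature
git merge -q --no-edit master

# Show the resulting graph (same options as the lol alias, plus --all).
git --no-pager log --graph --decorate --pretty=oneline --abbrev-commit --all
```

At the end the fix is reachable from all three branches, while the unfinished feature lives only on its topic branch.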

git lola

noreply@blogger.com (Anonymous) — Thu, 29 Apr 2010 00:00:00 +0000

The best tip I learned at Scott Chacon's talk at linux.conf.au 2010, Git Wrangling - Advanced Tips and Tricks, was this alias:

        lol = log --graph --decorate --pretty=oneline --abbrev-commit

This provides a really nice graph of your tree, showing the branch structure of merges etc. Of course there are really nice GUI tools for showing such graphs, but the advantage of git lol is that it works on a console or over ssh, so it is useful for remote development, or native development on an embedded board.

It is even nicer when you turn syntax coloring on in git, which also has the advantage of colorizing diff output to warn about bad whitespace.

To get an idea of a whole project structure, I found myself often running git lol --all, where the --all option says to show all branches. I used that often enough that I made a new alias, git lola:

        lola = log --graph --decorate --pretty=oneline --abbrev-commit --all

which has the added bonus of making me hum Lola every single day.

So, just copy the following into ~/.gitconfig for your full color git lola action:

[alias]
        lol = log --graph --decorate --pretty=oneline --abbrev-commit
        lola = log --graph --decorate --pretty=oneline --abbrev-commit --all
[color]
        branch = auto
        diff = auto
        interactive = auto
        status = auto
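If you would rather not edit the file by hand, git config can write the same entries. The sketch below writes them to a scratch file via --file so the result can be inspected first; the scratch path is an assumption for illustration, and replacing --file "$cfg" with --global targets ~/.gitconfig instead.

```shell
#!/bin/sh
# Scratch config file (hypothetical); use --global instead of --file "$cfg"
# to write to ~/.gitconfig.
cfg=$(mktemp)

# The two aliases from the post.
git config --file "$cfg" alias.lol  "log --graph --decorate --pretty=oneline --abbrev-commit"
git config --file "$cfg" alias.lola "log --graph --decorate --pretty=oneline --abbrev-commit --all"

# The four color settings, all set to auto.
for section in branch diff interactive status; do
        git config --file "$cfg" "color.$section" auto
done

# Print the generated file for inspection.
cat "$cfg"
```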

HTTP H.264 from multiple cameras with sighttpd's shrecord

noreply@blogger.com (Anonymous) — Wed, 28 Apr 2010 00:00:00 +0000

Today we'll look at how to use sighttpd for multi-camera H.264 video encoding and streaming.

This post is the last in a series about using hardware video encoding and image conversion features of Renesas SH-Mobile on Linux. In earlier posts we described how we do resource management in userspace (libuiomux); how we use the hardware image manipulation features for colorspace conversion and rescaling (libshveu); hardware encoding with libshcodecs; and simple HTTP streaming from standard input with sighttpd.

Today's post ties all these together, showing how to use sighttpd's support for integrated capture, video encoding and streaming. We'll also look at the performance of the server under some light load, rather than the performance of raw encoding to /dev/null that was done in the earlier libshcodecs article.

(Apologies to people reading this from Planet Haskell, I'll have to whip up something with Happstack and Hogg to make up for the disruption ;-)

Configuration

The sighttpd.conf setup is fairly straightforward; we put the options for each stream that we want to serve into an <SHRecord> block, including the desired URL path and the location of the control file to use. The same control files that are used for shcodecs-record can be used (the output filename is ignored by sighttpd).

Listen 3000

<SHRecord>
        Path "/video0/vga.264"
        CtlFile "/usr/share/shcodecs-record/k264-v4l2-vga-stream.ctl"
        Preview off
</SHRecord>

<SHRecord>
        Path "/video0/cif.264"
        CtlFile "/usr/share/shcodecs-record/k264-v4l2-cif-stream.ctl"
        Preview off
</SHRecord>

<SHRecord>
        Path "/video1/vga.264"
        CtlFile "/usr/share/shcodecs-record/k264-v4l2-vga-stream2.ctl"
        Preview off
</SHRecord>

<SHRecord>
        Path "/video1/cif.264"
        CtlFile "/usr/share/shcodecs-record/k264-v4l2-cif-stream2.ctl"
        Preview off
</SHRecord>

I turn the on-screen Preview off because the Ecovec board I'm using has no LCD panel and is instead plugged directly into an HDMI display, which introduces a lot of bus contention. Disabling the on-screen preview improves performance markedly.

This configuration on the host ecovec will make four H.264 streams appear at: http://ecovec:3000/video0/vga.264, http://ecovec:3000/video0/cif.264, http://ecovec:3000/video1/vga.264, and http://ecovec:3000/video1/cif.264. These streams are derived from two camera sources, which here happen to be /dev/video0 and /dev/video2 (sic) as specified in the control files.
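As a quick sanity check, the URLs can be derived from the configuration itself. This is a minimal sketch that assumes one Listen directive and quoted Path values laid out as above; list_stream_urls is a name made up for this post, not part of sighttpd.

```shell
#!/bin/sh
# list_stream_urls HOST CONF: print one URL per Path directive in CONF.
# Hypothetical helper; assumes the simple "Directive value" layout shown above.
list_stream_urls() {
        host=$1
        conf=$2
        # The first Listen directive gives the port.
        port=$(awk '$1 == "Listen" { print $2; exit }' "$conf")
        # Strip the quotes from each Path and append it to the base URL.
        awk -v base="http://$host:$port" \
            '$1 == "Path" { gsub(/"/, "", $2); print base $2 }' "$conf"
}
```

Running list_stream_urls ecovec sighttpd.conf against the configuration above should list the four stream URLs, one per line.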

Performance

Before any clients connect, sighttpd is continuously running the cameras, colorspace conversion, rescaling and encoding all 4 streams. The CPU usage is similar to that of shcodecs-record encoding 4 streams, ie. a little under 2% of this 500MHz SH7724 CPU:

top - 06:47:47 up  3:35,  2 users,  load average: 0.17, 0.13, 0.24
Tasks:  50 total,   1 running,  49 sleeping,   0 stopped,   0 zombie
Cpu(s):  1.9%us,  0.6%sy,  0.0%ni, 95.8%id,  1.6%wa,  0.0%hi,  0.0%si,  0.0%st
Mem:    248332k total,   211220k used,    37112k free,        0k buffers
Swap:        0k total,        0k used,        0k free,   143752k cached

  PID USER      PR  NI  VIRT  RES  SHR S %CPU %MEM    TIME+  COMMAND
27787 root      20   0 93284 9.8m 1396 S  1.6  4.0   0:01.72 sighttpd
27821 root      20   0  2976 1204  988 R  1.0  0.5   0:00.19 top
    1 root      20   0  2372  708  620 S  0.0  0.3   0:01.46 init
    2 root      20   0     0    0    0 S  0.0  0.0   0:00.00 kthreadd

I hacked up the following quick script on a locally connected Linux PC to create 400 stream connections (100 to each of the 4 video streams) and fire them off one per second. The -m option to curl provides a maximum timeout for each connection, which we use here to fetch 20s of video during each connection. (If you know a similar option for httperf to tell it to receive only a specified duration of a continuous HTTP stream with each connection, please leave a note in the comments!)

#!/bin/sh
for i in `seq 1 100`; do
        curl http://ecovec:3000/video0/vga.264 -o /dev/null -s -m 20 \
        -w "$i vga0: HTTP %{http_code} , %{time_total}s %{size_download} bytes\n" >> benchmark.log &
        sleep 1
        curl http://ecovec:3000/video1/vga.264 -o /dev/null -s -m 20 \
        -w "$i vga1: HTTP %{http_code} , %{time_total}s %{size_download} bytes\n" >> benchmark.log &
        sleep 1
        curl http://ecovec:3000/video0/cif.264 -o /dev/null -s -m 20 \
        -w "$i cif0: HTTP %{http_code} , %{time_total}s %{size_download} bytes\n" >> benchmark.log &
        sleep 1
        curl http://ecovec:3000/video1/cif.264 -o /dev/null -s -m 20 \
        -w "$i cif1: HTTP %{http_code} , %{time_total}s %{size_download} bytes\n" >> benchmark.log &
        sleep 1
done

The middle section of the benchmark.log file produced (while there are 20 parallel connections) looks like this:

52 vga1: HTTP 200 , 20.001s 475165 bytes
52 cif0: HTTP 200 , 20.004s 211838 bytes
52 cif1: HTTP 200 , 20.608s 353310 bytes
53 vga0: HTTP 200 , 20.024s 963123 bytes
53 vga1: HTTP 200 , 20.015s 568863 bytes
53 cif0: HTTP 200 , 20.032s 1172898 bytes
53 cif1: HTTP 200 , 20.012s 1004619 bytes
54 vga0: HTTP 200 , 20.039s 1269070 bytes
54 vga1: HTTP 200 , 20.068s 951508 bytes
54 cif0: HTTP 200 , 20.059s 1088203 bytes

and while that is running, top looks like this:

top - 08:30:54 up  5:18,  2 users,  load average: 0.30, 1.28, 0.79
Tasks:  49 total,   1 running,  48 sleeping,   0 stopped,   0 zombie
Cpu(s): 12.0%us,  1.2%sy,  0.0%ni, 81.0%id,  3.6%wa,  1.2%hi,  0.9%si,  0.0%st
Mem:    248332k total,   210472k used,    37860k free,        0k buffers
Swap:        0k total,        0k used,        0k free,   144124k cached

  PID USER      PR  NI  VIRT  RES  SHR S %CPU %MEM    TIME+  COMMAND
29162 root      20   0  130m 9.8m 1348 S 12.7  4.0   0:01.42 sighttpd
29169 conrad    20   0  2976 1204  988 R  1.3  0.5   0:00.15 top
    1 root      20   0  2372  708  620 S  0.0  0.3   0:01.46 init
    2 root      20   0     0    0    0 S  0.0  0.0   0:00.00 kthreadd

I'm not claiming that it can handle thousands of connections, but at least we can be sure that an embedded camera system based on this will reliably provide all the streams that you have asked it to capture and encode without dropouts. The usual use-case for this is as an input to an HTTP stream repeater on a larger server with a faster upstream connection, designed to handle a much higher load.
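To put rough numbers on a benchmark.log like the one above, the lines can be grouped by stream and turned into a mean bitrate. This is a sketch that assumes every line has exactly the format produced by the -w string in the script; summarize_log is a hypothetical name, and connections that did not return HTTP 200 are simply skipped.

```shell
#!/bin/sh
# summarize_log: read benchmark.log lines on stdin, e.g.
#   52 vga1: HTTP 200 , 20.001s 475165 bytes
# and print the connection count and mean bitrate for each stream.
summarize_log() {
        awk '
        $3 == "HTTP" && $4 == 200 {
                name = $2; sub(/:$/, "", name)      # "vga1:"   -> "vga1"
                secs = $6; sub(/s$/, "", secs)      # "20.001s" -> "20.001"
                count[name]++
                bytes[name] += $7
                elapsed[name] += secs
        }
        END {
                for (name in count)
                        printf "%s: %d connections, %.1f kbit/s average\n",
                               name, count[name],
                               bytes[name] * 8 / elapsed[name] / 1000
        }' | sort
}
```

Use it as summarize_log < benchmark.log once the benchmark has finished.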

The bigger picture

Stepping back, the point of this series of articles has been to demonstrate that it is very easy to use hardware acceleration with Linux: we can export complex driver functionality to userspace, we can quickly develop layered applications, and we can do this while still leaving enough CPU around for other (perhaps unrelated) tasks.

A new HTTP streaming server (sighttpd 1.0.0)

noreply@blogger.com (Anonymous) — Tue, 27 Apr 2010 00:00:00 +0000

I just released sighttpd version 1.0.0. Sighttpd is an HTTP streaming server designed for distributing realtime input. It is particularly useful for making camera streams available to multiple clients, and has been designed for embedded systems use.

In today's post I'll explain how sighttpd can be used to make its standard input available over HTTP. This is useful for prototyping an HTTP streaming system.

The syntax of sighttpd.conf(5) is vaguely reminiscent of Apache's configuration. We'll set up a <stdin> module to serve sighttpd's standard input at a URL that we choose.

The <stdin> configuration block takes two parameters, Path and Type. Path specifies the local part of the URL path at which you would like the content to appear. For example, when configuring the server http://example.com/, the configuration directive:

        Path /my/video.264

in a <stdin> block will make the content appear at http://example.com/my/video.264.

The Type parameter specifies the Internet media type (ie. MIME type) of the stream. This appears in the Content-Type HTTP response header, which in turn gives your browser a hint about what to do with the stream. For example, the configuration:

        Type video/mp4

will instruct sighttpd to serve this stream with Content-Type: video/mp4.

Streaming text

To begin with, let's set up a server streaming plain text, eg. a timestamp every second. We'll set up sighttpd so the content appears at the path /date.txt (eg. http://localhost:3000/date.txt):

Listen 3000

<stdin>
        Path "/date.txt"
        Type "text/plain"
</stdin>

Then, run:

    $ while true; do date; sleep 1; done | sighttpd

and connect from another terminal:

$ curl -i http://localhost:3000/date.txt
HTTP/1.1 200 OK
Date: Wed, 07 Apr 2010 04:23:09 GMT
Server: Sighttpd/0.9.0
Content-Type: text/plain

Wed Apr  7 13:23:09 JST 2010
Wed Apr  7 13:23:10 JST 2010
Wed Apr  7 13:23:11 JST 2010
...

Streaming H.264 video with shcodecs-record

In yesterday's post about multi-camera, multi-resolution hardware encoding I introduced some ways of using shcodecs-record to encode H.264 video on SH-Mobile. By specifying "-" as the output file for encoding, we can tell shcodecs-record to dump its encoded stream to standard output. Let's use this to set up a simple video stream with sighttpd:

Listen 3000

<stdin>
        Path "/stream.264"
        Type "video/mp4"
</stdin>

Then run your streaming video input, eg:

        $ shcodecs-record k264-v4l2-stream.ctl | sighttpd

and connect with a video player:

        $ mplayer http://localhost:3000/stream.264 -fps 30

What about Ogg?

Unfortunately we can't use this method of streaming raw data from standard input for Ogg because an Ogg stream needs to have setup headers prepended for each codec stream. Instead, we would need to do something like Icecast: buffering these headers and serving them first to each client that connects before continuing with live Ogg pages. Similarly, the RTP profile for Ogg Theora sends the codec setup headers out-of-band (eg. a separate HTTP resource), the location of which is advertised via SDP.

How would you stream your favourite format?

Streaming from standard input is a really simple way of testing out an HTTP stream. What's your favourite commandline for generating audio streams for internet radio? What other kind of data would you find useful to stream in realtime using sighttpd? Answer in the comments, please!

Sighttpd can also handle multiple streams, and in the next post we'll look at some multi-stream and multi-camera configurations for H.264 capture and encoding using the SHRecord module.

Multi-camera, multi-resolution hardware encoding (libshcodecs 1.1.0)

noreply@blogger.com (Anonymous) — Fri, 23 Apr 2010 09:06:00 +0000

I just released libshcodecs 1.1.0, a user-space library for controlling Renesas SH-Mobile hardware codecs. These tools now use libuiomux and libshveu for device access, memory management, colorspace conversion and rescaling.

The big feature is that it can now do simultaneous encode and decode of multiple streams. Coolest is that the shcodecs-record tool can handle multiple V4L2 camera interfaces, and can encode multiple streams of different resolutions from each camera source. And it can do this without breaking a sweat:

# time shcodecs-record -P k264-v4l2-vga.ctl k264-v4l2-vga-cam2-null.ctl k264-v4l2-qvga-null.ctl k264-v4l2-qvga-cam2-null.ctl
[0] Input file: /dev/video0
[0] Output file: /dev/null
[1] Input file: /dev/video2
[1] Output file: /dev/null
[2] Input file: /dev/video0
[2] Output file: /dev/null
[3] Input file: /dev/video2
[3] Output file: /dev/null
Camera 0 resolution:  640x480
Camera 1 resolution:  640x480
[0] Encode resolution:  640x480
[1] Encode resolution:  640x480
[2] Encode resolution:  320x240
[3] Encode resolution:  320x240
Target framerate:   30.0 fps
  Encoding @ 29.48 fps  (avg 30.04 fps)
Elapsed time (capture): 33.3 s
Captured 1000 frames (30.00 fps)

Elapsed time (capture): 33.3 s
Captured 1000 frames (30.00 fps)
[0] Elapsed time (encode): 33.3 s
[0] Encoded 1000 frames (30.04 fps)
[1] Elapsed time (encode): 33.3 s
[1] Encoded 1000 frames (30.04 fps)
[2] Elapsed time (encode): 33.3 s
[2] Encoded 1000 frames (30.04 fps)
[3] Elapsed time (encode): 33.3 s
[3] Encoded 1000 frames (30.04 fps)

real    0m34.137s
user    0m1.016s
sys     0m0.748s

That's 4 simultaneous H.264 encodes using 2 VGA camera sources, encoding each into both VGA and QVGA in realtime at 30fps, and using < 2% of this 500MHz SH7724 CPU:

top - 07:45:01 up 17 min,  2 users,  load average: 0.02, 0.04, 0.05
Tasks:  51 total,   2 running,  49 sleeping,   0 stopped,   0 zombie
Cpu(s):  1.9%us,  0.6%sy,  0.0%ni, 95.2%id,  1.6%wa,  0.3%hi,  0.3%si,  0.0%st
Mem:    248332k total,    78380k used,   169952k free,        0k buffers
Swap:        0k total,        0k used,        0k free,    24672k cached

  PID USER      PR  NI  VIRT  RES  SHR S %CPU %MEM    TIME+  COMMAND
 1526 root      20   0 68412 1676 1256 R  1.9  0.7   0:00.89 shcodecs-record
 1482 root      20   0  2976 1188  980 R  0.3  0.5   0:07.67 top
    1 root      20   0  2372  708  620 S  0.0  0.3   0:01.63 init
    2 root      20   0     0    0    0 S  0.0  0.0   0:00.00 kthreadd
    3 root      20   0     0    0    0 S  0.0  0.0   0:00.00 ksoftirqd/0

Similarly, a single 720p encode:

# time shcodecs-record -P k264-v4l2-720p.ctl 
[0] Input file: /dev/video0
[0] Output file: /dev/null
Camera 0 resolution:  1280x720
[0] Encode resolution:  1280x720
Target framerate:   30.0 fps
  Encoding @ 31.11 fps  (avg 29.97 fps)
Elapsed time (capture): 33.4 s
Captured 1000 frames (29.97 fps)
[0] Elapsed time (encode): 33.4 s
[0] Encoded 1000 frames (29.97 fps)

real    0m33.887s
user    0m0.684s
sys     0m0.492s

using < 2% CPU:

top - 07:53:51 up 26 min,  2 users,  load average: 0.02, 0.03, 0.03
Tasks:  50 total,   1 running,  49 sleeping,   0 stopped,   0 zombie
Cpu(s):  1.3%us,  0.6%sy,  0.0%ni, 96.1%id,  1.9%wa,  0.0%hi,  0.0%si,  0.0%st
Mem:    248332k total,    78168k used,   170164k free,        0k buffers
Swap:        0k total,        0k used,        0k free,    24736k cached

  PID USER      PR  NI  VIRT  RES  SHR S %CPU %MEM    TIME+  COMMAND
 1618 root      20   0 49152 1532 1256 S  1.3  0.6   0:00.93 shcodecs-record
 1482 root      20   0  2976 1188  980 R  0.6  0.5   0:11.15 top
    1 root      20   0  2372  708  620 S  0.0  0.3   0:01.63 init
    2 root      20   0     0    0    0 S  0.0  0.0   0:00.00 kthreadd
    3 root      20   0     0    0    0 S  0.0  0.0   0:00.00 ksoftirqd/0

Of course the reason the CPU is doing so little work is that it is just acknowledging interrupts and setting up the rescale and encode hardware to do the actual work. This shows the kind of results that can be achieved when hardware manufacturers include ASIC support for video encoding ;-)

This version of shcodecs-record uses a new shcodecs_encoder_run_multiple() function which runs multiple encoder instances in a consistent order, interleaving the encoding of individual frames. This allows the encoded output to be used in a realtime streaming environment.

libshcodecs-1.1.0 also includes support for running encoders and decoders in parallel threads, a feature developed by Phil Edworthy of Renesas Electronics Europe. We'll be using this in some GStreamer plugins under development (gst-sh-mobile), to make it even simpler to make use of this hardware video acceleration in applications.

Driving the VEU from userspace (libuiomux 1.1.0 and libshveu 1.2.0)

noreply@blogger.com (Anonymous) — Thu, 22 Apr 2010 21:53:00 +0000

I just released libuiomux 1.1.0 and libshveu 1.2.0. These were the subject of my talk at linux.conf.au 2010, Sharing Userspace IO Devices for fast access to multimedia hardware. The target processor is Renesas SH-Mobile, a system-on-chip with hardware H.264 compression and image manipulation functions.

The onboard VEU can do scaling and cropping, mirroring, 90 degree rotations, some filtering, and colorspace conversions between YCbCr and RGB. Colorspace conversion in particular is a repetitive calculation for every pixel value, that is required in all video applications; hence it gets its own dedicated hardware block. As an added bonus we can also rescale and crop using that block. I implemented a simple Ken Burns style demo of panning and zooming across a live video stream to demonstrate this during the linux.conf.au talk.

libshveu is a userspace device driver that controls the VEU, allowing applications to resize, crop, rotate and colorspace convert images using a plain C library interface:

  shveu_operation(veu,
                  y_physical_addr, c_physical_addr,
                  src_w, src_h, src_stride, SHVEU_YCbCr420,
                  dest_rgb_addr, 0UL,
                  dst_w, dst_h, dest_stride, SHVEU_RGB565,
                  SHVEU_NO_ROT);

One limitation of the VEU is that it bypasses the memory management unit (MMU). An MMU provides virtual to physical address translation and cache control, so the VEU does not have access to these. Hence the need to program the VEU from userspace using physical addresses in the example above. Also, the memory allocated for use by the UIO device is marked as non-cacheable, to ensure that both the application running on the CPU and the VEU are working on the same data.

Hardware blocks like the VEU are exposed to Linux userspace using Magnus Damm's uio_pdrv_genirq, a platform driver for Userspace IO (UIO) with generic IRQ handling code. It allocates some physically contiguous memory using the coherent memory allocator, and exposes this and the device's register window to userspace.

UIO itself is very lightweight, and provides no mechanisms for co-ordinating access to the device, or management of resources associated with it. In Linux we want these hardware functions to be available to many processes, which may or may not know about each other. This motivated UIOMux, a resource management layer for UIO devices. It provides fine-grained locking, memory management and interrupt dispatch.

As an example, imagine capturing an image from a camera and displaying it on screen, which requires converting the captured YCbCr image to RGB for the framebuffer. We capture using V4L2 (the standard Linux kernel interface for video capture). As we need to capture directly into the physically contiguous memory required by the VEU we use the recently-introduced V4L2_USERPTR method, which allows an application to specify the capture buffers for V4L2. Similarly, the Linux framebuffer's FBIOGET_FSCREENINFO ioctl() allows us to retrieve the physical address for display, so that we can tell the VEU to write its converted output directly into screen memory.

The steps are then:

Allocate a physically-contiguous buffer with uiomux_malloc().
Set up V4L2 to capture into it using V4L2_USERPTR.
Convert to a physical address with uiomux_virt_to_phys(); this is the source address for our colorspace conversion.

Get the physical address of the destination framebuffer with FBIOGET_FSCREENINFO.
Run the shveu_operation() to perform the colorspace conversion.
Set the Linux framebuffer to the destination buffer's virtual address.

The captured image is then displayed on the framebuffer in RGB, and no memcpy()s are involved. Run this in a loop to watch the video capture, and set up multiple capture and display buffers to flip between for smooth performance. The CPU is basically just co-ordinating the operations of the V4L2, VEU and framebuffer devices by responding to interrupts and setting register values; the result is that nearly zero CPU is used, and the application processor is free to handle more important tasks.

Trivial git routines in Haskell (ght 0.2.0)

noreply@blogger.com (Anonymous) — Thu, 22 Apr 2010 00:41:00 +0000

Yesterday I uploaded ght to Hackage. It's just a bunch of trivial routines for inspecting git repositories. It is in no way useful beyond that.

It uses UI.Command to handle various subcommands and generate documentation:

conrad@slippy:~$ ght
Trivial git inspection tools
Usage: ght [--version] [--help] command [args]

  This is a bunch of trivial routines for inspecting git repositories. It is in no way useful beyond that.

Reporting:
  show-prefix   Show path from top-level directory of repo
  show-root     Show path to top-level directory of repo

Blob management:
  show          Show an object
  log           Show commit logs
  show-raw      Show the raw dump of an object
  show-pack     Show the raw dump of a pack
  hash-object   Compute object ID from a file
  branch        show branches

Miscellaneous:
  help          Display help for a specific cmdcommand
  man           Generate Unix man page for specific cmdcommand

Please report bugs to <conrad@metadecks.org>

I wrote ght a few months back, and subsequently found gat. Similarly, the motivation for writing ght was mainly to understand git better. Often the best way to understand a new system or language is to implement it in Haskell ;-)

UI.Command

noreply@blogger.com (Anonymous) — Tue, 20 Apr 2010 22:49:00 +0000

I just uploaded the first cut of UI.Command, a Haskell framework for "friendly commandline programs". I hacked this together last year by pulling the command handling and documentation generation bits out of hogg. The result is a fairly simple way of adding self-documentation, help text and man page generation to a commandline tool; it's especially useful for developing little tools that go along with libraries.

It works a lot like various web frameworks, but for building commandline apps. To use it, you first declare the various bits of metadata about your application:

hello :: Application () ()
hello = def {
                appName = "hello",
                appVersion = "0.1",
                appAuthors = ["Joe R. Hacker"],
                appBugEmail = "bugs@example.com",
                appShortDesc = "UI.Command example program",
                appLongDesc = longDesc,
                appCategories = ["Greetings", "Cat Math"],
                appSeeAlso = ["tractorgen"],
                appProject = "Haskell",
                appCmds = [world, times]
        }

longDesc = "a demonstration program for the UI.Command framework."

For each of the commands you want to support, you then declare a Command like this:

world :: Command ()
world = defCmd {
                cmdName = "world",
                cmdHandler = worldHandler,
                cmdCategory = "Greetings",
                cmdShortDesc = "An implementation of the standard software greeting."
        }

worldHandler = liftIO $ putStrLn "Hello world!"

and finally, use UI.Command's appMain:

main :: IO ()
main = appMain hello

The result is a commandline program:

$ hello world
Hello world!

that can print out its own help text:

$ hello 
UI.Command example program
Usage: hello [--version] [--help] command [args]

  a demonstration program for the UI.Command framework.

Greetings:
  world         An implementation of the standard software greeting.

Cat Math:
  times         A repetition of salutation

Miscellaneous:
  help          Display help for a specific cmdcommand
  man           Generate Unix man page for specific cmdcommand

Please report bugs to <bugs@example.com>

$ hello help world
hello world: 
Usage: hello world [options]

  An implementation of the standard software greeting.

and also generate its own man pages:

$ hello man world
.TH HELLO 1 "April 2010" "hello" "Haskell" 

.SH SYNOPSIS

.B hello
.RI world
[
.I OPTIONS
]


.SH DESCRIPTION



An implementation of the standard software greeting.
.SH AUTHORS

hello was written by Joe R. Hacker

This manual page was autogenerated by
.B hello man world.

Please report bugs to <bugs@example.com>

The next step would be to port the syntax-checking parts of hogg selfcheck, which checks that the help examples pass through getOpt without errors.

CFFPP: linux.conf.au 2010

noreply@blogger.com (Anonymous) — Wed, 15 Jul 2009 03:53:00 +0000

The call for papers for linux.conf.au 2010 has been open for a few weeks, and closes soon (July 24).

I really want to encourage some talks about functional programming! The conference has a pretty strong developer focus, and most talks are about a practical topic. More importantly, we're looking for talks that inspire people to try new techniques, to approach design and troubleshooting with clarity and vigor (yarr!), to boldly consider that they should perhaps spend some time honing their craft before writing yet another application that inexplicably fails at runtime -- all in a friendly and entirely non-condescending environment of hackers having fun hacking.

Here's some suggestions for the kind of talks that I think could be interesting:

systems programming in Haskell/OCaml/whatever: how you wrote an interface to some hardware, handled lots of IO, controlled a robot, whatever
functional programming for kernel development: verification, security etc.
game programming: higher order design for 3D, AI etc.
proof vs. testing: (can anyone do a tutorial on proof without greek letters? not that Patryk Zadarnowski's talk about the Curry-Howard Isomorphism a few years ago wasn't *awesome*, but as a result of that people are clamoring (clamoring!) for some advice about how to prove their programs have no bugs).
some ... other ... practical benefit of functional programming!

The conference is in Wellington in January. January! it'll be windy, and it's in New Zealand!

Release: libfishsound 0.9.2

noreply@blogger.com (Anonymous) — Tue, 07 Apr 2009 22:48:00 +0000

Fishsound has moved to Xiph.org! The new home page is at http://www.xiph.org/fishsound/.

New in this release

This release contains security and other bugfixes:

Security fixes related to Mozilla bugs 468293, 480014, 480521, 481601.
Fix bounds checking of mode in Speex header
Handle allocation failures throughout due to out of memory
Added support for libFLAC 1.1.3
Add conditional support for speex_lib_get_mode() from libspeex 1.1.7. If available, this function is used in place of static mode definitions. For ticket:419.
Check for Vorbis libs via pkgconfig, required for MacPorts etc.

A proposal for generalizing the byte-range referral HTTP Response header

noreply@blogger.com (Anonymous) — Tue, 07 Apr 2009 16:32:00 +0000

Re: the Media Fragments WD. Here I am using the term "byte-range referral" for multiple concatenated HTTP requests, for the purpose of improving cacheability; this is called a "4-way handshake" in the current working draft.

Shortcomings of the existing byte-range referral scheme

The above WD, and the current Annodex scheme, are specified to allow sharing of non-header data between different temporal views of media resources. They limit the positioning of custom data to the media headers. This allows different segments to have different headers, which is useful for Ogg but not necessarily so for other formats.

Even for Ogg, it could be useful to refer to the codebooks separately from the Skeleton for more finely-grained data re-use. Then a client can locally cache the codebooks and know not to bother retrieving them over and over, while still getting the updated Skeleton and keyframe data for temporal segment requests.

Hence, I am proposing that we should specify an ordered list of (URI, byte range) tuples, the concatenation of which is byte-wise identical to the byte contents of the requested URI.

This response can also contain data, so if you want to refer to this response you can include a tuple of (this, range) where this is the literal string "this", and refers to the body of the current response.

This syntax then allows the server to include parts from many different URLs. The custom data is then centralized in this response, and can be used in any part of the constructed response, including tail data (such as ID3 tags, DivX seek tables etc.).

List and tuple separator characters

The list separator should be a comma, as this allows the list to be split across multiple HTTP response header lines (without re-ordering).

Hence the tuple separator should not be a comma; it can simply be whitespace:

Range-Referral: http://www.example.com/video.ogv?headers 0-1280
Range-Referral: http://content1.example.com/video.ogv 5380-48204
Range-Referral: this 0-950
Range-Referral: http://content1.example.com/video.ogv 60880-238382

By comma replacement, this set of headers is equivalent to the single header:

Range-Referral: http://www.example.com/video.ogv?headers 0-1280, http://content1.example.com/video.ogv 5380-48204, this 0-950, http://content1.example.com/video.ogv 60880-238382
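The equivalence between the two forms can be sketched with a pair of small shell helpers. Range-Referral is only a proposal, and join_referrals and split_referrals are hypothetical names; the sketch also assumes that the URIs themselves contain no commas, which a real parser would have to handle.

```shell
#!/bin/sh
# join_referrals: read response header lines on stdin and print the single
# comma-separated Range-Referral value equivalent to them, in order.
join_referrals() {
        sed -n 's/^Range-Referral:[[:space:]]*//p' |
                paste -s -d, - |
                sed 's/,/, /g'
}

# split_referrals VALUE: print one "URI range" tuple per line.
split_referrals() {
        printf '%s\n' "$1" |
                tr ',' '\n' |
                sed -e 's/^[[:space:]]*//' -e 's/[[:space:]]*$//'
}
```

Feeding the four separate headers above to join_referrals gives the value of the combined header, and split_referrals turns that value back into the original ordered tuples.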

Interpretation of other response headers

The body of this response is simply all the custom parts for this view, concatenated bytewise. The Range-Referral header explains how to use this data.

Content-Length: is the length of the body.

A Range request is made relative to the body. So for example a client could just do a HEAD request to get the Range-Referral headers, and then do multiple Range requests to retrieve the required parts in sequence (rather than locally caching all the data for tailers etc.). Coherence of the concatenated responses can be assured by the use of existing HTTP/1.1 caching identifiers.

So, this constructed response is only special in that a user agent knows how to use it in conjunction with other URI response data to display a media segment. Otherwise it is standard HTTP, and can have caching headers/tags attached, be cached by intermediate proxies, and itself be the subject of range requests.
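The HEAD-then-Range approach could look roughly like this from a client's point of view. Since Range-Referral is only a proposal, this sketch does not talk to any server: it just prints the curl commands a client might issue for each tuple. referral_commands is a hypothetical helper, and treating "this" as a range request against the view's own URL is one reading of the scheme described above.

```shell
#!/bin/sh
# referral_commands VIEW_URL: read "URI range" tuples on stdin, one per line,
# and print (not run) a curl range request for each.
referral_commands() {
        view_url=$1
        while read -r uri range; do
                # Skip blank lines.
                [ -n "$uri" ] || continue
                # "this" refers to the body of the constructed response itself.
                if [ "$uri" = "this" ]; then
                        uri=$view_url
                fi
                printf 'curl -s -r %s "%s"\n' "$range" "$uri"
        done
}
```

Concatenating the output of the printed commands in order would then reconstruct the requested view, with each individual range free to be served from a cache.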

Generalization to other segment types

This mechanism allows a complex sequence of byte-ranges to be specified. It explicitly marks data ranges which are re-usable, allowing them to be cached. It generalizes so that any complex data subview can be served, where re-usable data is keyed canonically and can be cached on the network.

For example, it may be useful for specifying the data for a spatial subrange of video.

liboggplay, liboggz, libfishsound migrated to git.xiph.org

noreply@blogger.com (Anonymous) — Fri, 03 Apr 2009 06:13:00 +0000

The source repositories for some Ogg libraries developed as part of the Annodex project have moved from svn.annodex.net to git.xiph.org. These libraries are:

liboggplay, an Ogg Theora playback library used by Mozilla Firefox;
libfishsound, a simplified API for using audio codecs, used by liboggplay and by the DirectShow Oggcodecs; and
liboggz, a library for seeking, reading and writing Ogg (used by liboggplay), and tools for managing Ogg streams. This includes oggz-chop, which is used by various sites including the Internet Archive to serve Ogg files.

Reasons for the migration

Xiph.org, which develops free codecs (Ogg Vorbis, Theora, Dirac, Speex, CELT, FLAC), already provided the hosting for Annodex.net projects. The move to the xiph.org domain reflects that these libraries are recommended for general use by projects requiring Ogg support.

The move from Subversion to Git allows for distributed development, letting developers without write access to the central Subversion repository develop code using a version control system, and making it easier for developers and packagers to track multiple independent changes. Among distributed version control systems, Git was chosen for its flexibility and popularity. It is already used within Xiph.org for Speex, the ultra-low latency, high quality audio codec CELT, and the experimental text overlay codec Kate.

Checking out the sources

To do a fresh checkout of the code, make a new git repository by cloning. This assumes that you begin with an empty working directory:

$ git clone git://git.xiph.org/liboggz.git

Adding a remote to an existing git-svn checkout

Many developers already used git-svn to access the previous svn repositories. If you are one of them, you will already have a local git clone of the sources, perhaps with your own local changes. In that case, simply add a new remote to your existing repository, eg.:

$ git remote add xiph git://git.xiph.org/liboggz.git

Discovery and fallback for media segment addressing over HTTP

noreply@blogger.com (Anonymous) — Wed, 01 Apr 2009 06:56:00 +0000

This post concerns the use of queries or fragments in the URI specification for accessing segments of media over HTTP. We outline the user-visible differences between the two approaches, including the form of the URIs seen by users in each scenario and the consequent user interface activity, and then explain the HTTP request and response mechanisms that result. The purpose of this analysis is to better understand the trade-offs in usability and the impact on network performance, with reference to existing implementations rather than hypothetical scenarios.

I will make the case that the user-visible differences between the two syntaxes are immaterial, and that a more important distinction is that they induce different protocols. I will also claim that the use of the fragment syntax introduces unnecessary complexity in that it lacks a discovery mechanism and has no useful fallback to existing HTTP.

User-visible differences

We are constructing a URI syntax for addressing segments of media data. Taking the simple case of addressing some video content beginning at an offset of 10 seconds, we consider the two forms:

Query syntax: http://www.example.com/media.ogv?t=10
Fragment syntax: http://www.example.com/media.ogv#t=10

For simplicity here we are using a shortened segment identifier t=10; I touched on the topic of segment identifiers in a recent article about pretty printing durations.

Regarding the direct HTTP semantics of these two forms, if the user is already viewing the specified media.ogv, the query syntax reloads the portion from 10 seconds as a new resource, whereas the fragment syntax modifies the view of the current resource.

Although developers are rightly wary of a page refresh due to the time required to render complex HTML, in practice no visible change occurs when reloading a video. The query syntax has been used to control video seeking in JavaScript (using the Java cortado video player plugin, or an earlier Oggplay plugin), and also natively in the current Firefox 3.5 implementation.

In any case, this distinction is only user-visible if the video is the top-level resource. In the common case of a web page that embeds a video, the user-visible resource is the HTML page. In this case, the mechanism for controlling video is under the control of the embedding web page via JavaScript.

For example, URIs to YouTube pages allow a time segment to be appended using a fragment syntax. However, this fragment is used by JavaScript to control the embedded Flash video player; the mechanism for retrieving video data is then managed by the Flash player. Similarly, in HTML5 Ogg <video> implementations, a fragment identifier appended to the HTML page may be interpreted by JavaScript to control seeking in the <video> source using a non-fragment mechanism, such as query syntax.

Differences in request mechanisms

Either way we introduce a new behaviour that user agents can use to retrieve media segments over HTTP.

When handling a media segment which is specified by a query, the user agent initiates a standard HTTP request. It connects to port 80 on the specified host, and uses the entire path, including the query specifier, in the GET request. The server then begins transferring the required data representing that segment of the media.

To retrieve the URI http://www.example.com/media.ogv?t=10:

GET /media.ogv?t=10 HTTP/1.1
Host: www.example.com

However the proposed request mechanism for handling a segment specified by a fragment is not standard HTTP. In conventional HTTP, a fragment specifier is stripped by the user agent and not sent to the server at all; rather, the server sends the requested response (representing the entire resource), and after retrieval, the user-agent uses the fragment specifier to select the view shown to the user.

A recently proposed behaviour for handling media segments involves placing the segment specifier into the Range HTTP Request header, with a new unit of seconds.

To retrieve the URI http://www.example.com/media.ogv#t=10:

GET /media.ogv HTTP/1.1
Host: www.example.com
Range: seconds=10-
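The difference between the two request mechanisms can be sketched in a few lines of Python. This is only an illustration: build_request is a name I made up, the Range unit of seconds is the proposed (non-standard) behaviour described above, and the sketch handles only the simple t=START form of segment identifier.

```python
from urllib.parse import parse_qs, urlsplit


def build_request(uri: str) -> str:
    """Build the HTTP/1.1 request a user agent would send for a segment URI."""
    parts = urlsplit(uri)
    # A query is part of the request target and is sent to the server.
    target = parts.path or "/"
    if parts.query:
        target += "?" + parts.query
    lines = [f"GET {target} HTTP/1.1", f"Host: {parts.netloc}"]
    # A fragment is never sent in the request line; under the proposed
    # behaviour its time offset moves into a Range header instead.
    start = parse_qs(parts.fragment).get("t")
    if start:
        lines.append(f"Range: seconds={start[0]}-")
    return "\r\n".join(lines) + "\r\n\r\n"


print(build_request("http://www.example.com/media.ogv?t=10"))
print(build_request("http://www.example.com/media.ogv#t=10"))
```

The query form needs no special handling at all, which is the point: only the fragment form requires the user agent to know about the new Range unit.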

Response mechanism: byte-range redirection

The byte-range redirection response mechanism involves identifying parts of the segment view which are byte-wise identical to the original resource, and specifying redirections to those.

How discovery works

A user-agent will only receive a byte-range redirection response if it has indicated that it is capable of interpreting that, by including an extra HTTP request header. For example, here using a media segment URL specified with a query parameter:

GET /media.ogv?t=10 HTTP/1.1
Host: www.example.com
X-Accept-Range-Redirect: bytes

If the server is capable of handling the byte-range redirection mechanism, it will do so and indicate that it has done so explicitly in its response headers.

Query syntax has a sensible fallback to standard HTTP

However if the extra request header is not present, the server will simply send an entire response corresponding to the requested segment. Similarly if the header is present but the server is not capable of this new mechanism, it will simply continue with a standard HTTP response. The client can tell if the response is a segment response or not by the presence of an acknowledging response header.

If either client or server does not understand the byte-range redirection protocol, the request falls back to standard HTTP and the required segment is correctly returned. The cost of this fallback, compared to the case where both client and server understand the new request/response headers, is a loss of cacheability for subsequent overlapping segment requests.
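The discovery and fallback logic amounts to a small decision on each side, sketched here in Python. The function names are hypothetical, and since the acknowledging response header is not named in this post, I am assuming for illustration that the presence of a Range-Referral header plays that role.

```python
from typing import Dict


def normalise(headers: Dict[str, str]) -> Dict[str, str]:
    """HTTP header names are case-insensitive, so compare them in lower case."""
    return {name.lower(): value for name, value in headers.items()}


def choose_response_mode(request_headers: Dict[str, str],
                         server_supports_redirect: bool) -> str:
    """Server side: redirect only if the client asked and the server is able."""
    accepts = normalise(request_headers).get("x-accept-range-redirect", "")
    offered = [unit.strip().lower() for unit in accepts.split(",")]
    if server_supports_redirect and "bytes" in offered:
        return "byte-range-redirect"
    # Fallback: send the whole requested segment as a standard response.
    return "full-segment"


def is_redirect_response(response_headers: Dict[str, str],
                         ack_header: str = "Range-Referral") -> bool:
    """Client side: an acknowledging header (assumed name) marks a redirect."""
    return ack_header.lower() in normalise(response_headers)


request = {"Host": "www.example.com", "X-Accept-Range-Redirect": "bytes"}
print(choose_response_mode(request, server_supports_redirect=True))
print(choose_response_mode(request, server_supports_redirect=False))
print(choose_response_mode({"Host": "www.example.com"}, True))
```

In every branch the client ends up with the requested segment; the only thing lost in the fallback branches is cacheability of the shared byte ranges.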

Fragment syntax has a high cost of failure

The mechanism involving the fragment specifier does not have a fallback to standard HTTP: if the client does not understand that it should add the Range header with newly defined units, then it will end up simply requesting the entire resource. Similarly, if the server does not understand the new header then it will simply respond with the entire resource. If the cost of failure is to download some number of hours of extra video, as it would be in the case of MetaVid's congress proceedings, that is a prohibitive cost.

Summary

The distinction is one of protocol mechanism
For the common case of video displayed in HTML, the distinction is not user-visible
The use of fragment specifiers does not have a fallback to standard HTTP
The cost of discovery failure for fragments is high (retrieval of entire resource)

Actions

To clarify within the Media Fragments WG how queries can be used effectively, for both considered user scenarios.
To consider how the byte-range redirection mechanism can be generalized for other segment specifiers, such as spatial regions.

The economics of Twitter spam

noreply@blogger.com (Anonymous) — Sun, 08 Mar 2009 23:59:00 +0000

Recently more and more people have reported that they are being followed by spammers on Twitter. It's easy to track this problem: just search for #spam. Being followed by a Twitter spammer isn't like being stalked by a murderer; actually in the current environment, these guys are a fairly benign parasite that can work in your favor. So let's look at the economics of Twitter spam.

The upside for spammers is the usual obvious SEO shite: you've got something useless to peddle (yourself, your scam, your illegitimate business selling poor copies of pretentious luxury goods, your legitimate business selling enhancement placebos to suckers); you spend your time trying to defile fine and upstanding web pages with links to your pathetic piece of virtual real estate; Twitter comes along and your primitive brain realizes it can post its links there. You follow people so that they get a notification in their email pointing to your Twitter feed. Maybe they read it, maybe they click the tinyurl-obscured link. You cream yourself if they choose to follow you, because then they'll get all your spam, and you'll look more legit by having actual followers (like, real people from outside your cluster of bots and morons).

Now, what's the upside for normal humans in being followed by these scum?

Knowledge is work, a means for putting food on the table; information is power, a means for taking food from others.

Following as many people as you can on Twitter is a useful way to stay in front of your game: you know what people are up to, you see trends evolve, you get notice of articles before they're syndicated, you watch news unfold in your little niche of the world. And of course, the more people that follow you, the further your own message spreads: how great you are, how you're beating the system, how your ~~pretentious~~ beautiful designs and products can uplift and empower.

So there's an incentive to increase both the number of people you follow and the number of people who follow you. The first is easy; you just find people and press their button. The second is more difficult: you need to say something worthwhile in your tweets. Sometimes, not always, people will reciprocate when you follow them -- (SEO tip here!:) it helps if your own tweets are interesting.

However, there is a 2000 following limit: you can't follow more than 2000 people until you have 2000 followers. So, if you want to expand your reach into the info-verse, every follower counts -- even those spambots. So, now, these guys have evolved a little symbiotic, parasitic relationship with their hosts (you). You feel the first bite when they follow, but it feeds your ego. All you need is followers! no-one's going to do background checks on your popularity!

Relevance ranking anyone?

There's more to it though: Twitter search is currently being rolled out across the default user interface, and various bloggers are describing Twitter as a "search engine" (apparently that's the appropriate noun to describe someone that collects ideas). Twitter search is currently a realtime feed of query matches (the zeitgeist! *fap* *fap* *fap*) with no relevance ranking. As the search feature gains usage, people will want relevant results to more complex queries. An obviously useful ranking input is the number of followers that a Twit has. These spambots will make you appear relevant!

We can follow this down silly paths -- eg. the more you tweet, the more spambot-followers you get, the more ranking relevance you have. The spammers introduce an incentive to posting often, and that mechanism has positive feedback.

More useful ranking mechanisms are things like reply frequency and analysis of re-tweets. Re-tweets are interesting to track because you can find the users who originate popular ideas: give them the microphone, dammit.

Action items

So there's an imbalance in the Twitter economy. Spammers are using Twitter and the environment encourages it.

Wishlist for Twitter:

Track how often users are blocked; warn others against frequently blocked users and auto-ban them.
Add user-initiated "Report spammer" buttons.
Implement detection of spammer clusters and auto-ban them.

Action items for Twitter users:

Block spammers on Twitter.
Block spammers on Twitter.
Block spammers on Twitter.

Please rant about how much you love the symbiotic parasitic relationship with your spambot-followers!