Custom thumbnail generators for audio waveforms and GIMP XCF files

I recently packaged these 3D thumbnailers for Arch Linux and this act inspired me to dig into this topic a bit and make some of my own. First off, what’s a “thumbnailer”? It related to this Free Desktop specification and I’d summarise it as:

A thumbnailer is a program that can generate a small preview image (called “thumbnail”) of a specific file type. File browsers know about thumbnailers by reading configuration from .thumbnailer files which specify which program to call for a specific MIME type, and which arguments to use to get the desired output.

For more details and practical examples of what these files look like and what you can do with them, see this blog post by Radu Zaharia. For most of the 3D thumbnailers I packaged, the thumbnailer programs are small purpose-built scripts, written in Python or Bash, that understand the specified format and how to get a PNG image out of it. The thumbnailer config just specified which order it wants its file input and output arguments in. However, as Radu’s blog post points out these can also be more advanced one-liners using programs you already have installed, so I decided to try that out and this post I’ll cover 2 such examples: Audio & GIMP.

tl;dr: Copy-pastable version at the end of the post Continue reading

Getting Xcape functionality in Wayland

The thing I’ve missed most since switching to Wayland is definitely Xcape, but more on that later. The solution is to use the Interception Tools plugin caps2esc and in fact this is a display-server agnostic solution because it’s at a much lower level, so it also works for X11 and even in the virtual consoles! 😮

Luckily Arch Linux actually has this plugin in the official package repos, so installation is easy:

$ pacman -S interception-caps2esc

This also installs interception-tools as a dependency, including udevmon which is a program that associates a job with any current and new devices (e.g. keyboards). In our case the “job” we want is a shell script piping intercepted keyboard output through the caps2esc plugin.

To do this, you copy this snippet, literally as-is from the plugin README file:

- JOB: intercept -g $DEVNODE | caps2esc | uinput -d $DEVNODE
  DEVICE:
    EVENTS:
      EV_KEY: [KEY_CAPSLOCK, KEY_ESC]

Paste that into a new file under udevmon.d, for example /etc/interception/udevmon.d/caps2esc.yaml to associate this plugin with the CapsLock and Esc keys on all devices.

Lastly, start up the udevmon service. Despite the README talking about running commands manually, the Arch Linux package includes a systemd unit file with the recommend defaults pre-configured, you just need to turn it on:

sudo systemctl enable --now udevmon.service

That’s it! Now Esc and CapsLock are swapped and holding down Esc functions as Ctlr, in Wayland, in X and even in the TTY!!!


So… why did I want to do this? Well, the caps2esc project documentation says it best:

transforming the most useless key ever in the most useful one

I spend a lot of my computer-time editing code and text in Vim where the Esc key is really frequently used because it’s how you change to “normal mode”. Lots of other programs also let you use Escape to do things like dismissing windows or stepping out of form fields. In GNOME settings there’s an option to swap the CapsLock and Escape keys and that made it a much smaller reach to get the Esc key with my pinky.

Reading up a bit more on this I stumbled upon Xcape which lets you set up CapsLock so that it’ll send Esc if you press it briefly, but act as Ctrl if you hold it down together with other keys. Amazing. The Control key is another annoying pinky-reach and its position varies between my work and personal computers. It’s used in loads of keyboard shortcuts across all apps, and it’s part of most key combinations in readline shortcuts used in the shell. As a power user, having both this and Escape close at hand on this really big, easy-to-hit CapsLock key in the home row is fantastic! Until I switched to macOS.

The old work laptop got replaced with a newer MacBook that brought exciting new hardware compatibility issues. In this case, the built-in keyboard refused to work at all, and after a few weekends of trying to fix it I gave up and accepted the challenge of using the other OS. I was happy to learn this dual Ctrl-Esc functionality was something Mac users were after too and using Karabiner-Elements I was able to set this up pretty easily, as well as the compose key. I ended up using this for a several years and it worked well. I might write a bit about my personal experiences with macOS compared to Linux at some point but eventually this laptop got old too and the next machine was Linux-compatible.

Meanwhile in Linux-land support for Wayland is stable with GNOME desktop and so I switched from X11 to Wayland to get better scaling between the laptop’s HiDPI screen and the HD external monitor. The issue of being able to share my desktop got fixed upstream but the last thing I was missing from X11 was some alternative to Xcape. The X in Xcape alludes to X11, and if an X server isn’t running Xcape can’t start and anyway there wouldn’t be any X input events to handle without X. Pretty annoying. I tolerated this for about a year because the solutions I found looked a bit less out-of-the box than Xcape and I didn’t want to spend so much time tweaking a computer. Xcape issue #67 wayland support lists some alternatives, ranging between pretty hacky to “just use X”.

It was in this list that I came across caps2esc, bookmarked it for later, and later is today. I’ll be honest, I struggled a bit to make heads or tails of the Interception Tools documentation. On one hand it seems the way to write custom “plugin” functionality was written in C, and the code looked short and readable enough, but on the other hand it’s talking about configuring things with YAML files. The YAML config didn’t seem to be input to the plugin itself though. After some back-and-forth between the main docs and the plugin docs it became obvious: The plugin itself is written in C. If you have it installed, you can use the interception tools intercept and udevmon to send keyboard events through the caps2esc plugin. Using udevmon you can configure to which devices to apply this rule (called a job) and the way to do that is to write some udevmon config in YAML. The puzzle pieces all fit together and the result is the condensed set of instructions at the top of this post.

It’s better to use floats for maths and leave ints for indices

Someone showed me a bug in my recent game, Freefall, whereby you can get as far as you like, by just changing speeds continuously, you don’t even need to look at the screen, just tap with a consistent tempo to keep changing speeds. This works because the projectile speeds are set with integers which means they have:

  1. A base speed at which they will hit you with the parachute
  2. Fast (double base) speed they will you with if in free fall
  3. Anything faster than this will miss you because it will get off the screen before it reaches you and is therefore only in there for decoration / to confuse the player

I used integers for practically everything in the game because the low resolution meant that even moving one pixel at a time is a pretty big step (there are only 48px in the vertical axis) and you can’t draw smaller than that.

During testing I focussed on checking that 1 & 2 would hit you at their respective speeds and that you could dodge them if you changed at the last moment. If you change speeds so infrequently the problem is less apparent. Continue reading

Launched a new game: Freefall

On 27 February I released a new Nokia 3310-style game: Freefall. In this game you control a parachute for humanitarian airdops in hostile territory. It’s my entry to Nokia 3310 Jam 5 and the theme was Fast & Slow so the main game action revolves around controlling the speed of the airdop between parachuting and freefall to dodge bullets coming from both sides.

The aim of the Nokia 3310 Jam is to have fun making a game using the restrictions of a Nokia 3310, so green, low-res, 1-bit screen, monophonic and only 12 keypad buttons. My entry placed 28th out of a 147 submissions and I’m especially proud of this because I did the music, animations, programming and game design all by myself. The game is open source and you can read it in the GitHub repository.