Solaris portability - flock()

Jan 08, 2013

tags: illumos, smartos, solaris

This is the first of what I hope will be a regular series of posts looking at software portability when it comes to Solarish (Solaris, illumos, SmartOS etc.) systems. We will begin with flock(), as it’s what I happen to have been fixing today.

The most recent version of tmux fails with the following errors:

client.c: In function 'client_get_lock':
client.c:81:2: warning: implicit declaration of function 'flock' [-Wimplicit-function-declaration]
client.c:81:20: error: 'LOCK_EX' undeclared (first use in this function)
client.c:81:20: note: each undeclared identifier is reported only once for each function it appears in
client.c:81:28: error: 'LOCK_NB' undeclared (first use in this function)

from this section of code:

int
client_get_lock(char *lockfile)
{
        int lockfd;

        if ((lockfd = open(lockfile, O_WRONLY|O_CREAT, 0600)) == -1)
                fatal("open failed");

        if (flock(lockfd, LOCK_EX|LOCK_NB) == -1 && errno == EWOULDBLOCK) {
                while (flock(lockfd, LOCK_EX) == -1 && errno == EINTR)
                        /* nothing */;
                close(lockfd);
                return (-1);
        }

        return (lockfd);
}

This code is reasonably straight-forward, trying to create a lock on a file descriptor, and if it isn’t able to waits until the lock is released then fails, ready for the calling function to retry.

However, the flock() routine is of BSD heritage and does not exist on Solaris. There used to be a compatability version as part of the /usr/ucb environment, but that no longer exists in newer versions.

Thankfully, there is an alternative interface which has existed for as long as flock() but with the added benefit of being standardised by POSIX and thus much more portable - enter fcntl()!

Let’s look at how the two are similar-but-different, and then show how we can change this code to be more portable.

flock() implementation

As we can see from the NetBSD manual page below, flock() is very simple, and this is probably the main reason why people choose to use it over fcntl() for file locking.

NAME
     flock -- apply or remove an advisory lock on an open file

LIBRARY
     Standard C Library (libc, -lc)

SYNOPSIS
     #include <fcntl.h>

     #define   LOCK_SH   1    /* shared lock */
     #define   LOCK_EX   2    /* exclusive lock */
     #define   LOCK_NB   4    /* don't block when locking */
     #define   LOCK_UN   8    /* unlock */

     int
     flock(int fd, int operation);

There isn’t much to it, you call flock() specifying either a shared or exclusive lock, with an option to make the call non-blocking. Afterwards you unlock the previously held lock.

The tmux code in question uses LOCK_NB for the initial call, then if that fails reverts to the default blocking operation so that the client can retry as soon as the lock is released.

Use fcntl() instead!

In contrast to flock(), setting up fcntl() is a bit more involved as it allows finer-grained control over the locking.

Set up struct flock

There is a flock structure which controls the lock, defined as:

struct flock {
        off_t       l_start;    /* starting offset */
        off_t       l_len;      /* len = 0 means until end of file */
        pid_t       l_pid;      /* lock owner */
        short       l_type;     /* lock type: read/write, etc. */
        short       l_whence;   /* type of l_start */
};

So, let’s set that up:

int
client_get_lock(char *lockfile)
{
	int lockfd;
	struct flock lock;

	lock.l_start = 0;
	lock.l_len = 0;
	lock.l_type = F_WRLCK;
	lock.l_whence = SEEK_SET;
...

Setting l_start and l_len both to 0 means ‘lock the entire file’, l_whence of SEEK_SET means we are setting absolute values rather than relative, and we set l_type to be a write lock. If we wanted a read lock, we’d use F_RDLCK here instead.

Note that l_pid is only used for a F_GETLK operation - we are only interested in attempting to set a lock, so it is left unset.

Add fcntl() calls

The synopsis for fcntl() is:

SYNOPSIS
     #include <fcntl.h>

     int
     fcntl(int fd, int cmd, ...);

and the commands available for file locking are (from BSD):

#define F_GETLK         7               /* get record locking information */
#define F_SETLK         8               /* set record locking information */
#define F_SETLKW        9               /* F_SETLK; wait if blocked */

Note there is no ‘clear lock’ command. To clear a lock, you use the F_GETLK command with l_type set to F_UNLCK.

So, to show how we would rewrite the flock() instances to fcntl() instead, I’ve put them together below:

#ifdef __sun
	if (fcntl(lockfd, F_SETLK, &lock) == -1 && errno == EAGAIN) {
		while (fcntl(lockfd, F_SETLKW, &lock) == -1 && errno == EINTR)
			/* nothing */;
		close(lockfd);
		return(-1);
	}
#else
	if (flock(lockfd, LOCK_EX|LOCK_NB) == -1 && errno == EWOULDBLOCK) {
		while (flock(lockfd, LOCK_EX) == -1 && errno == EINTR)
			/* nothing */;
		close(lockfd);
		return (-1);
	}
#endif

As you can see, they are very similar, making it relatively straight-forward to rewrite code to use the more portable fcntl().

An important note on semantics

There are two important difference between flock() and fcntl() you need to be aware of which may affect a simple conversion:

fcntl() locks are not held across a fork(), so you cannot pass locks down to child processes.
The semantics of fcntl() are such that any closure of a file descriptor in your application will release the locks held against that file. This is best illustrated with a lock on /etc/passwd that gets released if you call getpwname() as that opens and closes the /etc/passwd file.

Generally such semantics do not apply, but you should be aware of them, and it always pays to carefully read the manual pages, preferably those from BSD.

flock() -> fcntl() cheat sheet

To aid your own conversions, here are some further examples (without error checking for clarity) of flock() and fcntl() equivalents.

#if defined(USING_FLOCK)

	/* Blocking */
	flock(lockfd, LOCK_SH);  // Shared (read) lock
	flock(lockfd, LOCK_EX);  // Exclusive (write) lock

	/* Non-blocking */
	flock(lockfd, LOCK_SH|LOCK_NB);  // Shared (read) lock
	flock(lockfd, LOCK_EX|LOCK_NB);  // Exclusive (write) lock

	/* Release */
	flock(lockfd, LOCK_UN):

#elif defined(USING_FCNTL)

	struct flock lock;

	lock.l_start = 0;
	lock.l_len = 0;
	lock.l_whence = SEEK_SET;

	/* Blocking */
	lock.l_type = F_RDLCK;
	fcntl(lockfd, F_SETLKW, &lock);  // Shared (read) lock
	lock.l_type = F_WRLCK;
	fcntl(lockfd, F_SETLKW, &lock);  // Exclusive (write) lock

	/* Non-blocking */
	lock.l_type = F_RDLCK;
	fcntl(lockfd, F_SETLK, &lock);  // Shared (read) lock
	lock.l_type = F_WRLCK;
	fcntl(lockfd, F_SETLK, &lock);  // Exclusive (write) lock

	/* Release */
	lock.l_type = F_UNLCK;
	fcntl(lockfd, F_GETLK, &lock);

#endif

flock() wrapper

Alternatively, there is a flock() wrapper which is used by the NetBSD toolchain, which you could include in a compatability library or so:

int flock(int fd, int op) {
	int rc = 0;

#if defined(F_SETLK) && defined(F_SETLKW)
	struct flock fl = {0};

	switch (op & (LOCK_EX|LOCK_SH|LOCK_UN)) {
	case LOCK_EX:
		fl.l_type = F_WRLCK;
		break;

	case LOCK_SH:
		fl.l_type = F_RDLCK;
		break;

	case LOCK_UN:
		fl.l_type = F_UNLCK;
		break;

	default:
		errno = EINVAL;
		return -1;
	}

	fl.l_whence = SEEK_SET;
	rc = fcntl(fd, op & LOCK_NB ? F_SETLK : F_SETLKW, &fl);

	if (rc && (errno == EAGAIN))
		errno = EWOULDBLOCK;
#endif

	return rc;
}

It might be handy if the illumos folks merged this, it would immediate fix at least 19 packages in the pkgsrc collection :)

Summary

flock() is simpler, and retains locks across fork() and concurrent access boundaries, but at the cost of portability. Please try to use fcntl() where possible, it isn’t much harder to use, and makes your software run on more platforms.

Share this post on Twitter, HackerNews, Facebook or Google+

All Posts

16 Jul 2015 » Reducing RAM usage in pkgin
03 Mar 2015 » pkgsrc-2014Q4: LTS, signed packages, and more
06 Oct 2014 » Building packages at scale
04 Dec 2013 » A node.js-powered 8-bit CPU - part four
03 Dec 2013 » A node.js-powered 8-bit CPU - part three
02 Dec 2013 » A node.js-powered 8-bit CPU - part two
01 Dec 2013 » A node.js-powered 8-bit CPU - part one
21 Nov 2013 » MDB support for Go
30 Jul 2013 » What's new in pkgsrc-2013Q2
24 Jul 2013 » Distributed chrooted pkgsrc bulk builds
07 Jun 2013 » pkgsrc on SmartOS - creating new packages
15 Apr 2013 » What's new in pkgsrc-2013Q1
19 Mar 2013 » Installing SVR4 packages on SmartOS
27 Feb 2013 » SmartOS is Not GNU/Linux
18 Feb 2013 » SmartOS development preview dataset
17 Jan 2013 » pkgsrc on SmartOS - fixing broken builds
15 Jan 2013 » pkgsrc on SmartOS - zone creation and basic builds
10 Jan 2013 » Multi-architecture package support in SmartOS
09 Jan 2013 » Solaris portability - cfmakeraw()
08 Jan 2013 » Solaris portability - flock()
06 Jan 2013 » pkgsrc-2012Q4 illumos packages now available
23 Nov 2012 » SmartOS and the global zone
24 Oct 2012 » Setting up Samba on SmartOS
10 Oct 2012 » pkgsrc-2012Q3 packages for illumos
23 Aug 2012 » Creating local SmartOS packages
10 Jul 2012 » 7,000 binary packages for OSX Lion
09 Jul 2012 » 9,000 packages for SmartOS and illumos
07 May 2012 » Goodbye Oracle, Hello Joyent!
13 Apr 2012 » SmartOS global zone tweaks
12 Apr 2012 » Automated VirtualBox SmartOS installs
30 Mar 2012 » iptables script for Debian / Ubuntu
20 Feb 2012 » New site design
11 Jan 2012 » Set up anonymous FTP upload on Oracle Linux
09 Jan 2012 » Kickstart Oracle Linux in VirtualBox
09 Jan 2012 » Kickstart Oracle Linux from Ubuntu
22 Dec 2011 » Last day at MySQL
15 Dec 2011 » Installing OpenBSD with softraid
21 Sep 2011 » Create VirtualBox VM from the command line
14 Sep 2011 » Creating chroots for fun and MySQL testing
30 Jun 2011 » Graphing memory usage during an MTR run
29 Jun 2011 » Fix input box keybindings in Firefox
24 Jun 2011 » How to lose weight
23 Jun 2011 » How to fix stdio buffering
13 Jun 2011 » Serving multiple DNS search domains in IOS DHCP
13 Jun 2011 » Fix Firefox URL double click behaviour
20 Apr 2011 » SSH via HTTP proxy in OSX
09 Nov 2010 » How to build MySQL releases
29 Apr 2010 » 'apt-get' and 5,000 packages for Solaris10/x86
16 Sep 2009 » ZFS and NFS vs OSX
12 Sep 2009 » pkgsrc on Solaris
09 Dec 2008 » Jumpstart from OSX
31 Dec 2007 » Set up local caching DNS server on OSX 10.4