CS330 Intro to Threads

Highlights of this lab:

Preamble
POSIX Threads Commands
References

Lab Code

To get the sample and exercise code, please use the following commands in your cs330 directory:

  curl https://www.labs.cs.uregina.ca/330/Threads/Lab5.zip -O -s
  unzip Lab5.zip

Preamble

Sometimes it is more efficient to divide a problem into smaller problems that can be solved at the same time. One way to do this in modern operating systems is to fork the program into multiple programs that run in parallel and use files, pipes, and interprocess communications to coordinate their activities. However, creating a whole process for a sub-problem and coordinating communication between them can be resource intensive and may eliminate any benefits gained from running the processes in parallel. This is where threads come in. The following is from Interprocess Communications in Linux:

Aware of such limitations, the designers of modern versions of UNIX anticipated the need for constructs that would facilitate concurrent solutions but not be as system-intensive as separate processes. ... [W]hy not allow individual processes the ability to simultaneously take different execution paths through their process space? This idea led to a new abstraction called a thread. Conceptually, a thread is a distinct sequence of execution steps performed within a process.

For a thread to be able to run on its own it needs its own program counter, register set, call stack, and the ability to create thread specific variables. Everything else needed for a process is shared between its various threads including instructions, files, and virtual memory. This means there is less need to communicate via pipes, files or IPC shared memory. However, because files and global variables, and static variables can be altered by any thread, there is a greater need to synchronize access to shared resources. Because of this most threading systems also include include a simplified binary semaphore-type thing called a mutex, or mutual exclusion lock.

POSIX Threads Commands

In this lab you will see how to do the following with POSIX threads (pthreads):

create threads
exit from a thread correctly
wait for a thread to finish and interpret what it returns (join it)
tell a thread you will not wait for it (detach it)
use a mutex to synchronize thread activities

Since threads are a deep topic and we only have one lab to cover them, we will cover them superficially. We will survey the basic forms of the following calls:

pthread_create() - fork() + exec*() for threads
pthread_self() - get thread id
pthread_exit() - exit() for threads
pthread_join() - wait() for threads
pthread_detach() - tell a thread you won't join it (wait for it)
pthread_mutex_init() - get a mutex
pthread_mutex_lock() - lock a mutex (acquire it), block until available
pthread_mutex_trylock() - try to lock a mutex (acquire it), do not block if unavailable
pthread_mutex_unlock() - unlock a mutex (release it)
pthread_mutex_destroy() - destroy a mutex (remove it)

We will not cover how to create anything other than the default thread and mutex types. You should explore the references section if you want to learn more.

To use these commands you must include pthreads.h.

In Linux, you can request different versions of POSIX compliance. Consult the man page for feature_test_macros and search for POSIX to see what options are available.

Also in Linux, to compile you need to compile with -pthread to links against the pthread library and enable necessary pthread features in some headers:

g++ sample.cpp -pthread

Insead of using -pthread you can also define _REENTRANT and link against libpthread (-lpthread) but this is discouraged (see man pthreads for more).

If you are interested in writing POSIX threads programs on a Mac, you will be happy to know that you don't even need to specify -pthread – Mac programs are heavily threaded, and threading is enabled by default.

(What's reentrant you ask? Learn more from Wikipedia - Reentrant. Reentrancy is part of being thread-safe. You should look that up too: Wikipedia - Thread Safe. The basic idea is that, when you are using threads and can interrupt a function at any point, changes to shared global and static variables can cause all sorts of trouble. You are already familiar with one function that is not reentrant and not thread safe: strtok() ). You will find a full list of C library functions that are not thread-safe in man pthreads in the section "Thread-safe functions".

Creating Threads

pthread_create()

    #include <pthread.h>
    int pthread_create(pthread_t *tid, 
                       pthread_attr_t * attr_t, 
                       void *(* start_routine) (void *),
                       void *arg );

Notes:

Preconditions

tid: should be a reference to a user provided buffer of type pthread_t
pthread_attr_t: can point to custom attributes for the new thread.

Providing a NULL pointer results in a thread with default attributes. This is good enough for now.
See the pthread_attr_init man page for details on setting non default attributes.

start_routine: the new thread will execute this function first.

It should look like a C function.

In C++ this may require that the function be specified as extern "C". See sample code for details.

The function can use C++ code, but may not be a class member.

The function's prototype is: void * function_name(void *)

in error messages this will probably look like void * (*)(void *) which is nearly indecipherable
the void * argument and return type allow us to pass or return any type of data we want.

Postconditions:

If successful, pthread_create returns 0 and tid contains a unique thread ID
If unsuccessful, pthread_create returns an error number and sets errno.

When a pthread_thread() system call is made, the resulting operation is very system dependent. Some operating systems implement user level controls for managing threads. Some operating systems create lightweight kernel level processes managing threads. On some systems a process and its threads will only execute on one processor. On others they can execute on multiple processors. Modern Linux kernels support threads with the NPTL (Native POSIX Threads Library), which, according to the pthreads man page, is "a so-called 1:1 implementation, meaning that each thread maps to a kernel scheduling entity", meaing they can be scheduled to any processor on a correctly configured system.

The new POSIX thread will share some attributes with all other threads in the same process:

process ID (PID)
parent process ID (PPID)
open file descriptors
actions taken in response to signals
data segment (code/global variables/static variables/constants)
heap segment (dynamic memory)
process environment and information (see pthreads man page for details)

It will also have some distinct attributes:

thread ID
errno variable
call stack (for creating distinct local variables)
signal mask and queue

pthread_self()

    #include <pthread.h>
    pthread_t pthread_self();

Notes:

This function is like getpid() - it returns a thread's unique thread id. Remember that threads are supposed to share PIDs, so thread IDs are the way to distinguish them.

The following is an example of pthread_create

//
// Creating threads
//
// Based on p11.1.cxx from Interprocess Communications in Linux
// By: John Shapley Gray
// Adapted for CS330 by Alex Clarke
//

#include <iostream>
#include <cstdlib>
#include <cstdio>
#include <pthread.h>
#include <sys/types.h>
#include <unistd.h>

using namespace std;

extern "C" 
{
   void * say_it( void * );
}

int main(int argc, char *argv[])
{
   int num_threads;
   pthread_t *thread_ids;

   //Use unbuffered output on stdout
   setvbuf(stdout, (char *) NULL, _IONBF, 0);

   cout << "How many threads? ";
   cin >> num_threads;
   thread_ids = new pthread_t[num_threads];

   cout << "Making Threads" << endl;

   // generate threads 
   for (int i = 0; i < num_threads; i++)
   {
      if( pthread_create(&thread_ids[i],NULL,say_it,&thread_ids[i]) > 0)
      {
            cerr << "pthread_create failure" << endl;
            return 2;
      }
   }

   // wait a bit
   cout << "Making Changes in a Moment" << endl;
   sleep(1);

   // modify contents of arguments to threads
   for (int i = 0; i < num_threads; i++)
   {
      thread_ids[i] = i;
   }

   //wait a bit more
   sleep(2);

   system("bash -c 'read -sn 1 -p \"Press any key to quit...\" ' ");
   cout << endl;
   delete [] thread_ids;
   return 0;
}

// Print out the thread number twice
void * say_it(void *num)
{
   cout << "I am thread #" << *(unsigned int *)(num) << "." << endl;
   sleep (2);
   cout << "I am thread #" << *(unsigned int *)(num) << "." << endl;
   return NULL;
}

This program creates a user specified number of threads, passing them their thread id. Each thread prints that id twice.

Is the id the same both times? Why or why not?
See if you can wait for the change in id using pthread_self() rather than using a sleep.
What would happen if we provided the loop counter i as the argument to the new thread? How would you do that?

Managing Threads

If you do not detach or join a thread, then when it exits it will become a zombie thread and consume some system resources. This can really cause trouble, so make sure you take care of the zombies before they take care of you. They will eat your computer's brains.

All threads return a void * when they are done. This can happen at the end of the start routine, or when pthread_exit() is called. The value returned can be a pointer to anything in process memory, which makes the a thread's returned value much more powerful than that of a process. This value, along with the thread's state, will be either accepted and freed by another thread that is waiting on it with pthread_join(), discarded if the thread is detached, or be freed when the process quits if neither of the other two conditions is met.

pthread_exit()

    #include <pthread.h>
    void pthread_exit(void *retval);

Notes:

can be used to exit a thread at any time
threads can also exit by returning a void * at the end of their start routine

pthread_join()

    #include <pthread.h>
    int pthread_join(pthread_t tid, void **retval);

Notes:

used to catch the value returned by a thread
Preconditions:

tid: is a valid thread id
retval: is a reference to a void pointer

Postconditions:

If successful, pthread_join() will return 0 and

the calling thread will block until thread tid has exited
retval: the void pointer this refers to will have been modified and can be interpreted however the programmer wishes, eg:

the thread could have returned a reference to an integer, a string, or even a data structure
the thing being returned should exist after the thread has returned, meaning it should have been a global variable or dynamically allocated (in which case it should be freed)

if unsuccessful, pthread_join() will return an error number and set errno

pthread_detach()

    #include <pthread.h>
    int pthread_detach(pthread_t tid);

Notes:

used to indicate that a thread's state can be discarded when it exits
threads are not detached by default, but can be created as detached by setting attributes at creation time
Preconditions:

tid refers to a valid thread

Postconditions:

If successful, pthread_detach() will return 0 and the thread tid will be detached.
If unsuccessful, pthread_detach() will return an error number and set errno

The following demonstrates a sample use of pthread_exit() and pthread_join() :

//
// Joining threads and interpreting exit values
//
// Based on p11.1.cxx from Interprocess Communications in Linux
// By: John Shapley Gray
// Adapted for CS330 by Alex Clarke
//

#include <iostream>
#include <cstdlib>
#include <cstdio>
#include <pthread.h>
#include <sys/types.h>
#include <unistd.h>

using namespace std;

//Thread start 
extern "C" 
{
   void * say_it( void * );
}


int main(int argc, char *argv[])
{
   int num_threads;
   pthread_t *thread_ids;
   void  *p_status;

   //Use unbuffered output on stdout
   setvbuf(stdout, (char *) NULL, _IONBF, 0);

   cout << "How many threads? ";
   cin >> num_threads;
   thread_ids = new pthread_t[num_threads];

   cout << "Displaying" << endl;

   // generate threads 
   for (int i = 0; i < num_threads; i++)
   {
      int *arg = new int;
      *arg = i;
      if( pthread_create(&thread_ids[i],NULL,say_it,arg) > 0)
      {
                perror("creating thread:");
              return 2;
      }
   }

   // join threads and print their return values
   for (int i = 0; i < num_threads; i++)
   {
      if (pthread_join(thread_ids[i], &p_status) != 0)
      {
         perror("trouble joining thread: ");
         return 3;
      }
      cout << "Thread " << i << ": " << (char *)p_status << endl;

      delete [] (char *)p_status;
   }

   delete [] thread_ids;

   return 0;
}

// Build a message and return it at exit
void * say_it(void *num)
{
   int t_num = *(int *)num;
   char *msg = new char[255];
   cout << "Building message for thread" << t_num << endl;
   sleep(1);
   if (t_num == 5)
   {
      snprintf(msg, 255, "I am not %lX. I am #%d. I. AM. ALIVE.",
               pthread_self(), t_num);
      pthread_exit(msg);
   }
   snprintf(msg, 255, "My thread id was %lX. Goodbye...", pthread_self());
   return msg;
}

Synchronizing Threads

Just as with forked processes, and perhaps moreso, it is important to synchronize access to resources between threads. You need to worry about access to global variables now as well. You can't do this with the semaphores you have already learned because they are not thread safe. Fortunately the POSIX threads API offers many methods to synchronize access to resources. This week we will focus on the mutex. A mutex is like a single binary semaphore with a couple small differences:

0 means the mutex is available, 1 means it is locked - this is the opposite of how semaphores work
only the thread that locks a mutex can unlock it whereas any process or thread can release a semaphore

Other synchronization methods supported by the POSIX threads API include condition variables, read/write locks, and multithread semaphores.

pthread_mutex_init()

    #include <pthread.h>
    int pthread_mutex_init(pthread_mutex_t *mutex, const pthread_mutexattr_t *attr);

used to create a new mutex
Preconditions:

mutex: refers to a real variable of type pthread_mutex_t
attr: can refer to a custom attrubutes for the new mutex

Providing a NULL pointer results in a mutex with default attributes. This is good enough for now.
The Linux man pages for POSIX mutexes are poor. I recommend you read Interprocess Communications in Linux if you want more details.

Postconditions:

If successful, pthread_mutex_init() will return 0 and mutex will refer to a valid mutex.
If unsuccessful, pthread_mutex_init() will return an error number and set errno

pthread_mutex_lock()
pthread_mutex_trylock()

    #include <pthread.h>
    int pthread_mutex_lock(pthread_mutex_t *mutex);
    int pthread_mutex_trylock(pthread_mutex_t *mutex);

used to get and lock access to serial (one at a time) resources
Preconditions:

mutex: refers to a valid mutex created by pthread_mutex_init

Postconditions:

If successful, the function will return 0 and the thread will have locked the mutex and gained access to the resource.

pthread_mutex_lock() blocks until the mutex is available.

If unsuccessful, the function will return an error number and set errno.

pthread_mutex_unlock()will fail with error number EBUSY if the mutex is unavailable. The thread can do other work before trying again.

pthread_mutex_unlock()

    #include <pthread.h>
    int pthread_mutex_unlock(pthread_mutex_t *mutex);

used to unlock access to serial (one at a time) resources
only the thread that locked a mutex may unlock it
Preconditions:

mutex: refers to a valid mutex created by pthread_mutex_init

Postconditions:

If successful, the function will return 0 and the thread will have unlocked the mutex
If unsuccessful, the function will return an error number and set errno.

pthread_mutex_destroy()

    #include <pthread.h>
    int pthread_mutex_destroy(pthread_mutex_t *mutex);

used to free up unneeded mutexes
a locked mutex will not be destroyed - the attempt will fail
Preconditions:

mutex: refers to a valid mutex created by pthread_mutex_init

Postconditions:

If successful, the function will return 0 and the thread will have destroyed the mutex
If unsuccessful, the function will return an error number and set errno.

Mutex Example - Using Mutexes to Control Output

You may have noticed that the output of the previous examples is a bit... messy. This program adds a mutex that controls access to stdout. Now our eyes rejoice as only one thread at a time may have access to this precious resource.

//
// Controlling output with mutexes.
//
// Based on p11.1.cxx from Interprocess Communications in Linux
// By: John Shapley Gray
// Adapted for CS330 by Alex Clarke
//

#include <iostream>
#include <cstdlib>
#include <cstdio>
#include <pthread.h>
#include <sys/types.h>
#include <unistd.h>

using namespace std;

pthread_mutex_t output_lock;

void * say_it( void * );

int main(int argc, char *argv[])
{
   int num_threads;
   pthread_t *thread_ids;
   void  *p_status;

   //Use unbuffered output on stdout
   setvbuf(stdout, (char *) NULL, _IONBF, 0);

   //Set up an output lock so that threads wait their turn to speak.
   if (pthread_mutex_init(&output_lock, NULL)!=0)
   {
      perror("Could not create mutex for output: ");
      return 1;
   }

   cout << "How many threads? ";
   cin >> num_threads;
   thread_ids = new pthread_t[num_threads];

   cout << "Displaying" << endl;

   // generate threads 
   for (int i = 0; i < num_threads; i++)
   {
      int *arg = new int;
      *arg = i;
      if( pthread_create(&thread_ids[i],NULL,say_it,arg) > 0)
      {
                perror("creating thread:");
              return 2;
      }
   }

   // join threads and print their return values
   for (int i = 0; i < num_threads; i++)
   {
      if (pthread_join(thread_ids[i], &p_status) != 0)
      {
         perror("trouble joining thread: ");
         return 3;
      }

      //Threads may still be building their return, so lock stdout
      if (pthread_mutex_lock(&output_lock) != 0)
      {
          perror("Could not lock output: ");
          return 4;
      }
      cout << "Thread " << i << ": " << (char *)p_status << endl;
      if (pthread_mutex_unlock(&output_lock) != 0)
      {
          perror("Could not unlock output: ");
          return 5;
      }

      delete [] (char *)p_status;
   }

   return 0;
}

// 
void * say_it(void *num)
{
   int t_num = *(int *)num;
   char *msg = new char[255];

   if (pthread_mutex_lock(&output_lock) != 0)
   {
       perror("Could not lock output: ");
       exit(4); //something horrible happened - exit whole program with error
   }
   cout << "Building message for thread" << t_num << endl;
   if (pthread_mutex_unlock(&output_lock) != 0)
   {
       perror("Could not unlock output: ");
       exit(5); //something horrible happened - exit whole program with error
   }

   if (t_num == 6)
   {
      snprintf(msg, 255, "My thread id is %lX, but I am so much more. I. AM. ALIVE.",
              pthread_self());
      pthread_exit(msg);
   }
   snprintf(msg, 255, "My thread id was %lX. Goodbye...", pthread_self());
   return msg;
}

References and Related Materials

Interprocess Communications in Linux--The Nooks and Crannies by John Shapley Gray (Chapter 11: pages 311-321 & 333-342)

This resource is available through the university's Safari e-books portal (requires @uregina.ca login if you are off campus)

man pages: pthread_create, pthread_join, pthread_detach, pthread_exit, pthread_destroy, pthread_mutex_init, pthread_mutex_lock
Linux POSIX thread programming in C++, including how to pass C function pointers from C++ when creating threads: http://www.yolinux.com/TUTORIALS/LinuxTutorialPosixThreads.html
Some info on non-POSIX thread libraries: http://www.intel.com/cd/ids/developer/asmo-na/eng/54214.htm?prn=Y
C++11 thread library, based on POSIX style, but Object-Oriented and truly cross-platform. Cplusplus.com links: threads and simple mutexes.

CS330 Intro to Threads

Highlights of this lab:

Lab Code

Preamble

POSIX Threads Commands

Creating Threads

pthread_create()

pthread_self()

Managing Threads

pthread_exit()

pthread_join()

pthread_detach()

Synchronizing Threads

pthread_mutex_init()

pthread_mutex_lock() pthread_mutex_trylock()

pthread_mutex_unlock()

pthread_mutex_destroy()

References and Related Materials

pthread_mutex_lock()
pthread_mutex_trylock()