Using GNU's GDB Debugger

By Peter Jay Salzman (re-written from WebArchive by Cameron Farvin @assemblyDruid)

Chapter 0: Administrata

Why Write This Tutorial?

This is one of the most comprehensive GDB tutorials on the Internet. It's more than you'd find in most books, which tend to discuss GDB as a lightning-fast afterthought. I wrote this document because I couldn't find a good GDB tutorial. The only comprehensive source of information about GDB is GNU's GDB User's Manual, but learning GDB from it is like learning a foreign language from a dictionary.

I'll be using sample programs, and there will be links to the source code in each section that uses them, along with compilation instructions. I urge you to download the code and follow along with the examples. Following along, doing it yourself as you read, is really the best way to learn.

Acknowledgements and Dedication

This tutorial is sincerely and respectfully dedicated to Richard M. Stallman, the most important and under appreciated hero of the Free Software movement.

I'm in a perpetual state of learning, and thanks goes to the following people who've helped me understand C and GDB:

Will Deutsch: For answering questions about GDB.
Mike Simons: For answering questions about GDB.
Paul Hinton: of Wolfram Research for convincing me to try this crazy thing called "GNU/Linux".
Jeff Newmiller: Who has yet to be stumped by any question I throw at him.
Norm Matloff: Who seems to know everything that I don't know (which is a LOT!)
Mark K. Kim: Who never tires of my questions and has an amazing ability to incorporate out-of-box thinking with formal learning. A true hacker, good friend, and humble guy.

Authorship and Copyright

This entire tutorial is copyright (c) 2004 Peter Jay Salzman, [email protected]. Permission is granted to copy, distribute and/or modify it under the terms of The Open Source License, version 1.1. You can find a copy of this license at opensource.org/licenses/osl.php

The canonical and most updated version of this document can be found at www.dirac.org/linux/gdb.

If you want to create a derivative work or publish this document for commercial purposes, I would appreciate it if you contacted me first. This will give me a chance to give you the most recent version. It'll also stroke my ego. I'd also appreciate either a copy of whatever it is you're doing or a spinach, garlic, mushroom, feta cheese and artichoke heart pizza.

About Exercises

There are exercises at the end of most sections. The exercises are mandatory. These exercises are designed to both cover topics I don't formally cover, and give you experience using your new-found skills.

There are topics I don't cover except for in the exercises. This isn't because I'm lazy. It's because I want you to think. Use your noggin to begin understanding concepts in your own words, not in my words. I want you to develop intuition. The best debugging tool is not GDB. And it certainly isn't printf(). The best debugging tool is your brain.

Thank You's

The following people sent in corrections:

Nick
Jason E. Siefken
Eric T. Stuebe
Jeff Terrell
Lawrence Poorman
Yi Yang
Aaron Mayerson
Doug Yoder

A Plug For The EFF

If you're not a member of the EFF, you must stop everything you're doing and become a member right this moment. 9/11 was a horrible tragedy; I was in New York City at the time and witnessed the chaos with my own two eyes. I love my country, and am a very proud United States citizen, but the steady erosion of our freedoms and civil liberty is another tragic casualty of the post 9/11 era. I'm very worried for my country.

The EFF is the most important defense we have in protecting our on-line and digital rights. If you have any interest in protecting your civil liberties in a digital age that has gone out of balance, please read their very short mission. Please consider becoming a member of the EFF. Honestly, it's only the price of a pizza. Or the cost of two movie tickets plus popcorn.

A Request For Help

This tutorial took (takes?) more time than I care to admit. It's a tremendous job. If you found this tutorial to be at all useful, please consider helping me maintain and actively develop it. There are many ways you can help. Pick one that suits you or your talents (in no particular order):

Become a member of the EFF, or buy their merchandise.
Become a member of the FSF, or buy their merchandise.
If you're handy with HTML or PHP, please send an email offering to help with the website. I don't want fancy pages (I want lynx/links users to be able to use this site), but I'm really just a "hack" at HTML and PHP. If you're handy with design or formatting, please offer some advice on how to make my pages more readable and good looking.
Report spelling errors, technical errors, and broken links.
Email me questions. Tell me if something isn't clear.
Send me a picture postcard from where you live. Better yet, send me a letter with either a picture of yourself or the area where you live. Let me know how you got involved with free software and what you use GDB for. My mailing address is Peter Jay Salzman, 111 Montague Street, Brooklyn NY 11201, USA.
Proofread.
If you live in Italy, email me pictures of pizza from your country. No, this isn't a joke. I really love pizza.

I may not respond to all email. It really depends on how busy my life is at the moment (this isn't my whole life, you see). If I have time, I'll try to reply. But please don't get upset if I can't reply. It means I'm swamped with work and can't afford to reply. Rest assured, even if you don't hear from me, receiving an email from you is still helpful. It reassures me that people are reading and benefiting from my work.

Chapter 1: Introduction

What Is A Debugger?

A debugger (or more accurately, symbolic debugger), is an application that runs your program, just like you can, when you type the name of your program. The difference is, a debugger can step through your source code, line by line, executing each line only when you want it to. You can even step through your program machine instruction by machine instruction (try that with printf())! At any point, you can inspect and even change the value of any variable at run-time. If your program crashes, a symbolic debugger tells you where and why the program crashed so you can deduce what went wrong. You can go through the program and see what source code lines get executed and in what order.

Do you have an infinite loop? No problem! Use a debugger to step through the loop and see why your conditional fails to do what you had expected. Did the program crash on a variable access? No problem! The debugger will tell you all sorts of information about the variable you tried to access and the value you assigned (or perhaps didn't assign) to it. Is there a line in your code which isn't executing? No problem! Use the debugger to see what gets executed, in what order, and why a particular line isn't getting reached! Other than a compiler, the debugger is the most useful tool a programmer can use.

Why Not Use `printf()`?

Most people use the printf() debugging method. This is called adding "trace code" to your program. Simply put, they sprinkle their code with printf() to view the value of variables at certain strategic points and also to examine the order of execution of lines of source code.

There are a few reasons why this may not be the best way of doing things:

Sometimes you need a lot of printf()'s, and it can get tedious putting them in and taking them out. Inserting and deleting superfluous code all the time is really distracting. It draws attention away from what you're doing. It's like trying to implement a linked list while someone is talking to you about last night's Futurama episode.
A symbolic debugger can do an awful lot that printf() can't. You can do just about anything you can think of, including changing the value of variables at run-time, halt the program temporarily, list source code, printo the datatype of a variable or struct that you don't recognize, jump to an arbitrary line of code, and much, much more.
You can use a symbolic debugger on a running process; you don't even have to kill the process! Try that with printf()!
You can use a symbolic debugger on a process that has already crashed and died without having to re-run the program. You'll see the state the program was in at the time of death and can inspect all the variables.
A knowledge of GDB will increase your knowledge of programs, processes, memory and your language of choice.

You'll be able to find and fix your bugs faster using a symbolic debugger like GDB. However, this isn't to say that printf() has no use in debugging. Sometimes it's the best way to go. However, for real code, a debugger can almost always get the job done orders of magnitude faster and easier. And using a debugger is always more elegant, and if you don't care about elegance, you should quit programming on Linux and start using Visual C++.

What Is GDB?

In the previous section I told you what a symbolic debugger is. There are actually MANY symbolic debuggers, and in the next section I'll mention some of them. However, this tutorial is about one particular debugger which I use, called GDB.

GDB is a debugger which is part of the Free Software Foundation's GNU operating system. Its original author is Richard M. Stallman (affectionately known as "RMS", one of the finest heroes of the free software movement), and has a long and impressive list of contributors, including some interesting corporate sponsorship for support under various architectures. It's a wonderful piece of software and outclasses nearly every other debugger I've seen, including commercial ones.

GDB can be used to debug C, C++, Objective-C, Fortran, Java and Assembly programs. There's partial support for Modula-2 and Pascal. It'll run on any architecture you can think of that supports Unix, so learning GDB on your home PC will give you the power to debug code anywhere Unix can run!

Way back when, dbx was the canonical debugger people used on Unix systems. With the advent of GNU being the standard by which all Unix systems are measured, GDB became the canonical debugger of the debugging world. As a result, even commercial debuggers have a tendency to be command compatible (or even idea compatible) with GDB, so learning GDB will enable you to use a whole slew of other debuggers. In short, if you learn GDB, you will be able to debug anything almost anywhere with any debugger in the Unix world.

GDB's homepage is located at www.gnu.org/software/gdb/gdb.htmland as of Nov 2006, is up to version 6.4.

GDB is copyleft software (meaning that not only is GDB free software, but all publicly released derivatives and enhancements people make to GDB must also be free) and is licensed under the GNU GPL

Other Symbolic Debuggers

This section documents other debuggers, both actively developed and long gone. I give a short history when the information is available. Any additions (history, debuggers not listed here, other front ends, screenshots), please let me know.

Debuggers

The first debugger that I know of was called dbx, and like GDB, was command line driven. The text UI of GDB was written to resemble dbx, although the two debuggers are not command line compatible. Other symbolic debuggers were written so that their UI resembled dbx (or GDB) as well. For this reason, you'll find many command line debuggers to be quite similar. If you learn to use GDB, you'll largely be able to navigate through most other debuggers.
ups is another debugger originally developed by Mark Russell but is now updated by Rod Armstrong. It also comes with its own theme song. Ups includes a C interpreter which allows you to add fragments of code simply by editing them into the source window (the source file itself is not modified). Perversely, this lets you add debugging printf() calls without recompiling, relinking or even restarting the target program. ups supports C, C++ and limited FORTRAN debugging on SunOS, Solaris, Linux and FreeBSD.
The Portland Group sells an excellent high-quality GUI debugger named pgdbg. pgdbg specializes in debugging all kinds of parallel code on many different kinds of clusters (distributed memory, SMP servers, etc). While pgdb is a very high-powered debugger, it's also expensive.

Front Ends

xwpe.
The most popular GDB front end is DDD, the Data Display Debugger which uses the Motif widget set. DDD has some nice features: it can give you graphical representations of linked lists, ADT's and trees. In addition, DDD is a front end to the Python, Java and Perl debuggers as well. I personally don't use DDD much, but I still appreciate it. DDD used to be quite buggy. Over the years it has stopped crashing regularly(!) on me, but as of March 2003, still crashes on a blue moon. In addition, the pop-up command tool definitely has "issues" with window managers that have multiple screens, like Enlightenment.
tgdb is a Tcl/Tk front end for GDB first written in 1994 by a company named HighTec EDV-Systeme GmbH, in Germany. It was shareware (asking price was $30). Development and support seems to have ended many years ago. It shouldn't be confused with "trivial gdb" which is also called tgdb.
xdbx is a front end to dbx that was created by Po Cheung of Microelectronics and Computer Technology Corporation (MCC) in March 10, 1989. It uses the old X Athena widget set (libxaw). It has its own license which is open source but not copyleft. Development died a long, long time ago.
xxgdb is a front end to GDB that was created in December 1990 by Pierre Willard. It has its own license which is open source but not copyleft. It's built from the source code for xdbx; basically, xxgdb is xdbx adapted to GDB instead of dbx. xxgdb uses the old X Athena widget set (libxaw). It currently doesn't run on any system that uses unix98 posix TTYs. Development died in 2002. It most likely doesn't work with current versions of GDB.
mxgdb is a Motif based front end for GDB written by Jim Tsillas back in January 3 1992. mxgdb is based on xxgdb: Jim ported xxgdb from the Athena widget set to the Motif widget set (in turn, xxgdb was a GDB port of xdbx). It's licensed under the GNU GPL and was last maintained (I think) by Robert Stockmann. It most likely doesn't work with current GDB versions.

Chapter 2: Memory Layout and the Stack

Where Are We Going To Go?

To effectively learn how to use GDB, you must understand frames, which are also called stack frames because they're the frames that comprise the stack. To learn about the stack, we need to learn about the memory layout of an executing program. The discussion will mainly be theoretical, but to keep things interesting we'll conclude the chapter with an example of the stack and stack frames using GDB.

The material learned in this chapter may seem rather theoretical, but it does serve a few very useful purposes:

Understanding the stack is absolutely necessary for using a symbolic debugger like GDB.
Knowing the memory layout of a process will help us understand what exactly a segmentation fault (or segfault) is, and why they happen (or sometimes, more importantly) don't happen when they should. In brief, segfaults are the most common immediate cause for a program to bomb.
A knowledge of a program's memory space can often allow us to figure out the location of well-hidden bugs without the use of print() statements, a compiler or even GDB! In the next section, which is a guest written piece by one my friends, Mark Kim, we'll see some real Sherlock Holmes style sleuthing. Mark homes in on a well hidden bug in somewhat lengthy code. It only took him about 5 or 10 minutes, and all he did was look at the program and use his knowledge of how a program's memory space works. It's really impressive!

So without futher ado, let's take a look at how programs are laid out in memory.

Virtual Memory

Whenever a process is created, the kernel provides a chunk of physical memory which can be located anywhere at all. However, through the magic of virtual memory (VM), the process believes it has all the memory on the computer. You might have heard "virtual memory" in the context of using hard drive space as memory when RAM runs out. That's called virtual memory too, but is largely unrelated to what we're talking about. The VM we're concerned with consists of the following principles:

Each process is given physical memory called the process's virtual memory space.
A process is unaware of the details of its physical memory (i.e. where it physically resides). All the process knows is how big the chunk is and that its chunk begins at address 0.
Each process is unaware of any other chunks of VM belonging to other processes.
Even if the process did know about other chunks of VM, it's physically prevented from accessing that memory.

Each time a process wants to read or write to memory, its request must be translated from a VM address to a physical memory address. Conversely, when the kernel needs to access the VM of a process, it must translate a physical memory address into a VM address. There are two major issues with this:

Computers constantly access memory, so translations are very common; they must be lighting fast.
How can the OS ensure that a process doesn't trample on another process's VM?

The answer to both questions lies in the fact that the OS doesn't manage VM by itself; it gets help from the CPU. Many CPUs contain a device called an MMU: a memory management unit. The MMU and the OS are jointly responsible for managing VM, translating between virtual and physical addresses, enforcing permissions on which processes are allowed to access which memory locations, and enforcing read/write permissions on sections of a VM space, even for the process that owns that space.

It used to be the case that Linux could only be ported to architectures that had an MMU (so Linux wouldn't run on, say, an x286). However, in 1998, Linux was ported to the 68000 which had no MMU. This paved the way for embedded Linux and Linux on devices such as the Palm Pilot.

Exercises

Read a short Wikipedia blurb on the MMU
Optional: If you want to know more about VM, here's a link. This is much more than you need to know.

Memory Layout

That's how VM works. For the most part, each process's VM space is laid out in a similar and predictable manner.

Memory	Program Stack	Notes
High Addresses	Args & Environment Vars	Command line arguments & Environment Vars
	Stack (grows down)
	Unused Memory
	Heap (grows up)
	Uninitialized Data Segment (bss)	Initialized to 0 by `exec`
	Initalized Data Segment	Read from the program file by `exec`
Low Adresses	Text Segment	Read from the program file by `exec`

Text Segment: The text segment contains the actual code to be executed. It's usually sharable, so multiple instances of a program can share the text segment to lower memory requirements. This segment is usually marked read-only so a program can't modify its own instructions.
Initialized Data Segment: This segment contains global variables which are initialized by the programmer.
Uninitialized Data Segment: Also named "bss" (block started by symbol) which was an operator used by an old assembler. This segment contains uninitialized global variables. All variables in this segment are initialized to 0 or NULL pointers before the program begins to execute.
The stack: The stack is a collection of stack frames which will be described in the next section. When a new frame needs to be added (as a result of a newly called function), the stack grows downward.
The heap: Most dynamic memory, whether requested via C's malloc() and friends or C++'s new is doled out to the program from the heap. The C library also gets dynamic memory for its own personal workspace from the heap as well. As more memory is requested "on the fly", the heap grows upward.

Given an object file or an executable, you can determine the size of each section (realize we're not talking about memory layout; we're talking about a disk file that will eventually be resident in memory). Given , Makefile:

1   // hello_world-1.c
2
3   #include <stdio.h>
4
5   int main(void)
6   {
7      printf("hello world\n");
8
9      return 0;
10  }

compile it and link it separately with:

   $ gcc -W -Wall -c hello_world-1.c
   $ gcc -o hello_world-1  hello_world-1.o

You can use the size command to list out the size of the various sections:

   $ size hello_world-1 hello_world-1.o
   text   data   bss    dec   hex   filename
    916    256     4   1176   498   hello_world-1
     48      0     0     48    30   hello_world-1.o

The data segment is the initialized and uninitialized segments combined. The dec and hex sections are the file size in decimal and hexadecimal format respectively.

You can also get the size of the sections of the object file using "objdump -h" or "objdump -x".

   $ objdump -h hello_world-1.o

   hello_world-1.o:     file format elf32-i386

   Sections:
   Idx Name          Size      VMA       LMA       File off  Algn
     0 .text         00000023  00000000  00000000  00000034  2**2
                     CONTENTS, ALLOC, LOAD, RELOC, READONLY, CODE
     1 .data         00000000  00000000  00000000  00000058  2**2
                     CONTENTS, ALLOC, LOAD, DATA
     2 .bss          00000000  00000000  00000000  00000058  2**2
                     ALLOC
     3 .rodata       0000000d  00000000  00000000  00000058  2**0
                     CONTENTS, ALLOC, LOAD, READONLY, DATA
     4 .note.GNU-stack 00000000  00000000  00000000  00000065  2**0
                     CONTENTS, READONLY
     5 .comment      0000001b  00000000  00000000  00000065  2**0
                     CONTENTS, READONLY

Exercises

The size command didn't list a stack or heap segment for hello_world or hello_world.o. Why do you think that is?
There are no global variables in hello_world-1.c. Give an explanation for why size reports that the data and bss segments have zero length for the object file but non-zero length for the executable.
size and objdump report different sizes for the text segment. Can you guess where the discrepancy comes from? Hint: How big is the discrepancy? See anything of that length in the source code?
Optional: Read this link about object file formats.

Stack Frames And The Stack

You just learned about the memory layout for a process. One section of this memory layout is called the stack, which is a collection of stack frames. Each stack frame represents a function call. As functions are called, the number of stack frames increases, and the stack grows. Conversely, as functions return to their caller, the number of stack frames decreases, and the stack shrinks. In this section, we learn what a stack frame is. A very detailed explanation, but we'll go over what's important for our purposes.

A program is made up of one or more functions which interact by calling each other. Every time a function is called, an area of memory is set aside, called a stack frame, for the new function call. This area of memory holds some crucial information, like:

Storage space for all the automatic variables for the newly called function.
The line number of the calling function to return to when the called function returns.
The arguments, or parameters, of the called function.

Each function call gets its own stack frame. Collectively, all the stack frames make up the call stack. We'll use hello_world-2.c for the next example.

1   #include <stdio.h>
2   void first_function(void);
3   void second_function(int);
4
5   int main(void)
6   {
7      printf("hello world\n");
8      first_function();
9      printf("goodbye goodbye\n");
10
11     return 0;
12  }
13
14
15  void first_function(void)
16  {
17     int imidate = 3;
18     char broiled = 'c';
19     void *where_prohibited = NULL;
20
21     second_function(imidate);
22     imidate = 10;
23  }
24
25
26  void second_function(int a)
27  {
28     int b = a;
29  }

When the program starts, there's one stack frame, belonging to main(). Since main() has no automatic variables, no parameters, and no function to return to, the stack frame is uninteresting. Here's what the stack looks like just before the call to first_function() is made.

Frame for main()
   [ end of frame ]

When the call to first_function() is made, unused stack memory is used to create a frame for first_function(). It holds four things: storage space for an int, a char, and a void *, and the line to return to within main(). Here's what the call stack looks like right before the call to second_function() is made.

Frame for main()
   [ end of frame ]
Frame for first_function()
   Return to main(), line 9
   Storage space for an int
   Storage space for a char
   Storage space for a void *
   [ end of frame ]

When the call to second_function() is made, unused stack memory is used to create a stack frame for second_function(). The frame holds 3 things: storage space for an int and the current address of execution within second_function(). Here's what the stack looks like right before second_function() returns.

Frame for main()
   [ end of frame ]
Frame for first_function():
   Return to main(), line 9
   Storage space for an int
   Storage space for a char
   Storage space for a void *
   [ end of frame ]
Frame for second_function():
   Return to first_function(), line 22
   Storage space for an int
   Storage for the int parameter named a
   [ end of frame ]

When second_function() returns, its frame is used to determine where to return to (line 22 of first_function()), then deallocated and returned to stack. Here's what the call stack looks like after second_function() returns:

Frame for main()
   [ end of frame ]
Frame for first_function():
   Return to main(), line 9
   Storage space for an int
   Storage space for a char
   Storage space for a void *
   [ end of frame ]

When first_function() returns, its frame is used to determine where to return to (line 9 of main()), then deallocated and returned to the stack. Here's what the call stack looks like after first_function() return:

Frame for main()
   [ end of frame ]

And when main() returns, the program ends.

Exercises

Suppose a program makes 5 function calls. How many frames should be on the stack?
We saw that the stack grows linearly downward, and that when a function returns, the last frame on the stack is deallocated and returned to unused memory. Is it possible for a frame somewhere in the middle of the stack to be returned to unused memory? If it did, what would that mean about the running program?
Can a goto() statement cause frames in the middle of the stack to be deallocated? The answer is no, but why?
Can longjmp() cause frames in the middle of the stack to be deallocated?

The Symbol Table

A symbol is a variable or a function. A symbol table is exactly what you think: it's a table of variables and functions within an executable. Normally, symbol tables contain only memory addresses of symbols, since computers don't use (or care) what we name variables and functions.

But in order for GDB to be useful to us, it needs to be able to refer to variable and function names, not their addresses. Humans use names like main() or i. Computers use addresses like 0x804b64d or 0xbffff784. To that end, we can compile code with "debugging information" which tells GDB two things:

How to associate the address of a symbol with its name in the source code.
How to associate the address of a machine code with a line of source code.

A symbol table with this extra debugging information is called an augmented or enhanced symbol table. Because gcc and GDB run on so many different platforms, there are many different formats for debugging information:

stabs: The format used by DBX on most BSD systems.
coff: The format used by SDB on most System V systems before System V Release 4.
xcoff: The format used by DBX on IBM RS/6000 systems.
dwarf: The format used by SDB on most System V Release 4 systems.
dwarf2: The format used by DBX on IRIX 6.
vms: The format used by DEBUG on VMS systems.

In addition to debugging formats, GDB understands enhanced variants of these formats that allow it to make use of GNU extensions. Debugging an executable with a GNU enhanced debugging format with something other than GDB could result in anything from it working correctly to the debugger crashing.

Don't let all these formats scare you: in the next section, I'll show you that GDB automagically picks whatever format is best for you. And for the .1% of you that need a different format, you're already knowledgeable enough to make that decision.

Preparing An Executable For Debugging

If you plan on debugging an executable, a corefile resulting from an executable, or a running process, you must compile the executable with an enhanced symbol table. To generate an enhanced symbol table for an executable, we must compile it with gcc's -g option:

   gcc -g -o filename filename.c

As previously discussed, there are many different debugging formats. The actual meaning of -g is to produce debugging information in the native format for your system.

As an alternative to -g, you can also use gcc's -ggdb option:

   gcc -ggdb -o filename filename.c

which produces debugging information in the most expressive format available, including the GNU enhanced variants previously discussed. I believe this is probably the option you want to use in most cases.

You can also give a numerical argument to -g, -ggdb and all the other debugging format options, with 1 being the least amount of information and 3 being the most. Without a numerical argument, the debug level defaults to 2. By using -g3 you can even access preprocessor macros, which is really nice. I suggest you always use -ggdb3 to produce an enhanced symbol table.

Debugging information compiled into an executable will not be read into memory unless GDB loads the executable. This means that executables with debug information will not run any slower than executables without debug information (a common misconception). While it's true that debugging executables take up more disk space, the executable will not have a larger "memory footprint" unless it's from within GDB. Similarly, executable load time will be nearly the same, again, unless you run the debug executable from within GDB.

One last comment. It's certainly possible to perform compiler optimizations on an executable which has an augmented symbol table, in other words: gcc -g -O9 try1.c. In fact, GDB is one of the few symbolic debuggers which will generally do quite well debugging optimized executables. However, you should generally turn off optimizations when debugging an executable because there are situations that will confuse GDB. Variables may get optimized out of existence, functions may get inlined, and more things may happen that may or may not confuse gdb. To be on the safe side, turn off optimization when you're debugging a program.

Exercises

Run strip --only-keep-debug try1. Look at the file size of try1. Now run strip --strip-debug try1 and look at the file size. Now run strip --strip-all try1 and look at the file size. Can you guess what's happening? If not, your punishment is to read "man strip", which makes for some provocative reading.
You stripped all the unnecessary symbols from try1 in the previous exercise. Re-run the program to make sure it works. Now run strip --remove-section=.text try1 and look at the file length. Now try to run try1. What do you suppose is going on?
Read this link about symbol tables (it's short).
Optional: Read this link about the COFF object file format.

Investigating The Stack With GDB

We'll look at the stack again, this time, using GDB. You may not understand all of this since you don't know about breakpoints yet, but it should be intuitive. Compile and run try1.c:

   1    #include<stdio.h>
   2    static void display(int i, int *ptr);
   3
   4    int main(void) {
   5       int x = 5;
   6       int *xptr = &x;
   7       printf("In main():\n");
   8       printf("   x is %d and is stored at %p.\n", x, &x);
   9       printf("   xptr points to %p which holds %d.\n", xptr, *xptr);
   10      display(x, xptr);
   11      return 0;
   12   }
   13
   14    void display(int z, int *zptr) {
   15    	printf("In display():\n");
   16       printf("   z is %d and is stored at %p.\n", z, &z);
   17       printf("   zptr points to %p which holds %d.\n", zptr, *zptr);
   18   }

Make sure you understand the output before continuing with this tutorial. Here's what I see:

   $ ./try1
   In main():
      x is 5 and is stored at 0xbffff948.
      xptr points to 0xbffff948 which holds 5.
   In display():
      z is 5 and is stored at 0xbffff924.
      zptr points to 0xbffff948 which holds 5.

You debug an executable by invoking GDB with the name of the executable. Start a debugging session with try1. You'll see a rather verbose copyright notice:

   $ gdb try1
   GNU gdb 6.1-debian
   Copyright 2004 Free Software Foundation, Inc.
   GDB is free software, covered by the GNU General Public License, and you are
   welcome to change it and/or distribute copies of it under certain conditions.
   Type "show copying" to see the conditions.
   There is absolutely no warranty for GDB.  Type "show warranty" for details.

   (gdb)

The (gdb) is GDB's prompt. It's now waiting for us to input commands. The program is currently not running; to run it, type run. This runs the program from inside GDB:

   (gdb) run
   Starting program: try1
   In main():
      x is 5 and is stored at 0xbffffb34.
      xptr points to 0xbffffb34 which holds 5.
   In display():
      z is 5 and is stored at 0xbffffb10.
      zptr points to 0xbffffb34 which holds 5.

   Program exited normally.
   (gdb)

Well, the program ran. It was a good start, but frankly, a little lackluster. We could've done the same thing by running the program ourself. But one thing we can't do on our own is to pause the program in the middle of execution and take a look at the stack. We'll do this next.

You get GDB to pause execution by using breakpoints. We'll cover breakpoints later, but for now, all you need to know is that when you tell GDB break 5, the program will pause at line 5. You may ask: does the program execute line 5 (pause between 5 and 6) or does the program not execute line 5 (pause between 4 and 5)? The answer is that line 5 is not executed. Remember these principles:

    break 5 means to pause at line 5.
    This means GDB pauses between lines 4 and 5. Line 4 has executed. Line 5 has not.

Set a breakpoint at line 10 and rerun the program:

   (gdb) break 10
   Breakpoint 1 at 0x8048445: file try1.c, line 10.
   (gdb) run
   Starting program: try1
   In main():
      x is 5 and is stored at 0xbffffb34.
      xptr holds 0xbffffb34 and points to 5.

   Breakpoint 1, main () at try1.c:10
   10         display(x, xptr);

We set a breakpoint at line 10 of file try1.c. GDB told us this line of code corresponds to memory address 0x8048445. We reran the program and got the first 2 lines of output. We're in main(), sitting before line 10. We can look at the stack by using GDB's backtrace command:

   (gdb) backtrace
   #0  main () at try1.c:10
   (gdb)

There's one frame on the stack, numbered 0, and it belongs to main(). If we execute the next line of code, we'll be in display(). From the previous section, you should know exactly what should happen to the stack: another frame will be added to the bottom of the stack. Let's see this in action. You can execute the next line of code using GDB's step command:

   (gdb) step
   display (z=5, zptr=0xbffffb34) at try1.c:15
   15              printf("In display():\n");
   (gdb)

Look at the stack again, and make sure you understand everything you see:

   (gdb) backtrace
   #0  display (z=5, zptr=0xbffffb34) at try1.c:15
   #1  0x08048455 in main () at try1.c:10

Some points to note:

We now have two stack frames, frame 1 belonging to main() and frame 0 belong to display().
Each frame listing gives the arguments to that function. We see that main() took no arguments, but display() did (and we're shown the value of the arguments).
Each frame listing gives the line number that's currently being executed within that frame. Look back at the source code and verify you understand the line numbers shown in the backtrace.
Personally, I find the numbering system for the frame to be confusing. I'd prefer for main() to remain frame 0, and for additional frames to get higher numbers. But this is consistent with the idea that the stack grows "downward". Just remember that the lowest numbered frame is the one belonging to the most recently called function.

Execute the next two lines of code:

   (gdb) step
   In display():
   16         printf("   z is %d and is stored at %p.\n", z, &z);
   (gdb) step
      z is 5 and is stored at 0xbffffb10.
   17         printf("   zptr holds %p and points to %d.\n", zptr, *zptr);

Recall that the frame is where automatic variables for the function are stored. Unless you tell it otherwise, GDB is always in the context of the frame corresponding to the currently executing function. Since execution is currently in display(), GDB is in the context of frame 0. We can ask GDB to tell us which frame its context is in by giving the frame command without arguments:

   (gdb) frame
   #0  display (z=5, zptr=0xbffffb34) at try1.c:17
   17         printf("   zptr holds %p and points to %d.\n", zptr, *zptr);

I didn't tell you what the word "context" means; now I'll explain. Since GDB's context is in frame 0, we have access to all the local variables in frame 0. Conversely, we don't have access to automatic variables in any other frame. Let's investigate this. GDB's print command can be used to give us the value of any variable within the current frame. Since z and zptr are variables in display(), and GDB is currently in the frame for display(), we should be able to print their values:

   (gdb) print z
   $1 = 5
   (gdb) print zptr
   $2 = (int *) 0xbffffb34

But we do not have access to automatic variables stored in other frames. Try to look at the variables in main(), which is frame 1:

   (gdb) print x
   No symbol "x" in current context.
   (gdb) print xptr
   No symbol "xptr" in current context.

Now for magic. We can tell GDB to switch from frame 0 to frame 1 using the frame command with the frame number as an argument. This gives us access to the variables in frame 1. As you can guess, after switching frames, we won't have access to variables stored in frame 0. Follow along:

   (gdb) frame 1                           <--- switch to frame 1
   #1  0x08048455 in main () at try1.c:10
   10         display(x, xptr);
   (gdb) print x
   $5 = 5                                  <--- we have access to variables in frame 1
   (gdb) print xptr
   $6 = (int *) 0xbffffb34                 <--- we have access to variables in frame 1
   (gdb) print z
   No symbol "z" in current context.       <--- we don't have access to variables in frame 0
   (gdb) print zptr
   No symbol "zptr" in current context.    <--- we don't have access to variables in frame 0

By the way, one of the hardest things to get used to with GDB is seeing the program's output:

   x is 5 and is stored at 0xbffffb34.
   xptr holds 0xbffffb34 and points to 5.

intermixed with GDB's output:

   Starting program: try1
   In main():
   ...
      Breakpoint 1, main () at try1.c:10
   10         display(x, xptr);

intermixed with your input to GDB:

   (gdb) run

intermixed with your input to the program (which would've been present had we called some kind of input function). This can get confusing, but the more you use GDB, the more you get used to it. Things get tricky when the program does terminal handling (e.g. ncurses or svga libraries), but there are always ways around it.

Exercises

Continuing from the previous example, switch back to display()'s frame. Verify that you have access to automatic variables in display()'s frame, but not main()'s frame.
Figure out how to quit GDB on your own. Control-d works, but I want you to guess the command that quits GDB.
GDB has a help feature. If you type help foo, GDB will print a description of command foo. Enter GDB (don't give GDB any arguments) and read the help blurb for all GDB commands we've used in this section.
Debug try1 again and set a breakpoint anywhere in display(), then run the program. Figure out how to display the stack along with the values of every local variable for each frame at the same time. Hint: If you did the previous exercise, and read each blurb, this should be easy.

Interlude: Debugging With Your Brian

Please Read Before Continuing

As of SDL 1.2.11, it appears that SDL_SetVideoMode() no longer generates SIGFPE when passed SDL_OPENGL. This means you can use GDB to debug spinning_cube. However, this is still an excellent example of:

How to debug with your brain. Why knowing theory, like the memory layout of a program, can be helpful when debugging.

Debugging With Your Brain

In the last section we looked at how a program is laid out in memory. Knowing this is not only useful for debugging with GDB, but it's also useful for debugging without GDB. In this interlude, guest written by my close friend, Mark Kim, we'll see how.

Compile and run spinning_cube.tar.bz2. A spinning cube is displayed with images of Geordi (white) and Juliette (calico), me on a New York City subway, and where I work.

However, when you press a key, some of the cube's textures mysteriously vanish. My first instinct was to use GDB to find the problem, but I discovered that SDL programs that use OpenGL can't be debugged via GDB. Upon investigation, I found that when you pass the flag SDL_OPENGL to the function SDL_SetVideoMode(), a SIGFPE is generated which terminates the program. If you try to handle the SIGFPE, you'll find that SDL_SetVideoMode() never returns, so GDB is left in a hung state.

I had just spent over 40 hours programming over the last 3 days and was getting punch-drunk. Not having GDB available pushed me over the edge and I sent an exasperated email to Mark for help. I got a reply within 10 minutes.

Before continuing you'll want to:

Run the program to see the bug in action. You need OpenGL and SDL to compile the program. Look at HandleKeyPress() in input.c, which handles keystrokes. Look at Debug(), in yerror.h, which is called from HandleKeyPress(). Spend 10 minutes trying to fix the bug. This will make Mark's email all the more impressive. As you read Mark's email, pay particular attention to steps 6, 7B, and 7C for particular examples of sheer debugging brilliance!

   Hey Peter,
   
   The problem was there was an overlapping memory area between the debugging
   variables and the texture variabes.  In video.[hc], the "texture[2]" array
   should have been declared "texture[NUM_TEXURES]" instead.  Attached is a
   patch file.
   
   The debugging process went like this:
   
      1. Try Debug() -- indeed it makes some textures disappear.
   
      2. Try debug_for_reals() into an empty function -- same happens,
         so that's not the problem.
   
      3. Try removing each line of Debug() macro.  This revealed that
         writing values into the "die_*" variables cause the texture
         to disappear.
   
      4. So instead of calling Debug(), try writing some values into
         the "die_*" variables -- the textures disappear again.
   
      5. Check if any other code is using those variables by changing
         variable names and looking out for compilation errors --
         nothing significant showed up.
   
      6. Perhaps someone is using the same memory space as the "die_*"
         variables unintentionally.  I tried shifting the memory locations of
         the "die_*" variables down by putting an array in front of them,
         like this:
   
          yerror.c:
   
            ...
            #include "yerror.h"
   
          + char buffer[1024];
   
            // Global Debugging/Dying Variables
            const char *die_filename;
            const char *die_function;
            int        die_line;
            bool       debug = true;
   
         which fixed the problem.  So now it's a matter of finding the
         overlapping memory.
   
      7. Tracking down the problem needs some narrowing down of the
         possiblilities, so I made the following assumptions:
   
         A. I know a problem like this occurs most often when an array
            size is declared too short at another place, so there's probably
            an array out there that's declared too short, and the "die_*"
            variables, placed in memory right after that array, is probably
            getting overwritten by some code expecting the array to be
            longer.
   
            It could also be a pointer combined with malloc() but at this
            point I'm just thinking about one problem at a time.
   
         B. The problem must be with either a global or static variable
            since it's overlapping with another global variable
            in the heap space.  So I'm looking for an array declared in
            global or static scope.  That narrows down my search quite a bit.
   
            BTW, the fact that I'm looking for a variable that overlaps with
            a global variable probably discounts malloc() from our potential
            list of problems since malloc(), if the way I view the memory is
            correct, should allocate memory only *after* all global
            variables, and it's unlikely code accidentally writes to
            a memory location before a pointer rather than an after
            (though it's certainly possible to write to memory before
            a pointer.)  But again, this is all an afterthought... I'm just
            thinking about another global array at this point.
   
         C. I know the global array I'm looking for must be somehow linked
            to a texture operation since that's what's being interfered by
            writing to the "die_*" variables.  So I'm looking for a global
            array that does something with textures, probably one that stores
            textures or pointers to textures or index to textures or
            something like that.
   
      8. And that's what I looked for.  texture[2] looked a little suspicious
         so I tried expanding its size and that fixed the problem.  Just to
         make sure, I looked for the code that writes to texture with index
         greater than 1 and found init.c:127 and several places in render.c.
   
   Hope that helps!
   
   -Mark

Chapter 3: Initialization, Listing And Running

Where Are We Now? In the last chapter we learned about an executing process's memory layout which is divided into segments. One important segment is the call stack (or stack), which is a collection of stack frames (or frames). There is one frame for each function call, and the frame holds three important things:

The local variables for the function.
The current address pointer within the function.
The arguments passed to the function.

When a function is called, a new frame is allocated and added to the stack. When the function returns, its frame is returned back to unused stack memory and execution resumes at the address pointed to by the previous function's current address pointer. We can ask GDB to tell us what the stack looks like with the backtrace command. We can also find out which frame GDB's context is in using the frame command. Lastly, we can change GDB's context to the n'th frame using the frame n command.

Executables don't contain references to object (function and variable) names or source code line numbers. It would be painful to debug a program without these things, so to debug a program, we generate an augmented symbol table using gcc's -g option.

Lastly, we briefly learned how to make GDB pause execution using the break command and execute one line of source code using the step command. We'll have much more to say about these commands shortly.

Chapter 4: Breakpoints and Watchpoints

Where Are We Going To Go?

In this chapter, we'll investigate the list command which (surprisingly) lists lines of source code. We'll take an in-depth look at GDB's initialization file .gdbinit. Lastly, we'll look at GDB's run command which executes a program from within GDB.

Basic Listing of Source Code

Download derivative, a program that calculates numerical derivatives, to follow along with the discussion: derivative.tar.bz2. Take a moment to familiarize yourself with the code. Note the use of groovy function pointers.

You can list source code with GDB's list command, abbreviated by l. Run GDB on the executable and use the list command:

   $ gdb driver
   (gdb) list
   12    }
   13
   14
   15
   16    int main(int argc, char *argv[])
   17    {
   18        double x, dx, ans;
   19        double Forw, ForwDelta, Cent, CentDelta, Extr, ExtrDelta;
   20
   21        if (argc != 1) {

By default, GDB always lists 10 lines of source code. When you first issue list, GDB lists 10 lines of source code centred on main(). Subsequent use of list gives the next 10 lines of source code. Try it:

   (gdb) list
   22        printf("You must supply a value for the derivative location!\n");
   23        return EXIT_FAILURE;
   24    }
   25
   26    x   = atol(argv[1]);
   27    ans = sin(log(x)) / x;
   28
   29    printf("%23s%10s%10s%11s%10s%11s\n", "Forward", "error", "Central",
   30        "error", "Extrap", "error");
   31
   (gdb)

Use list three more times, and you'll see:

        ... output suppressed
   
   45            printf("dx=%e: %.5e %.4f  %.5e %.4f  %.5e %.4f\n",
   46                dx, Forw, ForwDelta, Cent, CentDelta, Extr, ExtrDelta);
   47        }
   48  
   49        return 0;
   50    }
   (gdb) list
   Line number 51 out of range; driver.c has 50 lines.
   (gdb)

The second time we used list, only 9 lines were printed, since we reached the end of the file. The final list didn't print any lines. That's because list always prints 10 lines of code after the previously listed lines. There were simply no more lines of code to list.

list - works like list, except in reverse. It lists the 10 lines previous to the last listed lines. Since line 50 was the last listed line, list - should print lines 41 through 50:

   (gdb) list -
   41
   42                Extr      = ExtrapolatedDiff(x, dx, &f);
   43                ExtrDelta = fabs(Extr - ans);
   44
   45                printf("dx=%e: %.5e %.4f  %.5e %.4f  %.5e %.4f\n",
   46                    dx, Forw, ForwDelta, Cent, CentDelta, Extr, ExtrDelta);
   47            }
   48
   49        return 0;
   50    }
   (gdb)

If you give list a line number, GDB lists 10 lines centered on that line number:

   (gdb) list 13
   8
   9      double f(double x)
   10     {
   11          return cos(log(x));
   12     }
   13
   14
   15
   16     int main(int argc, char *argv[])
   17     {
   (gdb)

I'm going to suppress the output to conserve space, however I strongly encourage you to follow along with my examples by performing the operations in GDB yourself. Try to imagine what the output looks like before you actually perform the operation.

Other listing operations you'll find useful:

    starting with some line number	(gdb) list 5,
    ending with some line number	(gdb) list ,28
    between two numbers:	(gdb) list 21,25
    by function name:	(gdb) list f
    functions in the other file:	(gdb) list CentralDiff
    by filename and line number:	(gdb) list derivative.c:12
    filename and function name:	(gdb) list derivative.c:ForwardDiff

list has a "memory" of what file was list used to print source code. We started out by listing lines from driver.c. We then switched to derivative.c by telling GDB to list CentralDiff(). So now, list is in the "context" of derivative.c. Therefore, if we use list by itself again, it'll list lines lines from derivative.c.

   (gdb) list
   11    }
   12
   13
   14
   15     double ExtrapolatedDiff( double x, double dx, double (*f)(double) )
   16     {
   17         double term1 = 8.0 * ( f(x + dx/4.0) - f(x - dx/4.0) );
   18         double term2 = ( f(x + dx/2.0) - f(x - dx/2.0) );
   19
   20         return (term1 - term2) / (3.0*dx);

But what if we wanted to start listing lines from driver.c again? How do we go back to that file? We simply list anything that lives in driver.c, like a function or line number. All these commands will reset list's command context from derivative.c back to driver.c:

   list main
   list f
   list driver.c:main
   list driver.c:f
   list driver.c:20

And so forth. The rules aren't complicated; you'll get the hang of them after debugging a few multi-file programs.

Listing By Memory Address (advanced)

Every function begins at some memory address. You can find this address with the print function (which we'll cover later). For instance, we'll find the address for main():

   (gdb) print *main
   $1 = {int (int, char **)} 0x8048647 <main>
   (gdb)

So main() lives at 0x8048647. We can use list using memory locations as well; the syntax is very C'ish:

   (gdb) list *0x8048647
   0x8048647 is in main (driver.c:17).
   12     }
   13
   14
   15
   16     int main(int argc, char *argv[])
   17     {
   18          double x, dx, ans;
   19          double Forw, ForwDelta, Cent, CentDelta, Extr, ExtrDelta;
   20
   21          if (argc != 1) {
   (gdb)

It stands to reason that 0x8048690 is also somewhere inside of main(). Let's find out:

   (gdb) list *0x8048690
   0x8048690 is in main (driver.c:26).
   21          if (argc != 1) {
   22               printf("You must supply a value for the derivative location!\n");
   23               return EXIT_FAILURE;
   24          }
   25
   26          x   = atol(argv[1]);
   27          ans = sin(log(x)) / x;
   28
   29          printf("%23s%10s%10s%11s%10s%11s\n", "Forward", "error", "Central",
   30               "error", "Extrap", "error");
   (gdb)

Exercises

Using list and print *, figure out how many machine instructions are used for this line of code:

18          double x, dx, ans;
19          double Forw, ForwDelta, Cent, CentDelta, Extr, ExtrDelta;

Think about this for a second; you'll learn a bit about compilers and machine instructions.

Setting The List Size

GDB lists code in increments of 10 lines. Maybe that's too much. Or maybe that's too little. You can tell GDB to change the listing size with the set command and listsize variable:

   (gdb) set listsize 5
   (gdb) list main
   15
   16     int main(int argc, char *argv[])
   17     {
   18          double x, dx, ans;
   19          double Forw, ForwDelta, Cent, CentDelta, Extr, ExtrDelta;
   (gdb)

Exercises

There's actually a lot of things you can set. Issue help set from GDB's prompt. I'm not expecting you to read it all---I just want you to marvel at how big the list is!

The `.gdbinit` File

Upon startup, GDB reads and executes an initialization file named .gdbinit. It can contain any command (eg set and break), and more. For example, "set listsize" and "set prompt" can go into .gdbinit. There are two locations where GDB will look for this file (in order):

In your home directory
In the current directory

You can put commands to be executed for all your programming projects in $HOME/.gdbinit and project-specific commands in $PWD/.gdbinit. You can comment your .gdbinit files with bash's "#". And blank lines, of course, are ignored.

Exercises

When you invoke GDB, it prints a copyright notice. Using GDB's man page, figure out how to prevent GDB from printing this notice. Using your shell's alias feature, make an alias for "gdb" that invokes GDB, but surpresses the copyright notice. I use this alias myself.
Figure out how to reset GDB's prompt from (gdb) to something that tickles your fancy. Google would be a great way of figuring this out. GDB's help utility would also be useful (hint: you want to "set" the prompt to something else). Modify .gdbinit so that GDB uses your chosen prompt on startup.
You can even use terminal escape codes to put color in your GDB prompt! If you don't know about terminal color escape codes, you can read about them here. One caveat: You have to use the octal code \033 for the escape character. So for example, bold blue would be \033[01;34m. And then don't forget to turn the blue off, otherwise everything will be blue. I'll let you figure out how to do that yourself! Thanks to Jeff Terrell for pointing this out to me!

A Caveat for Colored GDB Prompts

Thanks Eric Rannaud According to the GDB documentation binutils-gdb/readline/doc/rltech.texi that comes with the GDB source code:

Applications may indicate that the prompt contains characters that take up no physical screen space when displayed by bracketing a sequence of such characters with the special markers @code{RL_PROMPT_START_IGNORE} and @code{RL_PROMPT_END_IGNORE} (declared in @file{readline.h). This may be used to embed terminal-specific escape sequences in prompts.

   PROMPT_START_IGNORE is defined to \001
   PROMPT_END_IGNORE is defined to \002

Without these characters, at least with xterm, the cursor is not always in the right place when using backspace, as gdb includes the terminal escape codes in its computation of the visible length of the prompt, which is wrong. With a simple colored "(gdb) " prompt, gdb thinks its length is 17, instead of 6.

The fix is simple:

set prompt \001\033[1;32m\002(gdb)\001\033[0m\002\040

The \040 is just a space -- better to escape it than to have to copy and paste a trailing blank space.

`gdbinit` on MS Windows

Thanks Ted Alves The environment variable HOME, used by .gdbinit, is not normally defined in Windows. You must set/define it yourself: (you must be logged in as an Administrator to change the system variables) right-click My Computer, left-click Properties, left-click the Advanced tab, left-click the Environment Variables button. Now add New: HOME: "(your chosen path) c:\documents and settings\username and save it by clicking OK. Type "set" after rebooting to verify that it's there. You can also type set HOME=(your chosen path) to set it before rebooting, but this method isn't permanent.

Windows Explorer will not accept (create) a file named .gdbinit but Windows itself has no problem with it. Create a file named something like: gdb.init in your %HOME% directory, then go to command-line and type move gdb.init .gdbinit. This will create the file and Explorer will now work with it. You might want to copy this (empty) file to your intended working directory(ies) before you edit in your commands for the HOME file.

Running A Program In GDB

Let's properly introduce the run command. Download and compile arguments.tar.bz2.

The run command with no arguments runs your program without command line arguments. If you want to give the program arguments, use the run command with whatever arguments you want to pass to the program:

   $ gdb arguments
   (gdb) run 1 2
   Starting program: try2 1 2
   Argument 0: arguments
   Argument 1: 1
   Argument 2: 2
   
   Program exited normally.
   (gdb)

Nothing could be simpler. From now on, whenever you use run again, it'll automatically use the arguments you just used (ie, "1 2"):

   (gdb) run
   Starting program: arguments 1 2
   Argument 0: arguments
   Argument 1: 1
   Argument 2: 2
   
   Program exited normally.
   (gdb)

until you tell it to use different arguments:

   (gdb) run testing one two three
   Starting program: arguments testing one two three
   Argument 0: testing
   Argument 1: one
   Argument 2: two
   Argument 3: three
   
   Program exited normally.
   (gdb)

Suppose you want to run the program without command line arguments? How do you get run to stop automatically passing them? There's a "set args" command. If you give this command without any parameters, run will no longer automatically pass command line arguments to the program:

   (gdb) set args
   (gdb) run
   Starting program: arguments 
   Argument 0: try2
   
   Program exited normally.
   (gdb)

If you do give an argument to set args, those arguments will be passed to the program the next time you use run, just as if you had given those arguments directly to run.

There's one more use for set args. If you intend on passing the same arguments to a program every time you begin a debugging session, you can put it in your .gdbinit file. This will make run pass your arguments to the program without you having to specify them every time you start GDB on a given project.

Restarting A Program In GDB

Sometimes you'll want to re-start a program in GDB from the beginning. One reason why you'd want to do this is if you find that the breakpoint you set is too late in the program execution and you want to set the breakpoint earlier. There are three ways of restarting a program in GDB.

Quit GDB and start over.
Use the kill command to stop the program, and run to restart it.
Use the GDB command run. GDB will tell you the program is already running and ask if you want to re-run the program from the beginning.

The last two options will leave everything intact: breakpoints, watchpoints, commands, convenience variables, etc. However, if you don't mind starting fresh with nothing saved from your previous debugging session, quitting GDB is certainly an option.

You might be wondering why there's a kill command when you can either quit GDB with quit or re-run the program with run. The kill command seems kind of superfluous. There are some reasons why you'd use this command, and you can read about them here. Thanks to Suresh Babu for pointing out that the kill command may also be useful when you are remote debugging or if a process is debugged via the attach command. That said, I've never used kill myself.

Breakpoints and Watchpoints

So far you know how to list source code and run a program from within gdb. But you already knew how to do that without gdb. What else does gdb give us? To do anything really useful with gdb, you need to set breakpoints which temporarily pause your program's execution so you can do useful debugging work like inspecting variables and watching the program's execution in an atomic line-by-line fashion. This right here is the magic of a symbolic debugger.

Breakpoints come in three flavors:

A breakpoint stops your program whenever a particular point in the program is reached. We will discuss breakpoints momentarily.
A watchpoint stops your program whenever the value of a variable or expression changes. We'll discuss watchpoints later on in this chapter.
A catchpoint stops your program whenever a particular event occurs. We won't discuss catchpoints until I get a chance to write about them.

A breakpoint stops your program whenever a particular place in the program is reached. Here are some examples of what a breakpoint does:

Mr. Computer, won't you please stop when...

you reach line 420 of the current source code file?
you enter the function validateInput()?
you reach line 2718 of the file video.c?

All those requests have one thing in common: they ask gdb to stop based on reaching some location within the program. That's what a breakpoint does. There are two things I'd like to mention before we start:

What does "stopping at line 5" mean? When gdb stops at "line 5", this means that gdb is currently waiting "between" lines 4 and 5. Line 5 hasn't executed yet. Keep this in mind! You can execute line 5 with the next command, but line 5 has not happened yet.

Why did gdb stop here? Sometimes you may be surprised at where gdb stops. You may have specified a breakpoint at line 5 of the source code, but gdb could stop at line 7, for instance. This can happen for 2 reasons. First, if you compile a program with optimization set, some lines of source code may be optimized out of existence; they exist in your source code, but not in the executable. Secondly, not every line of source code gets compiled into machine code instruction. Consider the code below:

1   #include <stdio.h>
2      
3   int main( void )
4   {
5        int i;
6        i = 3;
7   
8        return 0;
9   }

Inserting a breakpoint at line X makes your program pause at line Y...

Unoptimized code		Optimized code
Breakpoint at line	Program pauses at line	Breakpoint set at line	Program pauses at line
1--4, main()	4	1--4, main()	4
5, 6	6	5--9	9
7, 8	8
9	9
Each breakpoint, watchpoint, and catchpoint you set is assigned a number starting with 1. You use this number to refer to that breakpoint. To see the list of all breakpoints and watchpoints you've set, type info breakpoints (which can be abbreviated by i b. I show a sample resulting output:

	(gdb) info breakpoints 
	Num Type           Disp Enb Address    What
	1   breakpoint     keep y   0x080483f6 in main at try5.c:4
			  breakpoint already hit 1 time
	2   breakpoint     keep n   0x0804841a in display at try5.c:14
			  breakpoint already hit 1 time
	3   hw watchpoint  keep y   i

According to the output, there are two breakpoints, one at line 4 and the other at line 14 of the source code. They are assigned to numbers 1 and 2 respectively. There is also a watchpoint set: the program will halt whenever the variable i (local to display()) changes value.

In addition to being assigned a number, each breakpoint and watchpoint can be enabled or disabled. A program's execution won't stop at a disabled breakpoint or watchpoint. By default, when you create a new breakpoint or watchpoint, it's enabled. To disable the breakpoint or watchpoint assigned to number n, type:

	disable n

To re-enable this breakpoint or watchpoint, type:

	enable n

If you look at the sample output of info breakpoints above, you'll see that breakpoint 2 has been disabled.

Chapter 5: Stepping and Resuming

Chapter 6: Debugging A Running Process

Chapter 7: Debugging Ncurses Programs

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
README.md		README.md

cameronfarvin/GNU-GDB-Salzman

Folders and files

Latest commit

History

Repository files navigation

Using GNU's GDB Debugger

By Peter Jay Salzman (re-written from WebArchive by Cameron Farvin @assemblyDruid)

Chapter 0: Administrata

Why Write This Tutorial?

Acknowledgements and Dedication

Authorship and Copyright

About Exercises

Thank You's

A Plug For The EFF

A Request For Help

Chapter 1: Introduction

What Is A Debugger?

Why Not Use printf()?

What Is GDB?

Other Symbolic Debuggers

Debuggers

Front Ends

Chapter 2: Memory Layout and the Stack

Where Are We Going To Go?

Virtual Memory

Exercises

Memory Layout

Exercises

Stack Frames And The Stack

Exercises

The Symbol Table

Preparing An Executable For Debugging

Exercises

Investigating The Stack With GDB

Exercises

Interlude: Debugging With Your Brian

Please Read Before Continuing

Debugging With Your Brain

Chapter 3: Initialization, Listing And Running

Chapter 4: Breakpoints and Watchpoints

Where Are We Going To Go?

Basic Listing of Source Code

Listing By Memory Address (advanced)

Exercises

Setting The List Size

Exercises

The .gdbinit File

Exercises

A Caveat for Colored GDB Prompts

gdbinit on MS Windows

Running A Program In GDB

Restarting A Program In GDB

Breakpoints and Watchpoints

Chapter 5: Stepping and Resuming

Chapter 6: Debugging A Running Process

Chapter 7: Debugging Ncurses Programs

Chapter 8: Other Stuff

Original Source: http://web.archive.org/web/20180701022737/http://dirac.org/linux/gdb/

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Why Not Use `printf()`?

The `.gdbinit` File

`gdbinit` on MS Windows

Packages