NOTE This guide is heavily adopted from Google's Java Style Guide. Some taste and preference tweaks have been made.
Code is more often read than written. Style guidelines exist to make reading code more pleasant and efficient by standardizing formatting across development teams. The style guide covers the following topics:
- Java Style Guide
- A Note on Provenance
- Introduction
- Source File Basics
- Source File Structure
- Formatting
- Naming
- Javadoc
Guiding principles and religious debates. We recognize that most formatting and style rules described by this document will be subject to widely varying opinions and frenetic debate. We submit that this debate is of mostly religious nature and the significant outcome is to agree on a style. The widest consensus is likely achieved by having based the Palantir style guide heavily on the Google Java Style Guide, which has emerged as the de facto code style for many teams inside and outside Palantir.
One principle underlying the style recommendations is to favor concise and condensed source code to maximize the amount of program logic that is visible per unit of screen estate. For example,
// BAD. Don't do this.
int myMethod(int parameter)
{
if (notValid(parameter))
{
return -1;
}
if (something())
{
return parameter;
}
else
{
return -parameter;
}
}
occupies almost twice the screen real estate of the more concise, equivalent version below:
int myMethod(int parameter) {
if (notValid(parameter)) {
return -1;
}
if (something()) {
return parameter;
} else {
return -parameter;
}
}
A second principle is to avoid formatting variants that can easily give rise to bugs. For example, multi-line if/else statements requiring brackets and fall-through switch statements are discouraged unless the behavior is explicitly documented.
Consistency. Although Palantir-wide consistency is great, it is not as important as consistency within a product, module, or file:
- Value internal consistency higher than alignment with the style guide: First, aim for consistency within a class or file, then within a module, and eventually within a product or even Palantir.
- When refactoring a file or module, use this opportunity to make it consistent with the style guide.
- When writing software from scratch, follow the style guide.
There are rare cases when following the style guide makes code less readable. In such cases, value readability over consistency with the guide.
This style guide derives heavily from the Google Java Style Guide, which itself borrows from Oracle's Java style guide. The major changes we've made to Google's guide:
- Substitute "Palantir" for "Google"
- License or copyright information
- Block indentation: +4 spaces
- Column limit: 120
- Import statements
- Trailing Horizontal whitespace is discouraged
- Add a few minor sections
- Remove section 6 ("Programming Practices"), which have been incorporated into our separate "Java Coding Guidelines" page
This document provides the definition of Palantir's coding standards for source code in the Java programming language. A Java source file is described as being in Palantir Style if and only if it adheres to the rules herein.
Like in other programming style guides, the issues covered span not only aesthetic issues of formatting, but also other types of conventions or coding standards as well. However, this document focuses primarily on the hard-and-fast rules that we follow universally and avoids giving advice that isn't clearly enforceable (whether by human or tool).
In this document, unless otherwise clarified:
-
The term class is used inclusively to mean an "ordinary" class, enum class, interface or annotation type (
@interface
). -
The term comment always refers to implementation comments. We do not use the phrase "documentation comments", instead using the common term "Javadoc." Other "terminology notes" will appear occasionally throughout the document.
Example code in this document is non-normative. That is, while the examples are in Palantir Style, they may not illustrate the only stylish way to represent the code. Optional formatting choices made in examples should not be enforced as rules.
The source file name consists of the case-sensitive name of the
top-level class it contains, plus the .java
extension.
Source files are encoded in UTF-8.
(not in Google Java Style Guide)
The newline character is LF ('Line feed' aka '\n' aka 0x0A). The last line of every file ends with LF with no additional blank lines.
Aside from the line terminator sequence, the ASCII horizontal space character (0x20) is the only whitespace character that appears anywhere in a source file. This implies that:
Note: All other whitespace characters in string and character literals are escaped. Tab characters are not used for indentation.
For any character that has a special escape sequence (\b
, \t
, \n
,
\f
, \r
, \"
, \'
and \\
), that sequence is used rather than
the corresponding octal (e.g. \012
) or Unicode (e.g. \u000a
) escape.
For the remaining non-ASCII characters, either the actual Unicode
character (e.g. ∞
) or the equivalent Unicode escape (e.g. \u221e
) is
used, depending only on which makes the code easier to read and
understand.
Tip: In the Unicode escape case, and occasionally even when actual Unicode characters are used, an explanatory comment can be very helpful.
Examples:
String unitAbbrev = "μs"
;- Best: perfectly clear even without a comment.
String unitAbbrev = "u03bcs"; // "μs"
- Allowed, but there's no reason to do this.
String unitAbbrev = "u03bcs"; // Greek letter mu, "s"
- Allowed, but awkward and prone to mistakes.
String unitAbbrev = "u03bcs";
- Poor: the reader has no idea what this is.
return 'ufeff' + content; // byte order mark
- Good: use escapes for non-printable characters and comment if necessary.
Tip: Never make your code less readable simply out of fear that some programs might not handle non-ASCII characters properly. If that should happen, those programs are broken and they must be fixed.
A source file consists of, in order:
- License or copyright information, if present
- Package statement
- Import statements
- Exactly one top-level class
Exactly one blank line separates each section that is present.
The package name of internal as well as open source projects starts with
com.palantir.<project name>
. The package statement is not
line-wrapped, i.e., the column limit (Column limit: 120) does not
apply to package statements.
Package annotations occur in dedicated package-info.java
files only.
Wildcard imports, static or otherwise, are not used.
Import statements are not line-wrapped. The column limit (Column limit: 120) does not apply to import statements.
(changed from Google Java Style Guide) Import statements are divided into the following groups, in this order, with each group separated by a single blank line:
- All static imports in a single group
- All non-static imports in a single group
Within a group there are no blank lines, and the imported names appear in ASCII sort order.
Unused or redundant imports should be removed.
(not in Google Java Style Guide)
Static imports of project-specific methods limit readability as they obfuscate method provenance. Static imports are thus discouraged with the exception of the members of the following classes:
java.util.Collections
java.util.stream.Collectors
com.palantir.logsafe.Preconditions
com.google.common.base.Preconditions
org.apache.commons.lang3.Validate
Each top-level class resides in a source file of its own.
The ordering of the members of a class can have a great effect on learnability, but there is no single correct recipe for how to do it. Different classes may order their members differently.
What is important is that each class order its members in some logical order, which its maintainer could explain if asked. For example, new methods are not just habitually added to the end of the class, as that would yield "chronological by date added" ordering, which is not a logical ordering.
When a class has multiple constructors, or multiple methods with the same name, these appear sequentially, with no intervening members.
Terminology Note: block-like construct refers to the body of a class, method or constructor. Note that, by Section Array initializers: can be "block-like", any array initializer may optionally be treated as if it were a block-like construct.
Braces are used with if
, else
, for
, do
and while
statements,
even when the body is empty or contains only a single statement.
Braces follow the Kernighan and Ritchie style ("Egyptian brackets") for nonempty blocks and block-like constructs:
- No line break before the opening brace.
- Line break after the opening brace.
- Line break before the closing brace.
- Line break after the closing brace if that brace terminates a statement or the body of a method, constructor or named class. For example, there is no line break after the brace if it is followed by else or a comma. Example:
return new MyClass() {
@Override public void method() {
if (condition()) {
try {
something();
} catch (ProblemException e) {
recover();
}
}
}
};
A few exceptions for enum classes are given in Section Enum classes.
An empty block or block-like construct may be closed immediately after
it is opened, with no characters or line break in between ({}
), unless
it is part of a multi-block statement (one that directly contains
multiple blocks: if/else-if/else
or try/catch/finally
).
Example:
void doNothing() {}
(changed from Google Java Style Guide)
Each time a new block or block-like construct is opened, the indent increases by four spaces. When the block ends, the indent returns to the previous indent level. The indent level applies to both code and comments throughout the block. (See the example in Nonempty blocks: K & R Style.)
Empty blocks (of any type) are rare and allowed only if a justification of their existence is clearly documented, usually by a comment. For example:
try {
int i = Integer.parseInt(response);
return handleNumericResponse(i);
} catch (NumberFormatException expected) {
// It's not numeric; that's fine, just continue.
}
Empty catch
blocks in test code are discouraged and should be replaced
by a verification of the expected error message.
Each statement is followed by a line-break. Empty statements are discouraged.
(changed from Google Java Style Guide)
The column limit is 120 characters. Except as noted below, any line that would exceed this limit must be line-wrapped, as explained in Section Line-wrapping.
There are at least two strong reasons for limiting the line length:
- Very long lines are hard to grok for humans
- Given a typical monitor and resolution, side-by-side diffs (e.g. in Stash or Gerrit) require soft-wrapping or scrolling when lines are larger than about 120 characters.
Exceptions:
- Lines where obeying the column limit is not possible (for example, a long URL in Javadoc, or a long JSNI method reference).
- package and import statements (see Sections Package statement and Import statements)
- Command lines in a comment that may be cut-and-pasted into a shell.
Terminology Note: When code that might otherwise legally occupy a single line is divided into multiple lines, typically to avoid overflowing the column limit, this activity is called line-wrapping.
There is no comprehensive, deterministic formula showing exactly how to line-wrap in every situation. Very often there are several valid ways to line-wrap the same piece of code.
Tip: Extracting a method or local variable may solve the problem without the need to line-wrap.
The prime directive of line-wrapping is: prefer to break at a higher syntactic level. Also:
- When a line is broken at a non-assignment operator the break comes before the symbol. (Note that this is not the same practice used in Palantir Style for other languages, such as C++ and JavaScript.)
- This also applies to the following "operator-like" symbols: the dot
separator (
.
), the ampersand in type bounds (<T extends Foo & Bar>
), and the pipe in catch blocks (catch (FooException | BarException e)
). - When a line is broken at an assignment operator the break typically comes after the symbol, but either way is acceptable.
- This also applies to the "assignment-operator-like" colon in an enhanced for ("foreach") statement.
- A method or constructor name stays attached to the open parenthesis
(
(
) that follows it. - A comma (
,
) stays attached to the token that precedes it.
(changed from Google Java Style Guide)
When line-wrapping, each line after the first (each continuation line) is indented at least +8 from the original line.
When there are multiple continuation lines, indentation may be varied beyond +8 as desired. In general, two continuation lines use the same indentation level if and only if they begin with syntactically parallel elements.
Section Horizontal alignment: never required addresses the discouraged practice of using a variable number of spaces to align certain tokens with previous lines.
(changed from Google Java Style Guide)
A single blank line appears:
- Between consecutive members (or initializers) of a class: fields, constructors, methods, nested classes, static initializers, instance initializers.
- Exception: A blank line between two consecutive fields (having no other code between them) is optional. Such blank lines are used as needed to create logical groupings of fields.
- Within method bodies, as needed to create logical groupings of statements.
- Optionally before the first member or after the last member of the class (neither encouraged nor discouraged).
- As required by other sections of this document (such as Section Import statements).
Two consecutive blank lines are not permitted.
Beyond where required by the language or other style rules, and apart from literals, comments and Javadoc, a single ASCII space also appears in the following places only.
- Separating any reserved word, such as if, for or catch, from an open
parenthesis (
(
) that follows it on that line - Separating any reserved word, such as else or catch, from a closing
curly brace (
}
) that precedes it on that line - Before any open curly brace (
{
), with two exceptions:@SomeAnnotation({a, b})
(no space is used)String[][] x = {{"foo"}};
(no space is required between{{
, by item 8 below)
- On both sides of any binary or ternary operator. This also applies to the following "operator-like" symbols:
- the ampersand in a conjunctive type bound:
<T extends Foo & Bar>
- the pipe for a catch block that handles multiple exceptions:
catch (FooException | BarException e)
- the colon (
:
) in an enhanced for ("foreach") statement
- After
,:;
or the closing parenthesis ()
) of a cast - On both sides of the double slash (
//
) that begins an end-of-line comment. Here, multiple spaces are allowed, but not required. - Between the type and variable of a declaration:
List<String> list
- Optional just inside both braces of an array initializer
new int[] {5, 6}
andnew int[] { 5, 6 }
are both valid
Note: Whitespace at the end of a line is discouraged.
Terminology Note: Horizontal alignment is the practice of adding a variable number of additional spaces in your code with the goal of making certain tokens appear directly below certain other tokens on previous lines.
This practice is permitted, but is never required by Palantir Style. It is not even required to maintain horizontal alignment in places where it was already used.
Here is an example without alignment, then using alignment:
private int x; // this is fine
private Color color; // this too
private int x; // permitted, but future edits
private Color color; // may leave it unaligned
Tip: Alignment can aid readability, but it creates problems for future maintenance. Consider a future change that needs to touch just one line. This change may leave the formerly-pleasing formatting mangled, and that is allowed. More often it prompts the coder (perhaps you) to adjust whitespace on nearby lines as well, possibly triggering a cascading series of reformattings. That one-line change now has a "blast radius." This can at worst result in pointless busywork, but at best it still corrupts version history information, slows down reviewers and exacerbates merge conflicts.
Optional grouping parentheses are omitted only when author and reviewer agree that there is no reasonable chance the code will be misinterpreted without them, nor would they have made the code easier to read. It is not reasonable to assume that every reader has the entire Java operator precedence table memorized.
After each comma that follows an enum constant, a line-break is optional.
An enum class with no methods and no documentation on its constants may optionally be formatted as if it were an array initializer (see Section Array initializers: can be "block-like").
private enum Suit { CLUBS, HEARTS, SPADES, DIAMONDS }
Since enum classes are classes, all other rules for formatting classes apply.
Every variable declaration (field or local) declares only one variable: declarations such as int a, b; are not used.
Local variables are not habitually declared at the start of their containing block or block-like construct. Instead, local variables are declared close to the point they are first used (within reason), to minimize their scope. Local variable declarations typically have initializers, or are initialized immediately after declaration.
Any array initializer may optionally be formatted as if it were a "block-like construct." For example, the following are all valid (not an exhaustive list):
new int[] { new int[] {
0, 1, 2, 3 0,
} 1,
2,
new int[] { 3,
0, 1, }
2, 3
} new int[]
{0, 1, 2, 3}
The square brackets form a part of the type, not the variable:
String[] args
, not String args[]
.
Terminology Note: Inside the braces of a switch block are one or more
statement groups. Each statement group consists of one or more switch
labels (either case FOO:
or default:
), followed by one or more
statements.
(changed from Google guide: +2 to +4)
As with any other block, the contents of a switch block are indented +4.
After a switch label, a newline appears, and the indentation level is increased +4, exactly as if a block were being opened. The following switch label returns to the previous indentation level, as if a block had been closed.
Within a switch block, each statement group either terminates abruptly
(with a break
, continue
, return
or thrown exception), or is marked
with a comment to indicate that execution will or might continue into
the next statement group. Any comment that communicates the idea of
fall-through is sufficient (typically // fall through
). This special
comment is not required in the last statement group of the switch block.
Example:
switch (input) {
case 1:
case 2:
prepareOneOrTwo();
// fall through
case 3:
handleOneTwoOrThree();
break;
default:
handleLargeNumber(input);
}
The default case is always the last case in a switch statement.
The default case is omitted for switch statements on an enumeration to enable error checking tools to alert for unhandled new enumeration values.
(changed from Google Java Style Guide)
Annotations applying to a class, method or constructor appear immediately after the documentation block, and each annotation is listed on a line of its own (that is, one annotation per line). These line breaks do not constitute line-wrapping (Section Line-wrapping), so the indentation level is not increased. Example:
@Override
@Nullable
public String getNameIfPresent() { ... }
Exception: A single parameterless annotation may instead appear together with the first line of the signature, for example:
@Override public int hashCode() { ... }
Annotations applying to a field also appear immediately after the documentation block, but in this case, multiple annotations (possibly parameterized) may be listed on the same line; for example:
@Partial @Mock DataLoader loader;
There are no specific rules for formatting parameter and local variable annotations.
Annotation parameters are unnamed (if possible) and compact:
// BAD. Don't do this.
@SuppressWarnings(value = {"unchecked", "unused"}) // Redundant name.
@SuppressWarnings({"unchecked"}) // Redundant array.
// Good.
@SuppressWarnings({"unchecked", "unused"})
@SuppressWarnings("unchecked")
Block comments are indented at the same level as the surrounding code.
They may be in /* ... */
style or // ...
style. For multi-line
/* ... */
comments, subsequent lines must start with * aligned with
the * on the previous line.
/*
* This is // And so /* Or you can
* okay. // is this. * even do this. */
*/
Comments are not enclosed in boxes drawn with asterisks or other characters.
Tip: When writing multi-line comments, use the /* ... */
style if you
want automatic code formatters to re-wrap the lines when necessary
(paragraph-style). Most formatters don't re-wrap lines
in // ...
style comment blocks.
(not in Google Java Style Guide)
Single-line comments can appear above or next to the respective code fragment:
// Check all path components for validity.
for (Path component : path) {
validate(component); // throws if invalid.
}
(not in Google Java Style Guide)
"TODO" comments carry the name or the author of the comment, not the name of the person who is supposed to resolve it, and are formatted as follows:
// TODO(rfink): This is a todo comment.
compute();
Class and member modifiers, when present, appear in the order recommended by the Java Language Specification:
public protected private abstract static final transient volatile synchronized native strictfp
long
-valued integer literals use an uppercase L
suffix, never
lowercase (to avoid confusion with the digit 1
). For example,
3000000000L
rather than 3000000000l
.
Identifiers use only ASCII letters and digits, and in two cases noted
below, underscores. Thus each valid identifier name is matched by the
regular expression \w+
.
In Palantir Style special prefixes or suffixes, like those seen in the
examples name_
, mName
, s_name
and kName
, are not used.
Package names are all lowercase, with consecutive words simply
concatenated together (no underscores). For example,
com.example.deepspace
, not com.example.deepSpace
or
com.example.deep_space
.
Class names are written in UpperCamelCase
.
Class names are typically nouns or noun phrases. For example,
Character
or ImmutableList
. Interface names may also be nouns or
noun phrases (for example, List
), but may sometimes be adjectives or
adjective phrases instead (for example, Readable
).
There are no specific rules or even well-established conventions for naming annotation types.
Test classes are named starting with the name of the class they are
testing, and ending with Test
. For example, HashTest
or
HashIntegrationTest
.
Method names are written in lowerCamelCase
.
Method names are typically verbs or verb phrases. For example,
sendMessage
or stop
.
Underscores may appear in JUnit test method names to separate logical
components of the name. One typical pattern is
test<MethodUnderTest>_<state>
, for example testPop_emptyStack
. There
is no One Correct Way to name test methods.
Constant names use CONSTANT_CASE
: all uppercase letters, with words
separated by underscores. But what is a constant, exactly?
Every constant is a static final field, but not all static final fields are constants. Before choosing constant case, consider whether the field really feels like a constant. For example, if any of that instance's observable state can change, it is almost certainly not a constant. Merely intending to never mutate the object is generally not enough. Examples:
// Constants
static final int NUMBER = 5;
static final ImmutableList<String> NAMES = ImmutableList.of("Ed", "Ann");
static final Joiner COMMA_JOINER = Joiner.on(','); // because Joiner is immutable
static final SomeMutableType[] EMPTY_ARRAY = {};
enum SomeEnum { ENUM_CONSTANT }
// Not constants
static String nonFinal = "non-final";
final String nonStatic = "non-static";
static final Set<String> mutableCollection = new HashSet<String>();
static final ImmutableSet<SomeMutableType> mutableElements = ImmutableSet.of(mutable);
static final Logger logger = Logger.getLogger(MyClass.getName());
static final String[] nonEmptyArray = {"these", "can", "change"};
These names are typically nouns or noun phrases.
Non-constant field names (static or otherwise) are written in
lowerCamelCase
.
These names are typically nouns or noun phrases. For example,
computedValues
or index
.
Parameter names are written in lowerCamelCase
. One-character parameter
names should be avoided.
Local variable names are written in lowerCamelCase
, and can be
abbreviated more liberally than other types of names.
However, one-character names should be avoided, except for temporary and looping variables.
Even when final and immutable, local variables are not considered to be constants, and should not be styled as constants.
Each type variable is named in one of two styles:
- A single capital letter, optionally followed by a single numeral
(such as
E, T, X, T2
) - A name in the form used for classes (see Section Class names),
followed by the capital letter
T
(examples:RequestT
,FooBarT
).
Sometimes there is more than one reasonable way to convert an English phrase into camel case, such as when acronyms or unusual constructs like "IPv6" or "iOS" are present. To improve predictability, Palantir Style specifies the following (nearly) deterministic scheme.
Beginning with the prose form of the name:
- Convert the phrase to plain ASCII and remove any apostrophes. For example, "Müller's algorithm" might become "Muellers algorithm".
- Divide this result into words, splitting on spaces and any remaining punctuation (typically hyphens).
- Recommended: if any word already has a conventional camel-case appearance in common usage, split this into its constituent parts (e.g., "AdWords" becomes "ad words"). Note that a word such as "iOS" is not really in camel case per se; it defies any convention, so this recommendation does not apply.
- Now lowercase everything (including acronyms), then uppercase only
the first character of:
- each word, to yield upper camel case, or
- each word except the first, to yield lower camel case
- Finally, join all the words into a single identifier.
Note that the casing of the original words is almost entirely disregarded. Examples:
Prose form | Correct | Incorrect |
---|---|---|
"XML HTTP request" | XmlHttpRequest | XMLHTTPRequest |
"new customer ID" | newCustomerId | newCustomerID |
"inner stopwatch" | innerStopwatch | innerStopWatch |
"supports IPv6 on iOS?" | supportsIpv6OnIos | supportsIPv6OnIOS |
"YouTube importer" | YouTubeImporter | Youtubeimporter |
Note: Some words are ambiguously hyphenated in the English language: for example "nonempty" and "non-empty" are both correct, so the method names checkNonempty and checkNonEmpty are likewise both correct.
The basic formatting of Javadoc blocks is as seen in this example:
/**
* Multiple lines of Javadoc text are written here,
* wrapped normally...
*/
public int method(String p1) { ... }
... or in this single-line example:
/** An especially short bit of Javadoc. */
The basic form is always acceptable. The single-line form may be substituted when there are no at-clauses present, and the entirety of the Javadoc block (including comment markers) can fit on a single line.
One blank line—that is, a line containing only the aligned leading
asterisk (*
)—appears before the group of "at-clauses" if present.
Paragraphs are separated by a <p>
tags as follows:
/**
* This is paragraph one.
* <p>
* This is paragraph two.
*/
Any of the standard "at-clauses" that are used appear in the order
@param
, @return
, @throws
, @deprecated
, and these four types
never appear with an empty description. When an at-clause doesn't fit on
a single line, continuation lines are not indented.
The Javadoc for each class and member begins with a brief summary fragment. This fragment is very important: it is the only part of the text that appears in certain contexts such as class and method indexes.
This is a fragment -— a noun phrase or verb phrase, not a complete
sentence. It begins with a 3rd-person-singular verb such as "Returns
...", "Computes ...", "Saves ...", etc. It does not begin with
A {@code Foo} is a...
, or This method returns...
, nor does it form a
complete imperative sentence like Save the record.
. However, the
fragment is capitalized and punctuated as if it were a complete
sentence.
Tip: A common mistake is to write simple Javadoc in the
form /** @return the customer ID */
. This is incorrect,
and should be changed to /** Returns the customer ID. */
.
At the minimum, Javadoc is strongly encouraged for every public class and interface. Javadoc is encouraged for every public or protected member of a class, with a few exceptions noted below.
Other classes and members may have Javadoc as needed. Whenever an implementation comment would be used to define the overall purpose or behavior of a class, method or field, that comment is usually written as Javadoc instead. (It's more uniform, and more tool-friendly.)
Javadoc is discouraged for "simple, obvious" methods like getFoo()
, in
cases where there really and truly is nothing else worthwhile to say but
"Returns the foo".
Note: It is not appropriate to cite this exception to justify
omitting relevant information that a typical reader
might need to know. For example, for a method named
getCanonicalName
, don't omit its documentation (with the
rationale that it would say only
/** Returns the canonical name. */
) if a typical reader may have
no idea what the term "canonical name" means.
Javadoc is not always present on a method that overrides a supertype method.
(not in Google Java Style Guide)
Detailed Javadoc is discouraged for methods whose exact behavior is subject to frequent change; in such cases, the cost of synchronizing method behavior and Javadoc is high and out-of-sync Javadoc is worse than no Javadoc. Javadoc may be present to indicate that method behavior is defined by code or unit tests.