effective_go.html 113 KB
Newer Older
1
<!--{
2 3
	"Title": "Effective Go",
	"Template": true
4
}-->
Rob Pike's avatar
Rob Pike committed
5

Russ Cox's avatar
Russ Cox committed
6 7
<h2 id="introduction">Introduction</h2>

Rob Pike's avatar
Rob Pike committed
8
<p>
Rob Pike's avatar
slices  
Rob Pike committed
9 10 11
Go is a new language.  Although it borrows ideas from
existing languages,
it has unusual properties that make effective Go programs
Russ Cox's avatar
Russ Cox committed
12
different in character from programs written in its relatives.
13
A straightforward translation of a C++ or Java program into Go
Rob Pike's avatar
slices  
Rob Pike committed
14
is unlikely to produce a satisfactory result&mdash;Java programs
15 16 17 18 19 20
are written in Java, not Go.
On the other hand, thinking about the problem from a Go
perspective could produce a successful but quite different
program.
In other words,
to write Go well, it's important to understand its properties
Rob Pike's avatar
Rob Pike committed
21
and idioms.
22 23 24 25
It's also important to know the established conventions for
programming in Go, such as naming, formatting, program
construction, and so on, so that programs you write
will be easy for other Go programmers to understand.
Rob Pike's avatar
Rob Pike committed
26 27
</p>

Russ Cox's avatar
Russ Cox committed
28
<p>
29
This document gives tips for writing clear, idiomatic Go code.
30
It augments the <a href="/ref/spec">language specification</a>,
Rob Pike's avatar
Rob Pike committed
31 32 33
the <a href="http://tour.golang.org/">Tour of Go</a>,
and <a href="/doc/code.html">How to Write Go Code</a>,
all of which you
Rob Pike's avatar
Rob Pike committed
34
should read first.
Russ Cox's avatar
Russ Cox committed
35 36
</p>

Rob Pike's avatar
Rob Pike committed
37
<h3 id="examples">Examples</h3>
Russ Cox's avatar
Russ Cox committed
38 39

<p>
Rob Pike's avatar
Rob Pike committed
40
The <a href="/src/pkg/">Go package sources</a>
Russ Cox's avatar
Russ Cox committed
41 42
are intended to serve not
only as the core library but also as examples of how to
43
use the language.
44 45 46
Moreover, many of the packages contain working, self-contained
executable examples you can run directly from the
<a href="http://golang.org">golang.org</a> web site, such as
47 48
<a href="http://golang.org/pkg/strings/#example_Map">this one</a> (if
necessary, click on the word "Example" to open it up).
49
If you have a question about how to approach a problem or how something
50 51
might be implemented, the documentation, code and examples in the
library can provide answers, ideas and
52
background.
Russ Cox's avatar
Russ Cox committed
53 54 55 56 57 58 59 60
</p>


<h2 id="formatting">Formatting</h2>

<p>
Formatting issues are the most contentious
but the least consequential.
61 62 63 64 65 66
People can adapt to different formatting styles
but it's better if they don't have to, and
less time is devoted to the topic
if everyone adheres to the same style.
The problem is how to approach this Utopia without a long
prescriptive style guide.
Russ Cox's avatar
Russ Cox committed
67 68
</p>

69
<p>
Rob Pike's avatar
Rob Pike committed
70
With Go we take an unusual
71 72
approach and let the machine
take care of most formatting issues.
73
The <code>gofmt</code> program
Shenghou Ma's avatar
Shenghou Ma committed
74
(also available as <code>go fmt</code>, which
75 76
operates at the package level rather than source file level)
reads a Go program
77 78 79 80 81
and emits the source in a standard style of indentation
and vertical alignment, retaining and if necessary
reformatting comments.
If you want to know how to handle some new layout
situation, run <code>gofmt</code>; if the answer doesn't
Rob Pike's avatar
Rob Pike committed
82 83
seem right, rearrange your program (or file a bug about <code>gofmt</code>),
don't work around it.
84
</p>
Russ Cox's avatar
Russ Cox committed
85 86

<p>
87 88 89 90
As an example, there's no need to spend time lining up
the comments on the fields of a structure.
<code>Gofmt</code> will do that for you.  Given the
declaration
Rob Pike's avatar
Rob Pike committed
91 92
</p>

93 94
<pre>
type T struct {
95 96
    name string // name of the object
    value int // its value
97 98
}
</pre>
Rob Pike's avatar
Rob Pike committed
99 100

<p>
Russ Cox's avatar
Russ Cox committed
101
<code>gofmt</code> will line up the columns:
Russ Cox's avatar
Russ Cox committed
102 103
</p>

104 105
<pre>
type T struct {
106 107
    name    string // name of the object
    value   int    // its value
108 109
}
</pre>
Russ Cox's avatar
Russ Cox committed
110 111

<p>
Rob Pike's avatar
Rob Pike committed
112
All Go code in the standard packages has been formatted with <code>gofmt</code>.
Russ Cox's avatar
Russ Cox committed
113 114 115 116
</p>


<p>
117
Some formatting details remain.  Very briefly:
Russ Cox's avatar
Russ Cox committed
118 119
</p>

120
<dl>
121 122 123 124 125 126 127 128 129 130 131
    <dt>Indentation</dt>
    <dd>We use tabs for indentation and <code>gofmt</code> emits them by default.
    Use spaces only if you must.
    </dd>
    <dt>Line length</dt>
    <dd>
    Go has no line length limit.  Don't worry about overflowing a punched card.
    If a line feels too long, wrap it and indent with an extra tab.
    </dd>
    <dt>Parentheses</dt>
    <dd>
132
    Go needs fewer parentheses than C and Java: control structures (<code>if</code>,
133
    <code>for</code>, <code>switch</code>) do not have parentheses in
134 135
    their syntax.
    Also, the operator precedence hierarchy is shorter and clearer, so
136 137 138
<pre>
x&lt;&lt;8 + y&lt;&lt;16
</pre>
139
    means what the spacing implies, unlike in the other languages.
140
    </dd>
141
</dl>
Russ Cox's avatar
Russ Cox committed
142

Rob Pike's avatar
Rob Pike committed
143
<h2 id="commentary">Commentary</h2>
Russ Cox's avatar
Russ Cox committed
144 145

<p>
146
Go provides C-style <code>/* */</code> block comments
Russ Cox's avatar
Russ Cox committed
147
and C++-style <code>//</code> line comments.
Rob Pike's avatar
Rob Pike committed
148
Line comments are the norm;
149 150
block comments appear mostly as package comments, but
are useful within an expression or to disable large swaths of code.
Russ Cox's avatar
Russ Cox committed
151 152
</p>

Rob Pike's avatar
Rob Pike committed
153 154 155 156 157 158 159 160 161
<p>
The program—and web server—<code>godoc</code> processes
Go source files to extract documentation about the contents of the
package.
Comments that appear before top-level declarations, with no intervening newlines,
are extracted along with the declaration to serve as explanatory text for the item.
The nature and style of these comments determines the
quality of the documentation <code>godoc</code> produces.
</p>
Rob Pike's avatar
Rob Pike committed
162 163

<p>
Rob Pike's avatar
Rob Pike committed
164
Every package should have a <i>package comment</i>, a block
Rob Pike's avatar
Rob Pike committed
165
comment preceding the package clause.
Rob Pike's avatar
Rob Pike committed
166 167 168
For multi-file packages, the package comment only needs to be
present in one file, and any one will do.
The package comment should introduce the package and
Rob Pike's avatar
Rob Pike committed
169
provide information relevant to the package as a whole.
Rob Pike's avatar
Rob Pike committed
170 171
It will appear first on the <code>godoc</code> page and
should set up the detailed documentation that follows.
Rob Pike's avatar
Rob Pike committed
172 173 174 175
</p>

<pre>
/*
176
Package regexp implements a simple library for regular expressions.
177

178
The syntax of the regular expressions accepted is:
179 180 181 182 183 184 185 186 187 188 189 190 191 192

    regexp:
        concatenation { '|' concatenation }
    concatenation:
        { closure }
    closure:
        term [ '*' | '+' | '?' ]
    term:
        '^'
        '$'
        '.'
        character
        '[' [ '^' ] character-ranges ']'
        '(' regexp ')'
Rob Pike's avatar
Rob Pike committed
193 194 195 196 197
*/
package regexp
</pre>

<p>
Rob Pike's avatar
Rob Pike committed
198
If the package is simple, the package comment can be brief.
Rob Pike's avatar
Rob Pike committed
199 200 201
</p>

<pre>
202
// Package path implements utility routines for
Rob Pike's avatar
Rob Pike committed
203 204 205
// manipulating slash-separated filename paths.
</pre>

Russ Cox's avatar
Russ Cox committed
206
<p>
Rob Pike's avatar
Rob Pike committed
207 208
Comments do not need extra formatting such as banners of stars.
The generated output may not even be presented in a fixed-width font, so don't depend
209
on spacing for alignment&mdash;<code>godoc</code>, like <code>gofmt</code>,
Rob Pike's avatar
Rob Pike committed
210
takes care of that.
211
The comments are uninterpreted plain text, so HTML and other
Rob Pike's avatar
Rob Pike committed
212 213
annotations such as <code>_this_</code> will reproduce <i>verbatim</i> and should
not be used.
214 215 216 217 218 219 220
One adjustment <code>godoc</code> does do is to display indented
text in a fixed-width font, suitable for program snippets.
The package comment for the
<a href="http://golang.org/pkg/fmt/"><code>fmt</code> package</a> uses this to good effect.
</p>

<p>
221 222 223 224
Depending on the context, <code>godoc</code> might not even
reformat comments, so make sure they look good straight up:
use correct spelling, punctuation, and sentence structure,
fold long lines, and so on.
Russ Cox's avatar
Russ Cox committed
225 226 227
</p>

<p>
Rob Pike's avatar
Rob Pike committed
228 229
Inside a package, any comment immediately preceding a top-level declaration
serves as a <i>doc comment</i> for that declaration.
Russ Cox's avatar
Russ Cox committed
230
Every exported (capitalized) name in a program should
Rob Pike's avatar
Rob Pike committed
231
have a doc comment.
Russ Cox's avatar
Russ Cox committed
232 233 234
</p>

<p>
235
Doc comments work best as complete sentences, which allow
Rob Pike's avatar
Rob Pike committed
236
a wide variety of automated presentations.
Russ Cox's avatar
Russ Cox committed
237
The first sentence should be a one-sentence summary that
238
starts with the name being declared.
Russ Cox's avatar
Russ Cox committed
239 240 241
</p>

<pre>
Rob Pike's avatar
Rob Pike committed
242 243
// Compile parses a regular expression and returns, if successful, a Regexp
// object that can be used to match against text.
244
func Compile(str string) (regexp *Regexp, err error) {
Russ Cox's avatar
Russ Cox committed
245 246
</pre>

247 248 249 250 251 252 253 254 255 256 257 258 259 260 261 262 263 264 265 266 267 268 269 270 271 272 273
<p>
If the name always begins the comment, the output of <code>godoc</code>
can usefully be run through <code>grep</code>.
Imagine you couldn't remember the name "Compile" but were looking for
the parsing function for regular expressions, so you ran
the command,
</p>

<pre>
$ godoc regexp | grep parse
</pre>

<p>
If all the doc comments in the package began, "This function...", <code>grep</code>
wouldn't help you remember the name. But because the package starts each
doc comment with the name, you'd see something like this,
which recalls the word you're looking for.
</p>

<pre>
$ godoc regexp | grep parse
    Compile parses a regular expression and returns, if successful, a Regexp
    parsed. It simplifies safe initialization of global variables holding
    cannot be parsed. It simplifies safe initialization of global variables
$
</pre>

Rob Pike's avatar
Rob Pike committed
274 275
<p>
Go's declaration syntax allows grouping of declarations.
Rob Pike's avatar
Rob Pike committed
276 277
A single doc comment can introduce a group of related constants or variables.
Since the whole declaration is presented, such a comment can often be perfunctory.
Rob Pike's avatar
Rob Pike committed
278 279 280
</p>

<pre>
Rob Pike's avatar
Rob Pike committed
281 282
// Error codes returned by failures to parse an expression.
var (
283 284 285
    ErrInternal      = errors.New("regexp: internal error")
    ErrUnmatchedLpar = errors.New("regexp: unmatched '('")
    ErrUnmatchedRpar = errors.New("regexp: unmatched ')'")
286
    ...
Rob Pike's avatar
Rob Pike committed
287 288 289 290
)
</pre>

<p>
291
Grouping can also indicate relationships between items,
Russ Cox's avatar
Russ Cox committed
292
such as the fact that a set of variables is protected by a mutex.
Rob Pike's avatar
Rob Pike committed
293 294 295 296
</p>

<pre>
var (
297 298 299
    countLock   sync.Mutex
    inputCount  uint32
    outputCount uint32
300
    errorCount  uint32
Rob Pike's avatar
Rob Pike committed
301 302 303
)
</pre>

Russ Cox's avatar
Russ Cox committed
304 305
<h2 id="names">Names</h2>

Rob Pike's avatar
names  
Rob Pike committed
306 307
<p>
Names are as important in Go as in any other language.
308
They even have semantic effect:
Rob Pike's avatar
names  
Rob Pike committed
309
the visibility of a name outside a package is determined by whether its
Rob Pike's avatar
Rob Pike committed
310
first character is upper case.
Rob Pike's avatar
names  
Rob Pike committed
311 312 313 314 315 316
It's therefore worth spending a little time talking about naming conventions
in Go programs.
</p>


<h3 id="package-names">Package names</h3>
Russ Cox's avatar
Russ Cox committed
317 318

<p>
Rob Pike's avatar
names  
Rob Pike committed
319 320
When a package is imported, the package name becomes an accessor for the
contents.  After
Russ Cox's avatar
Russ Cox committed
321 322
</p>

Rob Pike's avatar
names  
Rob Pike committed
323 324 325
<pre>
import "bytes"
</pre>
Russ Cox's avatar
Russ Cox committed
326 327

<p>
Rob Pike's avatar
names  
Rob Pike committed
328 329 330 331 332 333 334 335 336 337 338 339
the importing package can talk about <code>bytes.Buffer</code>.  It's
helpful if everyone using the package can use the same name to refer to
its contents, which implies that the package name should be good:
short, concise, evocative.  By convention, packages are given
lower case, single-word names; there should be no need for underscores
or mixedCaps.
Err on the side of brevity, since everyone using your
package will be typing that name.
And don't worry about collisions <i>a priori</i>.
The package name is only the default name for imports; it need not be unique
across all source code, and in the rare case of a collision the
importing package can choose a different name to use locally.
Rob Pike's avatar
slices  
Rob Pike committed
340
In any case, confusion is rare because the file name in the import
Ian Lance Taylor's avatar
Ian Lance Taylor committed
341
determines just which package is being used.
Rob Pike's avatar
names  
Rob Pike committed
342 343 344 345 346
</p>

<p>
Another convention is that the package name is the base name of
its source directory;
347 348 349
the package in <code>src/pkg/encoding/base64</code>
is imported as <code>"encoding/base64"</code> but has name <code>base64</code>,
not <code>encoding_base64</code> and not <code>encodingBase64</code>.
Russ Cox's avatar
Russ Cox committed
350 351 352
</p>

<p>
353
The importer of a package will use the name to refer to its contents,
Rob Pike's avatar
Rob Pike committed
354
so exported names in the package can use that fact
Rob Pike's avatar
names  
Rob Pike committed
355
to avoid stutter.
356 357
(Don't use the <code>import .</code> notation, which can simplify
tests that must run outside the package they are testing, but should otherwise be avoided.)
Rob Pike's avatar
names  
Rob Pike committed
358 359 360 361 362 363
For instance, the buffered reader type in the <code>bufio</code> package is called <code>Reader</code>,
not <code>BufReader</code>, because users see it as <code>bufio.Reader</code>,
which is a clear, concise name.
Moreover,
because imported entities are always addressed with their package name, <code>bufio.Reader</code>
does not conflict with <code>io.Reader</code>.
364
Similarly, the function to make new instances of <code>ring.Ring</code>&mdash;which
Russ Cox's avatar
Russ Cox committed
365
is the definition of a <em>constructor</em> in Go&mdash;would
366 367
normally be called <code>NewRing</code>, but since
<code>Ring</code> is the only type exported by the package, and since the
Rob Pike's avatar
Rob Pike committed
368 369
package is called <code>ring</code>, it's called just <code>New</code>,
which clients of the package see as <code>ring.New</code>.
Rob Pike's avatar
names  
Rob Pike committed
370
Use the package structure to help you choose good names.
Russ Cox's avatar
Russ Cox committed
371 372 373
</p>

<p>
Rob Pike's avatar
names  
Rob Pike committed
374 375 376 377
Another short example is <code>once.Do</code>;
<code>once.Do(setup)</code> reads well and would not be improved by
writing <code>once.DoOrWaitUntilDone(setup)</code>.
Long names don't automatically make things more readable.
378
A helpful doc comment can often be more valuable than an extra long name.
Russ Cox's avatar
Russ Cox committed
379 380
</p>

Rob Pike's avatar
Rob Pike committed
381 382 383 384 385 386 387 388 389 390 391 392 393 394 395 396 397
<h3 id="Getters">Getters</h3>

<p>
Go doesn't provide automatic support for getters and setters.
There's nothing wrong with providing getters and setters yourself,
and it's often appropriate to do so, but it's neither idiomatic nor necessary
to put <code>Get</code> into the getter's name.  If you have a field called
<code>owner</code> (lower case, unexported), the getter method should be
called <code>Owner</code> (upper case, exported), not <code>GetOwner</code>.
The use of upper-case names for export provides the hook to discriminate
the field from the method.
A setter function, if needed, will likely be called <code>SetOwner</code>.
Both names read well in practice:
</p>
<pre>
owner := obj.Owner()
if owner != user {
398
    obj.SetOwner(user)
Rob Pike's avatar
Rob Pike committed
399 400 401
}
</pre>

Rob Pike's avatar
names  
Rob Pike committed
402
<h3 id="interface-names">Interface names</h3>
Russ Cox's avatar
Russ Cox committed
403 404

<p>
Rob Pike's avatar
names  
Rob Pike committed
405
By convention, one-method interfaces are named by
406 407 408 409
the method name plus an -er suffix or similar modification
to construct an agent noun: <code>Reader</code>,
<code>Writer</code>, <code>Formatter</code>,
<code>CloseNotifier</code> etc.
Russ Cox's avatar
Russ Cox committed
410 411 412
</p>

<p>
Rob Pike's avatar
names  
Rob Pike committed
413 414 415 416
There are a number of such names and it's productive to honor them and the function
names they capture.
<code>Read</code>, <code>Write</code>, <code>Close</code>, <code>Flush</code>,
<code>String</code> and so on have
Russ Cox's avatar
Russ Cox committed
417 418 419 420 421
canonical signatures and meanings.  To avoid confusion,
don't give your method one of those names unless it
has the same signature and meaning.
Conversely, if your type implements a method with the
same meaning as a method on a well-known type,
Rob Pike's avatar
names  
Rob Pike committed
422 423
give it the same name and signature;
call your string-converter method <code>String</code> not <code>ToString</code>.
Russ Cox's avatar
Russ Cox committed
424 425
</p>

Rob Pike's avatar
names  
Rob Pike committed
426 427
<h3 id="mixed-caps">MixedCaps</h3>

Russ Cox's avatar
Russ Cox committed
428
<p>
Rob Pike's avatar
slices  
Rob Pike committed
429
Finally, the convention in Go is to use <code>MixedCaps</code>
Rob Pike's avatar
names  
Rob Pike committed
430 431
or <code>mixedCaps</code> rather than underscores to write
multiword names.
Russ Cox's avatar
Russ Cox committed
432 433
</p>

Rob Pike's avatar
Rob Pike committed
434
<h2 id="semicolons">Semicolons</h2>
Rob Pike's avatar
names  
Rob Pike committed
435

Rob Pike's avatar
Rob Pike committed
436
<p>
437 438
Like C, Go's formal grammar uses semicolons to terminate statements,
but unlike in C, those semicolons do not appear in the source.
439 440
Instead the lexer uses a simple rule to insert semicolons automatically
as it scans, so the input text is mostly free of them.
Rob Pike's avatar
Rob Pike committed
441
</p>
Russ Cox's avatar
Russ Cox committed
442

443 444 445 446 447 448 449 450 451 452 453 454
<p>
The rule is this. If the last token before a newline is an identifier
(which includes words like <code>int</code> and <code>float64</code>),
a basic literal such as a number or string constant, or one of the
tokens
</p>
<pre>
break continue fallthrough return ++ -- ) }
</pre>
<p>
the lexer always inserts a semicolon after the token.
This could be summarized as, &ldquo;if the newline comes
455
after a token that could end a statement, insert a semicolon&rdquo;.
456 457 458 459 460 461
</p>

<p>
A semicolon can also be omitted immediately before a closing brace,
so a statement such as
</p>
Rob Pike's avatar
Rob Pike committed
462 463 464
<pre>
    go func() { for { dst &lt;- &lt;-src } }()
</pre>
465 466 467 468 469 470 471
<p>
needs no semicolons.
Idiomatic Go programs have semicolons only in places such as
<code>for</code> loop clauses, to separate the initializer, condition, and
continuation elements.  They are also necessary to separate multiple
statements on a line, should you write code that way.
</p>
Russ Cox's avatar
Russ Cox committed
472 473

<p>
474 475
One consequence of the semicolon insertion rules
is that you cannot put the opening brace of a
476 477 478 479
control structure (<code>if</code>, <code>for</code>, <code>switch</code>,
or <code>select</code>) on the next line.  If you do, a semicolon
will be inserted before the brace, which could cause unwanted
effects.  Write them like this
Russ Cox's avatar
Russ Cox committed
480 481
</p>

Rob Pike's avatar
Rob Pike committed
482
<pre>
483
if i &lt; f() {
484
    g()
Rob Pike's avatar
Rob Pike committed
485
}
486
</pre>
Rob Pike's avatar
Rob Pike committed
487
<p>
488
not like this
Rob Pike's avatar
Rob Pike committed
489
</p>
490
<pre>
491
if i &lt; f()  // wrong!
492 493 494 495 496
{           // wrong!
    g()
}
</pre>

Rob Pike's avatar
Rob Pike committed
497 498 499 500

<h2 id="control-structures">Control structures</h2>

<p>
501
The control structures of Go are related to those of C but differ
Rob Pike's avatar
Rob Pike committed
502 503 504 505 506 507 508
in important ways.
There is no <code>do</code> or <code>while</code> loop, only a
slightly generalized
<code>for</code>;
<code>switch</code> is more flexible;
<code>if</code> and <code>switch</code> accept an optional
initialization statement like that of <code>for</code>;
509 510
<code>break</code> and <code>continue</code> statements
take an optional label to identify what to break or continue;
Rob Pike's avatar
Rob Pike committed
511 512
and there are new control structures including a type switch and a
multiway communications multiplexer, <code>select</code>.
Rob Pike's avatar
slices  
Rob Pike committed
513
The syntax is also slightly different:
514
there are no parentheses
Rob Pike's avatar
Rob Pike committed
515 516 517 518 519 520 521 522
and the bodies must always be brace-delimited.
</p>

<h3 id="if">If</h3>

<p>
In Go a simple <code>if</code> looks like this:
</p>
523
<pre>
524
if x &gt; 0 {
Rob Pike's avatar
Rob Pike committed
525 526
    return y
}
Rob Pike's avatar
Rob Pike committed
527
</pre>
Russ Cox's avatar
Russ Cox committed
528

Rob Pike's avatar
Rob Pike committed
529 530 531 532 533 534 535 536 537
<p>
Mandatory braces encourage writing simple <code>if</code> statements
on multiple lines.  It's good style to do so anyway,
especially when the body contains a control statement such as a
<code>return</code> or <code>break</code>.
</p>

<p>
Since <code>if</code> and <code>switch</code> accept an initialization
538
statement, it's common to see one used to set up a local variable.
Rob Pike's avatar
Rob Pike committed
539
</p>
Russ Cox's avatar
Russ Cox committed
540 541

<pre>
Rob Pike's avatar
Rob Pike committed
542
if err := file.Chmod(0664); err != nil {
543
    log.Print(err)
544
    return err
Rob Pike's avatar
Rob Pike committed
545
}
Russ Cox's avatar
Russ Cox committed
546 547
</pre>

Rob Pike's avatar
Rob Pike committed
548 549 550 551 552 553 554
<p id="else">
In the Go libraries, you'll find that
when an <code>if</code> statement doesn't flow into the next statement—that is,
the body ends in <code>break</code>, <code>continue</code>,
<code>goto</code>, or <code>return</code>—the unnecessary
<code>else</code> is omitted.
</p>
Russ Cox's avatar
Russ Cox committed
555

Rob Pike's avatar
Rob Pike committed
556
<pre>
557
f, err := os.Open(name)
Rob Pike's avatar
Rob Pike committed
558
if err != nil {
559
    return err
Rob Pike's avatar
Rob Pike committed
560
}
561
codeUsing(f)
Rob Pike's avatar
Rob Pike committed
562
</pre>
Russ Cox's avatar
Russ Cox committed
563 564

<p>
Rob Pike's avatar
Rob Pike committed
565 566
This is an example of a common situation where code must guard against a
sequence of error conditions.  The code reads well if the
Rob Pike's avatar
Rob Pike committed
567 568
successful flow of control runs down the page, eliminating error cases
as they arise.  Since error cases tend to end in <code>return</code>
569
statements, the resulting code needs no <code>else</code> statements.
Russ Cox's avatar
Russ Cox committed
570 571 572
</p>

<pre>
573
f, err := os.Open(name)
Russ Cox's avatar
Russ Cox committed
574
if err != nil {
575
    return err
Russ Cox's avatar
Russ Cox committed
576
}
577
d, err := f.Stat()
Rob Pike's avatar
Rob Pike committed
578
if err != nil {
Rob Pike's avatar
Rob Pike committed
579
    f.Close()
580
    return err
Rob Pike's avatar
Rob Pike committed
581
}
582
codeUsing(f, d)
Russ Cox's avatar
Russ Cox committed
583 584
</pre>

Rob Pike's avatar
Rob Pike committed
585

586
<h3 id="redeclaration">Redeclaration and reassignment</h3>
Rob Pike's avatar
Rob Pike committed
587 588 589 590 591 592 593 594 595 596 597 598 599 600 601 602 603 604 605 606 607 608 609 610 611 612 613 614 615 616 617 618 619 620 621 622

<p>
An aside: The last example in the previous section demonstrates a detail of how the
<code>:=</code> short declaration form works.
The declaration that calls <code>os.Open</code> reads,
</p>

<pre>
f, err := os.Open(name)
</pre>

<p>
This statement declares two variables, <code>f</code> and <code>err</code>.
A few lines later, the call to <code>f.Stat</code> reads,
</p>

<pre>
d, err := f.Stat()
</pre>

<p>
which looks as if it declares <code>d</code> and <code>err</code>.
Notice, though, that <code>err</code> appears in both statements.
This duplication is legal: <code>err</code> is declared by the first statement,
but only <em>re-assigned</em> in the second.
This means that the call to <code>f.Stat</code> uses the existing
<code>err</code> variable declared above, and just gives it a new value.
</p>

<p>
In a <code>:=</code> declaration a variable <code>v</code> may appear even
if it has already been declared, provided:
</p>

<ul>
<li>this declaration is in the same scope as the existing declaration of <code>v</code>
623
(if <code>v</code> is already declared in an outer scope, the declaration will create a new variable §),</li>
Rob Pike's avatar
Rob Pike committed
624 625 626 627 628 629 630 631 632 633 634
<li>the corresponding value in the initialization is assignable to <code>v</code>, and</li>
<li>there is at least one other variable in the declaration that is being declared anew.</li>
</ul>

<p>
This unusual property is pure pragmatism,
making it easy to use a single <code>err</code> value, for example,
in a long <code>if-else</code> chain.
You'll see it used often.
</p>

635 636 637 638 639 640
<p>
§ It's worth noting here that in Go the scope of function parameters and return values
is the same as the function body, even though they appear lexically outside the braces
that enclose the body.
</p>

Rob Pike's avatar
Rob Pike committed
641 642 643
<h3 id="for">For</h3>

<p>
644
The Go <code>for</code> loop is similar to&mdash;but not the same as&mdash;C's.
Rob Pike's avatar
Rob Pike committed
645 646
It unifies <code>for</code>
and <code>while</code> and there is no <code>do-while</code>.
647
There are three forms, only one of which has semicolons.
Rob Pike's avatar
Rob Pike committed
648 649
</p>
<pre>
Rob Pike's avatar
Rob Pike committed
650
// Like a C for
Rob Pike's avatar
Rob Pike committed
651 652
for init; condition; post { }

Rob Pike's avatar
Rob Pike committed
653
// Like a C while
Rob Pike's avatar
Rob Pike committed
654 655 656 657 658 659 660
for condition { }

// Like a C for(;;)
for { }
</pre>

<p>
661
Short declarations make it easy to declare the index variable right in the loop.
Rob Pike's avatar
Rob Pike committed
662 663
</p>
<pre>
664
sum := 0
665
for i := 0; i &lt; 10; i++ {
Rob Pike's avatar
Rob Pike committed
666 667 668 669 670
    sum += i
}
</pre>

<p>
Rob Pike's avatar
Rob Pike committed
671 672
If you're looping over an array, slice, string, or map,
or reading from a channel, a <code>range</code> clause can
673
manage the loop.
Rob Pike's avatar
Rob Pike committed
674 675
</p>
<pre>
Rob Pike's avatar
Rob Pike committed
676 677 678 679 680 681 682 683 684 685
for key, value := range oldMap {
    newMap[key] = value
}
</pre>

<p>
If you only need the first item in the range (the key or index), drop the second:
</p>
<pre>
for key := range m {
686
    if key.expired() {
Rob Pike's avatar
Rob Pike committed
687 688 689 690 691 692 693 694 695
        delete(m, key)
    }
}
</pre>

<p>
If you only need the second item in the range (the value), use the <em>blank identifier</em>, an underscore, to discard the first:
</p>
<pre>
696
sum := 0
Rob Pike's avatar
Rob Pike committed
697
for _, value := range array {
Rob Pike's avatar
Rob Pike committed
698 699 700 701
    sum += value
}
</pre>

702 703
<p>
The blank identifier has many uses, as described in <a href="#blank">a later section</a>.
704
</p>
705

Rob Pike's avatar
Rob Pike committed
706
<p>
Rob Pike's avatar
Rob Pike committed
707
For strings, the <code>range</code> does more work for you, breaking out individual
708
Unicode code points by parsing the UTF-8.
Rob Pike's avatar
Rob Pike committed
709
Erroneous encodings consume one byte and produce the
710 711 712
replacement rune U+FFFD.
(The name (with associated builtin type) <code>rune</code> is Go terminology for a
single Unicode code point.
713
See <a href="http://golang.org/ref/spec#Rune_literals">the language specification</a>
714 715
for details.)
The loop
Rob Pike's avatar
Rob Pike committed
716 717
</p>
<pre>
718 719
for pos, char := range "日本\x80語" { // \x80 is an illegal UTF-8 encoding
    fmt.Printf("character %#U starts at byte position %d\n", char, pos)
Rob Pike's avatar
Rob Pike committed
720 721 722 723 724 725
}
</pre>
<p>
prints
</p>
<pre>
726 727 728 729
character U+65E5 '日' starts at byte position 0
character U+672C '本' starts at byte position 3
character U+FFFD '�' starts at byte position 6
character U+8A9E '語' starts at byte position 7
Rob Pike's avatar
Rob Pike committed
730 731
</pre>

Rob Pike's avatar
Rob Pike committed
732
<p>
Rob Pike's avatar
Rob Pike committed
733 734 735
Finally, Go has no comma operator and <code>++</code> and <code>--</code>
are statements not expressions.
Thus if you want to run multiple variables in a <code>for</code>
736
you should use parallel assignment (although that precludes <code>++</code> and <code>--</code>).
Rob Pike's avatar
Rob Pike committed
737 738 739
</p>
<pre>
// Reverse a
740
for i, j := 0, len(a)-1; i &lt; j; i, j = i+1, j-1 {
741
    a[i], a[j] = a[j], a[i]
Rob Pike's avatar
Rob Pike committed
742 743 744
}
</pre>

Russ Cox's avatar
Russ Cox committed
745 746 747
<h3 id="switch">Switch</h3>

<p>
Rob Pike's avatar
Rob Pike committed
748
Go's <code>switch</code> is more general than C's.
Rob Pike's avatar
Rob Pike committed
749 750 751 752
The expressions need not be constants or even integers,
the cases are evaluated top to bottom until a match is found,
and if the <code>switch</code> has no expression it switches on
<code>true</code>.
753
It's therefore possible&mdash;and idiomatic&mdash;to write an
Rob Pike's avatar
Rob Pike committed
754
<code>if</code>-<code>else</code>-<code>if</code>-<code>else</code>
755
chain as a <code>switch</code>.
Russ Cox's avatar
Russ Cox committed
756 757 758 759 760 761 762 763 764 765 766 767 768 769 770 771
</p>

<pre>
func unhex(c byte) byte {
    switch {
    case '0' &lt;= c &amp;&amp; c &lt;= '9':
        return c - '0'
    case 'a' &lt;= c &amp;&amp; c &lt;= 'f':
        return c - 'a' + 10
    case 'A' &lt;= c &amp;&amp; c &lt;= 'F':
        return c - 'A' + 10
    }
    return 0
}
</pre>

Rob Pike's avatar
Rob Pike committed
772 773
<p>
There is no automatic fall through, but cases can be presented
774
in comma-separated lists.
775
</p>
Russ Cox's avatar
Russ Cox committed
776 777 778
<pre>
func shouldEscape(c byte) bool {
    switch c {
Rob Pike's avatar
Rob Pike committed
779
    case ' ', '?', '&amp;', '=', '#', '+', '%':
Russ Cox's avatar
Russ Cox committed
780 781 782 783 784 785
        return true
    }
    return false
}
</pre>

Rob Pike's avatar
Rob Pike committed
786
<p>
787 788 789 790 791 792 793 794 795 796 797 798
Although they are not nearly as common in Go as some other C-like
languages, <code>break</code> statements can be used to terminate
a <code>switch</code> early.
Sometimes, though, it's necessary to break out of a surrounding loop,
not the switch, and in Go that can be accomplished by putting a label
on the loop and "breaking" to that label.
This example shows both uses.
</p>

<pre>
Loop:
	for n := 0; n &lt; len(src); n += size {
799
		switch {
800 801 802 803 804 805 806 807 808 809 810 811 812 813 814 815 816 817 818 819 820 821 822 823 824 825 826 827
		case src[n] &lt; sizeOne:
			if validateOnly {
				break
			}
			size = 1
			update(src[n])

		case src[n] &lt; sizeTwo:
			if n+1 &gt;= len(src) {
				err = errShortInput
				break Loop
			}
			if validateOnly {
				break
			}
			size = 2
			update(src[n] + src[n+1]&lt;&lt;shift)
		}
	}
</pre>

<p>
Of course, the <code>continue</code> statement also accepts an optional label
but it applies only to loops.
</p>

<p>
To close this section, here's a comparison routine for byte slices that uses two
Rob Pike's avatar
Rob Pike committed
828
<code>switch</code> statements:
829
</p>
Rob Pike's avatar
Rob Pike committed
830
<pre>
831
// Compare returns an integer comparing the two byte slices,
Rob Pike's avatar
Rob Pike committed
832
// lexicographically.
Rob Pike's avatar
Rob Pike committed
833
// The result will be 0 if a == b, -1 if a &lt; b, and +1 if a &gt; b
Rob Pike's avatar
Rob Pike committed
834 835 836 837 838 839 840 841 842 843 844 845
func Compare(a, b []byte) int {
    for i := 0; i &lt; len(a) &amp;&amp; i &lt; len(b); i++ {
        switch {
        case a[i] &gt; b[i]:
            return 1
        case a[i] &lt; b[i]:
            return -1
        }
    }
    switch {
    case len(a) &gt; len(b):
        return 1
846 847
    case len(a) &lt; len(b):
        return -1
Rob Pike's avatar
Rob Pike committed
848 849 850 851 852
    }
    return 0
}
</pre>

853
<h3 id="type_switch">Type switch</h3>
854

Rob Pike's avatar
Rob Pike committed
855 856 857 858 859 860
<p>
A switch can also be used to discover the dynamic type of an interface
variable.  Such a <em>type switch</em> uses the syntax of a type
assertion with the keyword <code>type</code> inside the parentheses.
If the switch declares a variable in the expression, the variable will
have the corresponding type in each clause.
861 862
It's also idiomatic to reuse the name in such cases, in effect declaring
a new variable with the same name but a different type in each case.
Rob Pike's avatar
Rob Pike committed
863 864
</p>
<pre>
865 866 867
var t interface{}
t = functionOfSomeType()
switch t := t.(type) {
Rob Pike's avatar
Rob Pike committed
868
default:
869
    fmt.Printf("unexpected type %T", t)       // %T prints whatever type t has
Rob Pike's avatar
Rob Pike committed
870
case bool:
871
    fmt.Printf("boolean %t\n", t)             // t has type bool
Rob Pike's avatar
Rob Pike committed
872
case int:
873
    fmt.Printf("integer %d\n", t)             // t has type int
Rob Pike's avatar
Rob Pike committed
874
case *bool:
875
    fmt.Printf("pointer to boolean %t\n", *t) // t has type *bool
Rob Pike's avatar
Rob Pike committed
876
case *int:
877
    fmt.Printf("pointer to integer %d\n", *t) // t has type *int
Rob Pike's avatar
Rob Pike committed
878 879 880
}
</pre>

Russ Cox's avatar
Russ Cox committed
881 882
<h2 id="functions">Functions</h2>

Rob Pike's avatar
Rob Pike committed
883 884 885
<h3 id="multiple-returns">Multiple return values</h3>

<p>
Ian Lance Taylor's avatar
Ian Lance Taylor committed
886
One of Go's unusual features is that functions and methods
Rob Pike's avatar
Rob Pike committed
887
can return multiple values.  This form can be used to
Rob Pike's avatar
slices  
Rob Pike committed
888
improve on a couple of clumsy idioms in C programs: in-band
889 890
error returns such as <code>-1</code> for <code>EOF</code>
and modifying an argument passed by address.
Rob Pike's avatar
Rob Pike committed
891 892 893
</p>

<p>
Russ Cox's avatar
Russ Cox committed
894
In C, a write error is signaled by a negative count with the
Rob Pike's avatar
Rob Pike committed
895 896
error code secreted away in a volatile location.
In Go, <code>Write</code>
Russ Cox's avatar
Russ Cox committed
897 898
can return a count <i>and</i> an error: &ldquo;Yes, you wrote some
bytes but not all of them because you filled the device&rdquo;.
899 900
The signature of the <code>Write</code> method on files from
package <code>os</code> is:
Rob Pike's avatar
Rob Pike committed
901 902 903
</p>

<pre>
904
func (file *File) Write(b []byte) (n int, err error)
Rob Pike's avatar
Rob Pike committed
905 906 907 908
</pre>

<p>
and as the documentation says, it returns the number of bytes
909
written and a non-nil <code>error</code> when <code>n</code>
Rob Pike's avatar
Rob Pike committed
910 911 912 913 914 915
<code>!=</code> <code>len(b)</code>.
This is a common style; see the section on error handling for more examples.
</p>

<p>
A similar approach obviates the need to pass a pointer to a return
Rob Pike's avatar
slices  
Rob Pike committed
916 917
value to simulate a reference parameter.
Here's a simple-minded function to
918
grab a number from a position in a byte slice, returning the number
Rob Pike's avatar
Rob Pike committed
919 920 921 922 923
and the next position.
</p>

<pre>
func nextInt(b []byte, i int) (int, int) {
924
    for ; i &lt; len(b) &amp;&amp; !isDigit(b[i]); i++ {
925 926
    }
    x := 0
927
    for ; i &lt; len(b) &amp;&amp; isDigit(b[i]); i++ {
928
        x = x*10 + int(b[i]) - '0'
929 930
    }
    return x, i
Rob Pike's avatar
Rob Pike committed
931 932 933 934
}
</pre>

<p>
935
You could use it to scan the numbers in an input slice <code>b</code> like this:
Rob Pike's avatar
Rob Pike committed
936 937 938
</p>

<pre>
939 940
    for i := 0; i &lt; len(b); {
        x, i = nextInt(b, i)
941 942
        fmt.Println(x)
    }
Rob Pike's avatar
Rob Pike committed
943 944 945
</pre>

<h3 id="named-results">Named result parameters</h3>
Russ Cox's avatar
Russ Cox committed
946 947

<p>
Rob Pike's avatar
Rob Pike committed
948 949
The return or result "parameters" of a Go function can be given names and
used as regular variables, just like the incoming parameters.
Rob Pike's avatar
slices  
Rob Pike committed
950
When named, they are initialized to the zero values for their types when
Rob Pike's avatar
Rob Pike committed
951
the function begins; if the function executes a <code>return</code> statement
952
with no arguments, the current values of the result parameters are
Rob Pike's avatar
Rob Pike committed
953
used as the returned values.
Russ Cox's avatar
Russ Cox committed
954 955
</p>

Rob Pike's avatar
Rob Pike committed
956 957 958 959 960 961 962 963 964 965 966
<p>
The names are not mandatory but they can make code shorter and clearer:
they're documentation.
If we name the results of <code>nextInt</code> it becomes
obvious which returned <code>int</code>
is which.
</p>

<pre>
func nextInt(b []byte, pos int) (value, nextPos int) {
</pre>
Russ Cox's avatar
Russ Cox committed
967 968

<p>
Rob Pike's avatar
Rob Pike committed
969 970 971
Because named results are initialized and tied to an unadorned return, they can simplify
as well as clarify.  Here's a version
of <code>io.ReadFull</code> that uses them well:
Russ Cox's avatar
Russ Cox committed
972 973
</p>

Rob Pike's avatar
Rob Pike committed
974
<pre>
975
func ReadFull(r Reader, buf []byte) (n int, err error) {
976
    for len(buf) &gt; 0 &amp;&amp; err == nil {
977 978 979
        var nr int
        nr, err = r.Read(buf)
        n += nr
980
        buf = buf[nr:]
981 982
    }
    return
Rob Pike's avatar
Rob Pike committed
983 984 985
}
</pre>

986 987 988 989 990 991 992 993 994 995 996 997 998
<h3 id="defer">Defer</h3>

<p>
Go's <code>defer</code> statement schedules a function call (the
<i>deferred</i> function) to be run immediately before the function
executing the <code>defer</code> returns.  It's an unusual but
effective way to deal with situations such as resources that must be
released regardless of which path a function takes to return.  The
canonical examples are unlocking a mutex or closing a file.
</p>

<pre>
// Contents returns the file's contents as a string.
999
func Contents(filename string) (string, error) {
1000
    f, err := os.Open(filename)
1001 1002 1003 1004 1005 1006 1007 1008 1009
    if err != nil {
        return "", err
    }
    defer f.Close()  // f.Close will run when we're finished.

    var result []byte
    buf := make([]byte, 100)
    for {
        n, err := f.Read(buf[0:])
1010
        result = append(result, buf[0:n]...) // append is discussed later.
1011
        if err != nil {
1012
            if err == io.EOF {
1013 1014 1015 1016 1017 1018 1019 1020 1021 1022
                break
            }
            return "", err  // f will be closed if we return here.
        }
    }
    return string(result), nil // f will be closed if we return here.
}
</pre>

<p>
Rob Pike's avatar
Rob Pike committed
1023
Deferring a call to a function such as <code>Close</code> has two advantages.  First, it
1024 1025 1026 1027 1028 1029 1030
guarantees that you will never forget to close the file, a mistake
that's easy to make if you later edit the function to add a new return
path.  Second, it means that the close sits near the open,
which is much clearer than placing it at the end of the function.
</p>

<p>
1031
The arguments to the deferred function (which include the receiver if
1032 1033 1034 1035 1036 1037 1038 1039
the function is a method) are evaluated when the <i>defer</i>
executes, not when the <i>call</i> executes.  Besides avoiding worries
about variables changing values as the function executes, this means
that a single deferred call site can defer multiple function
executions.  Here's a silly example.
</p>

<pre>
1040
for i := 0; i &lt; 5; i++ {
1041 1042 1043 1044 1045 1046 1047 1048 1049 1050 1051 1052 1053 1054 1055 1056 1057 1058 1059 1060 1061 1062 1063 1064 1065 1066 1067 1068 1069 1070 1071 1072 1073 1074 1075 1076 1077 1078 1079 1080 1081 1082 1083 1084 1085 1086 1087 1088 1089 1090 1091 1092 1093 1094 1095 1096 1097 1098 1099 1100 1101 1102 1103 1104 1105 1106 1107 1108 1109 1110 1111 1112 1113 1114
    defer fmt.Printf("%d ", i)
}
</pre>

<p>
Deferred functions are executed in LIFO order, so this code will cause
<code>4 3 2 1 0</code> to be printed when the function returns.  A
more plausible example is a simple way to trace function execution
through the program.  We could write a couple of simple tracing
routines like this:
</p>

<pre>
func trace(s string)   { fmt.Println("entering:", s) }
func untrace(s string) { fmt.Println("leaving:", s) }

// Use them like this:
func a() {
    trace("a")
    defer untrace("a")
    // do something....
}
</pre>

<p>
We can do better by exploiting the fact that arguments to deferred
functions are evaluated when the <code>defer</code> executes.  The
tracing routine can set up the argument to the untracing routine.
This example:
</p>

<pre>
func trace(s string) string {
    fmt.Println("entering:", s)
    return s
}

func un(s string) {
    fmt.Println("leaving:", s)
}

func a() {
    defer un(trace("a"))
    fmt.Println("in a")
}

func b() {
    defer un(trace("b"))
    fmt.Println("in b")
    a()
}

func main() {
    b()
}
</pre>

<p>
prints
</p>

<pre>
entering: b
in b
entering: a
in a
leaving: a
leaving: b
</pre>

<p>
For programmers accustomed to block-level resource management from
other languages, <code>defer</code> may seem peculiar, but its most
interesting and powerful applications come precisely from the fact
Rob Pike's avatar
Rob Pike committed
1115 1116 1117
that it's not block-based but function-based.  In the section on
<code>panic</code> and <code>recover</code> we'll see another
example of its possibilities.
1118 1119
</p>

1120
<h2 id="data">Data</h2>
Rob Pike's avatar
Rob Pike committed
1121

1122
<h3 id="allocation_new">Allocation with <code>new</code></h3>
Rob Pike's avatar
Rob Pike committed
1123

1124
<p>
1125 1126
Go has two allocation primitives, the built-in functions
<code>new</code> and <code>make</code>.
1127 1128
They do different things and apply to different types, which can be confusing,
but the rules are simple.
1129
Let's talk about <code>new</code> first.
1130 1131
It's a built-in function that allocates memory, but unlike its namesakes
in some other languages it does not <em>initialize</em> the memory,
Rob Pike's avatar
Rob Pike committed
1132
it only <em>zeros</em> it.
1133 1134
That is,
<code>new(T)</code> allocates zeroed storage for a new item of type
1135 1136 1137 1138 1139 1140
<code>T</code> and returns its address, a value of type <code>*T</code>.
In Go terminology, it returns a pointer to a newly allocated zero value of type
<code>T</code>.
</p>

<p>
1141 1142 1143
Since the memory returned by <code>new</code> is zeroed, it's helpful to arrange
when designing your data structures that the
zero value of each type can be used without further initialization.  This means a user of
1144
the data structure can create one with <code>new</code> and get right to
1145 1146 1147 1148 1149 1150 1151 1152 1153 1154
work.
For example, the documentation for <code>bytes.Buffer</code> states that
"the zero value for <code>Buffer</code> is an empty buffer ready to use."
Similarly, <code>sync.Mutex</code> does not
have an explicit constructor or <code>Init</code> method.
Instead, the zero value for a <code>sync.Mutex</code>
is defined to be an unlocked mutex.
</p>

<p>
1155
The zero-value-is-useful property works transitively. Consider this type declaration.
1156 1157 1158 1159
</p>

<pre>
type SyncedBuffer struct {
1160 1161
    lock    sync.Mutex
    buffer  bytes.Buffer
1162 1163 1164 1165 1166
}
</pre>

<p>
Values of type <code>SyncedBuffer</code> are also ready to use immediately upon allocation
Rob Pike's avatar
Rob Pike committed
1167
or just declaration.  In the next snippet, both <code>p</code> and <code>v</code> will work
1168
correctly without further arrangement.
1169 1170 1171
</p>

<pre>
1172 1173
p := new(SyncedBuffer)  // type *SyncedBuffer
var v SyncedBuffer      // type  SyncedBuffer
1174 1175 1176 1177 1178 1179 1180
</pre>

<h3 id="composite_literals">Constructors and composite literals</h3>

<p>
Sometimes the zero value isn't good enough and an initializing
constructor is necessary, as in this example derived from
1181
package <code>os</code>.
1182 1183 1184 1185
</p>

<pre>
func NewFile(fd int, name string) *File {
1186 1187 1188 1189 1190 1191 1192 1193 1194
    if fd &lt; 0 {
        return nil
    }
    f := new(File)
    f.fd = fd
    f.name = name
    f.dirinfo = nil
    f.nepipe = 0
    return f
1195 1196 1197 1198
}
</pre>

<p>
1199
There's a lot of boiler plate in there.  We can simplify it
1200 1201 1202 1203 1204 1205 1206
using a <i>composite literal</i>, which is
an expression that creates a
new instance each time it is evaluated.
</p>

<pre>
func NewFile(fd int, name string) *File {
1207 1208 1209 1210 1211
    if fd &lt; 0 {
        return nil
    }
    f := File{fd, name, nil, 0}
    return &amp;f
1212 1213 1214 1215
}
</pre>

<p>
Rob Pike's avatar
Rob Pike committed
1216
Note that, unlike in C, it's perfectly OK to return the address of a local variable;
1217 1218
the storage associated with the variable survives after the function
returns.
Rob Pike's avatar
slices  
Rob Pike committed
1219 1220
In fact, taking the address of a composite literal
allocates a fresh instance each time it is evaluated,
1221
so we can combine these last two lines.
1222 1223 1224
</p>

<pre>
1225
    return &amp;File{fd, name, nil, 0}
1226 1227 1228 1229 1230 1231 1232 1233 1234 1235
</pre>

<p>
The fields of a composite literal are laid out in order and must all be present.
However, by labeling the elements explicitly as <i>field</i><code>:</code><i>value</i>
pairs, the initializers can appear in any
order, with the missing ones left as their respective zero values.  Thus we could say
</p>

<pre>
1236
    return &amp;File{fd: fd, name: name}
1237 1238 1239 1240
</pre>

<p>
As a limiting case, if a composite literal contains no fields at all, it creates
Russ Cox's avatar
Russ Cox committed
1241
a zero value for the type.  The expressions <code>new(File)</code> and <code>&amp;File{}</code> are equivalent.
1242 1243 1244 1245 1246
</p>

<p>
Composite literals can also be created for arrays, slices, and maps,
with the field labels being indices or map keys as appropriate.
Russ Cox's avatar
Russ Cox committed
1247
In these examples, the initializations work regardless of the values of <code>Enone</code>,
1248
<code>Eio</code>, and <code>Einval</code>, as long as they are distinct.
1249
</p>
Rob Pike's avatar
Rob Pike committed
1250

1251
<pre>
1252 1253 1254
a := [...]string   {Enone: "no error", Eio: "Eio", Einval: "invalid argument"}
s := []string      {Enone: "no error", Eio: "Eio", Einval: "invalid argument"}
m := map[int]string{Enone: "no error", Eio: "Eio", Einval: "invalid argument"}
1255 1256
</pre>

1257
<h3 id="allocation_make">Allocation with <code>make</code></h3>
Rob Pike's avatar
Rob Pike committed
1258 1259

<p>
1260 1261 1262
Back to allocation.
The built-in function <code>make(T, </code><i>args</i><code>)</code> serves
a purpose different from <code>new(T)</code>.
1263 1264 1265
It creates slices, maps, and channels only, and it returns an <em>initialized</em>
(not <em>zeroed</em>)
value of type <code>T</code> (not <code>*T</code>).
1266
The reason for the distinction
1267
is that these three types represent, under the covers, references to data structures that
1268 1269 1270
must be initialized before use.
A slice, for example, is a three-item descriptor
containing a pointer to the data (inside an array), the length, and the
Rob Pike's avatar
Rob Pike committed
1271
capacity, and until those items are initialized, the slice is <code>nil</code>.
1272
For slices, maps, and channels,
1273 1274 1275
<code>make</code> initializes the internal data structure and prepares
the value for use.
For instance,
Rob Pike's avatar
Rob Pike committed
1276 1277 1278
</p>

<pre>
1279
make([]int, 10, 100)
Rob Pike's avatar
Rob Pike committed
1280 1281
</pre>

1282 1283 1284 1285 1286 1287 1288 1289
<p>
allocates an array of 100 ints and then creates a slice
structure with length 10 and a capacity of 100 pointing at the first
10 elements of the array.
(When making a slice, the capacity can be omitted; see the section on slices
for more information.)
In contrast, <code>new([]int)</code> returns a pointer to a newly allocated, zeroed slice
structure, that is, a pointer to a <code>nil</code> slice value.
1290
</p>
1291 1292

<p>
1293 1294
These examples illustrate the difference between <code>new</code> and
<code>make</code>.
1295 1296 1297
</p>

<pre>
1298
var p *[]int = new([]int)       // allocates slice structure; *p == nil; rarely useful
1299
var v  []int = make([]int, 100) // the slice v now refers to a new array of 100 ints
1300 1301

// Unnecessarily complex:
1302 1303
var p *[]int = new([]int)
*p = make([]int, 100, 100)
1304 1305

// Idiomatic:
1306
v := make([]int, 100)
1307 1308 1309
</pre>

<p>
1310
Remember that <code>make</code> applies only to maps, slices and channels
Russ Cox's avatar
Russ Cox committed
1311
and does not return a pointer.
1312 1313
To obtain an explicit pointer allocate with <code>new</code> or take the address
of a variable explicitly.
1314 1315 1316 1317 1318 1319
</p>

<h3 id="arrays">Arrays</h3>

<p>
Arrays are useful when planning the detailed layout of memory and sometimes
Russ Cox's avatar
Russ Cox committed
1320
can help avoid allocation, but primarily
1321 1322 1323 1324 1325 1326
they are a building block for slices, the subject of the next section.
To lay the foundation for that topic, here are a few words about arrays.
</p>

<p>
There are major differences between the ways arrays work in Go and C.
1327
In Go,
1328 1329 1330 1331 1332 1333 1334 1335 1336 1337 1338 1339 1340 1341 1342 1343
</p>
<ul>
<li>
Arrays are values. Assigning one array to another copies all the elements.
</li>
<li>
In particular, if you pass an array to a function, it
will receive a <i>copy</i> of the array, not a pointer to it.
<li>
The size of an array is part of its type.  The types <code>[10]int</code>
and <code>[20]int</code> are distinct.
</li>
</ul>

<p>
The value property can be useful but also expensive; if you want C-like behavior and efficiency,
1344
you can pass a pointer to the array.
1345 1346
</p>

Rob Pike's avatar
Rob Pike committed
1347
<pre>
1348
func Sum(a *[3]float64) (sum float64) {
1349 1350 1351 1352
    for _, v := range *a {
        sum += v
    }
    return
1353 1354
}

1355
array := [...]float64{7.0, 8.5, 9.1}
1356
x := Sum(&amp;array)  // Note the explicit address-of operator
Rob Pike's avatar
Rob Pike committed
1357 1358
</pre>

1359
<p>
1360 1361
But even this style isn't idiomatic Go.
Use slices instead.
1362 1363 1364 1365 1366
</p>

<h3 id="slices">Slices</h3>

<p>
Rob Pike's avatar
slices  
Rob Pike committed
1367 1368 1369 1370 1371 1372
Slices wrap arrays to give a more general, powerful, and convenient
interface to sequences of data.  Except for items with explicit
dimension such as transformation matrices, most array programming in
Go is done with slices rather than simple arrays.
</p>
<p>
1373 1374 1375
Slices hold references to an underlying array, and if you assign one
slice to another, both refer to the same array.
If a function takes a slice argument, changes it makes to
Rob Pike's avatar
slices  
Rob Pike committed
1376 1377
the elements of the slice will be visible to the caller, analogous to
passing a pointer to the underlying array.  A <code>Read</code>
Russ Cox's avatar
Russ Cox committed
1378 1379
function can therefore accept a slice argument rather than a pointer
and a count; the length within the slice sets an upper
Rob Pike's avatar
slices  
Rob Pike committed
1380 1381 1382 1383 1384
limit of how much data to read.  Here is the signature of the
<code>Read</code> method of the <code>File</code> type in package
<code>os</code>:
</p>
<pre>
1385
func (file *File) Read(buf []byte) (n int, err error)
Rob Pike's avatar
slices  
Rob Pike committed
1386 1387 1388
</pre>
<p>
The method returns the number of bytes read and an error value, if
Rob Pike's avatar
Rob Pike committed
1389 1390 1391
any.
To read into the first 32 bytes of a larger buffer
<code>buf</code>, <i>slice</i> (here used as a verb) the buffer.
Rob Pike's avatar
slices  
Rob Pike committed
1392 1393
</p>
<pre>
1394
    n, err := f.Read(buf[0:32])
Rob Pike's avatar
slices  
Rob Pike committed
1395 1396 1397
</pre>
<p>
Such slicing is common and efficient.  In fact, leaving efficiency aside for
Rob Pike's avatar
Rob Pike committed
1398
the moment, the following snippet would also read the first 32 bytes of the buffer.
Rob Pike's avatar
slices  
Rob Pike committed
1399 1400
</p>
<pre>
1401
    var n int
1402
    var err error
1403
    for i := 0; i &lt; 32; i++ {
1404 1405 1406 1407 1408 1409 1410
        nbytes, e := f.Read(buf[i:i+1])  // Read one byte.
        if nbytes == 0 || e != nil {
            err = e
            break
        }
        n += nbytes
    }
Rob Pike's avatar
slices  
Rob Pike committed
1411 1412 1413
</pre>
<p>
The length of a slice may be changed as long as it still fits within
1414
the limits of the underlying array; just assign it to a slice of
Rob Pike's avatar
slices  
Rob Pike committed
1415 1416 1417 1418 1419 1420 1421 1422 1423 1424
itself.  The <i>capacity</i> of a slice, accessible by the built-in
function <code>cap</code>, reports the maximum length the slice may
assume.  Here is a function to append data to a slice.  If the data
exceeds the capacity, the slice is reallocated.  The
resulting slice is returned.  The function uses the fact that
<code>len</code> and <code>cap</code> are legal when applied to the
<code>nil</code> slice, and return 0.
</p>
<pre>
func Append(slice, data[]byte) []byte {
1425
    l := len(slice)
1426
    if l + len(data) &gt; cap(slice) {  // reallocate
1427 1428
        // Allocate double what's needed, for future growth.
        newSlice := make([]byte, (l+len(data))*2)
1429 1430
        // The copy function is predeclared and works for any slice type.
        copy(newSlice, slice)
1431 1432 1433 1434 1435 1436 1437
        slice = newSlice
    }
    slice = slice[0:l+len(data)]
    for i, c := range data {
        slice[l+i] = c
    }
    return slice
Rob Pike's avatar
slices  
Rob Pike committed
1438 1439 1440 1441 1442 1443
}
</pre>
<p>
We must return the slice afterwards because, although <code>Append</code>
can modify the elements of <code>slice</code>, the slice itself (the run-time data
structure holding the pointer, length, and capacity) is passed by value.
1444 1445
</p>

1446 1447 1448 1449 1450
<p>
The idea of appending to a slice is so useful it's captured by the
<code>append</code> built-in function.  To understand that function's
design, though, we need a little more information, so we'll return
to it later.
1451 1452
</p>

1453 1454 1455 1456 1457 1458 1459 1460 1461 1462 1463 1464 1465 1466 1467 1468 1469 1470 1471 1472 1473 1474 1475 1476 1477 1478 1479 1480 1481 1482 1483 1484 1485 1486 1487 1488 1489 1490 1491
<h3 id="two_dimensional_slices">Two-dimensional slices</h3>

<p>
Go's arrays and slices are one-dimensional.
To create the equivalent of a 2D array or slice, it is necessary to define an array-of-arrays
or slice-of-slices, like this:
</p>

<pre>
type Transform [3][3]float64  // A 3x3 array, really an array of arrays.
type LinesOfText [][]byte     // A slice of byte slices.
</pre>

<p>
Because slices are variable-length, it is possible to have each inner
slice be a different length.
That can be a common situation, as in our <code>LinesOfText</code>
example: each line has an independent length.
</p>

<pre>
text := LinesOfText{
	[]byte("Now is the time"),
	[]byte("for all good gophers"),
	[]byte("to bring some fun to the party."),
}
</pre>

<p>
Sometimes it's necessary to allocate a 2D slice, a situation that can arise when
processing scan lines of pixels, for instance.
There are two ways to achieve this.
One is to allocate each slice independently; the other
is to allocate a single array and point the individual slices into it.
Which to use depends on your application.
If the slices might grow or shrink, they should be allocated independently
to avoid overwriting the next line; if not, it can be more efficient to construct
the object with a single allocation.
For reference, here are sketches of the two methods.
Rob Pike's avatar
Rob Pike committed
1492
First, a line at a time:
1493 1494 1495 1496 1497 1498 1499 1500 1501 1502 1503 1504 1505 1506 1507 1508 1509 1510 1511 1512 1513 1514 1515 1516 1517
</p>

<pre>
// Allocate the top-level slice.
picture := make([][]uint8, YSize) // One row per unit of y.
// Loop over the rows, allocating the slice for each row.
for i := range picture {
	picture[i] = make([]uint8, XSize)
}
</pre>

<p>
And now as one allocation, sliced into lines:
</p>

<pre>
// Allocate the top-level slice, the same as before.
picture := make([][]uint8, YSize) // One row per unit of y.
// Allocate one large slice to hold all the pixels.
pixels := make([]uint8, XSize*YSize) // Has type []uint8 even though picture is [][]uint8.
// Loop over the rows, slicing each row from the front of the remaining pixels slice.
for i := range picture {
	picture[i], pixels = pixels[:XSize], pixels[XSize:]
}
</pre>
1518 1519

<h3 id="maps">Maps</h3>
Rob Pike's avatar
Rob Pike committed
1520 1521

<p>
1522 1523 1524
Maps are a convenient and powerful built-in data structure that associate
values of one type (the <em>key</em>) with values of another type
(the <em>element</em> or <em>value</em>)
Russ Cox's avatar
Russ Cox committed
1525 1526
The key can be of any type for which the equality operator is defined,
such as integers,
1527
floating point and complex numbers,
1528
strings, pointers, interfaces (as long as the dynamic type
1529 1530
supports equality), structs and arrays.
Slices cannot be used as map keys,
1531
because equality is not defined on them.
1532 1533
Like slices, maps hold references to an underlying data structure.
If you pass a map to a function
Rob Pike's avatar
Rob Pike committed
1534 1535 1536 1537 1538 1539 1540 1541 1542
that changes the contents of the map, the changes will be visible
in the caller.
</p>
<p>
Maps can be constructed using the usual composite literal syntax
with colon-separated key-value pairs,
so it's easy to build them during initialization.
</p>
<pre>
1543
var timeZone = map[string]int{
1544 1545 1546 1547 1548
    "UTC":  0*60*60,
    "EST": -5*60*60,
    "CST": -6*60*60,
    "MST": -7*60*60,
    "PST": -8*60*60,
Rob Pike's avatar
Rob Pike committed
1549 1550 1551 1552
}
</pre>
<p>
Assigning and fetching map values looks syntactically just like
1553 1554
doing the same for arrays and slices except that the index doesn't
need to be an integer.
1555 1556 1557 1558 1559 1560 1561 1562 1563 1564
</p>
<pre>
offset := timeZone["EST"]
</pre>
<p>
An attempt to fetch a map value with a key that
is not present in the map will return the zero value for the type
of the entries
in the map.  For instance, if the map contains integers, looking
up a non-existent key will return <code>0</code>.
Rob Pike's avatar
Rob Pike committed
1565 1566 1567
A set can be implemented as a map with value type <code>bool</code>.
Set the map entry to <code>true</code> to put the value in the set, and then
test it by simple indexing.
1568
</p>
Rob Pike's avatar
Rob Pike committed
1569
<pre>
1570
attended := map[string]bool{
Rob Pike's avatar
Rob Pike committed
1571 1572 1573 1574 1575 1576 1577 1578 1579
    "Ann": true,
    "Joe": true,
    ...
}

if attended[person] { // will be false if person is not in the map
    fmt.Println(person, "was at the meeting")
}
</pre>
1580 1581 1582
<p>
Sometimes you need to distinguish a missing entry from
a zero value.  Is there an entry for <code>"UTC"</code>
1583
or is that the empty string because it's not in the map at all?
1584
You can discriminate with a form of multiple assignment.
Rob Pike's avatar
Rob Pike committed
1585 1586
</p>
<pre>
1587 1588
var seconds int
var ok bool
Rob Pike's avatar
Rob Pike committed
1589 1590 1591 1592 1593 1594 1595 1596
seconds, ok = timeZone[tz]
</pre>
<p>
For obvious reasons this is called the &ldquo;comma ok&rdquo; idiom.
In this example, if <code>tz</code> is present, <code>seconds</code>
will be set appropriately and <code>ok</code> will be true; if not,
<code>seconds</code> will be set to zero and <code>ok</code> will
be false.
1597
Here's a function that puts it together with a nice error report:
Rob Pike's avatar
Rob Pike committed
1598 1599 1600
</p>
<pre>
func offset(tz string) int {
1601 1602 1603
    if seconds, ok := timeZone[tz]; ok {
        return seconds
    }
Rob Pike's avatar
Rob Pike committed
1604
    log.Println("unknown time zone:", tz)
1605
    return 0
Rob Pike's avatar
Rob Pike committed
1606 1607 1608 1609
}
</pre>
<p>
To test for presence in the map without worrying about the actual value,
1610 1611
you can use the <a href="#blank">blank identifier</a> (<code>_</code>)
in place of the usual variable for the value.
Rob Pike's avatar
Rob Pike committed
1612 1613
</p>
<pre>
1614
_, present := timeZone[tz]
Rob Pike's avatar
Rob Pike committed
1615 1616
</pre>
<p>
1617 1618
To delete a map entry, use the <code>delete</code>
built-in function, whose arguments are the map and the key to be deleted.
1619
It's safe to do this even if the key is already absent
Rob Pike's avatar
Rob Pike committed
1620 1621 1622
from the map.
</p>
<pre>
1623
delete(timeZone, "PDT")  // Now on Standard Time
Rob Pike's avatar
Rob Pike committed
1624
</pre>
1625

1626 1627
<h3 id="printing">Printing</h3>

Rob Pike's avatar
Rob Pike committed
1628 1629 1630 1631 1632 1633 1634 1635 1636
<p>
Formatted printing in Go uses a style similar to C's <code>printf</code>
family but is richer and more general. The functions live in the <code>fmt</code>
package and have capitalized names: <code>fmt.Printf</code>, <code>fmt.Fprintf</code>,
<code>fmt.Sprintf</code> and so on.  The string functions (<code>Sprintf</code> etc.)
return a string rather than filling in a provided buffer.
</p>
<p>
You don't need to provide a format string.  For each of <code>Printf</code>,
Ian Lance Taylor's avatar
Ian Lance Taylor committed
1637
<code>Fprintf</code> and <code>Sprintf</code> there is another pair
Rob Pike's avatar
Rob Pike committed
1638 1639
of functions, for instance <code>Print</code> and <code>Println</code>.
These functions do not take a format string but instead generate a default
1640 1641 1642
format for each argument. The <code>Println</code> versions also insert a blank
between arguments and append a newline to the output while
the <code>Print</code> versions add blanks only if the operand on neither side is a string.
Rob Pike's avatar
Rob Pike committed
1643 1644 1645
In this example each line produces the same output.
</p>
<pre>
1646 1647
fmt.Printf("Hello %d\n", 23)
fmt.Fprint(os.Stdout, "Hello ", 23, "\n")
1648
fmt.Println("Hello", 23)
1649
fmt.Println(fmt.Sprint("Hello ", 23))
Rob Pike's avatar
Rob Pike committed
1650 1651
</pre>
<p>
1652
The formatted print functions <code>fmt.Fprint</code>
Ian Lance Taylor's avatar
Ian Lance Taylor committed
1653
and friends take as a first argument any object
Rob Pike's avatar
Rob Pike committed
1654 1655 1656 1657 1658 1659 1660 1661 1662
that implements the <code>io.Writer</code> interface; the variables <code>os.Stdout</code>
and <code>os.Stderr</code> are familiar instances.
</p>
<p>
Here things start to diverge from C.  First, the numeric formats such as <code>%d</code>
do not take flags for signedness or size; instead, the printing routines use the
type of the argument to decide these properties.
</p>
<pre>
1663
var x uint64 = 1&lt;&lt;64 - 1
1664
fmt.Printf("%d %x; %d %x\n", x, x, int64(x), int64(x))
Rob Pike's avatar
Rob Pike committed
1665 1666 1667 1668 1669 1670 1671 1672 1673 1674 1675
</pre>
<p>
prints
</p>
<pre>
18446744073709551615 ffffffffffffffff; -1 -1
</pre>
<p>
If you just want the default conversion, such as decimal for integers, you can use
the catchall format <code>%v</code> (for &ldquo;value&rdquo;); the result is exactly
what <code>Print</code> and <code>Println</code> would produce.
1676
Moreover, that format can print <em>any</em> value, even arrays, slices, structs, and
Rob Pike's avatar
Rob Pike committed
1677 1678 1679
maps.  Here is a print statement for the time zone map defined in the previous section.
</p>
<pre>
1680
fmt.Printf("%v\n", timeZone)  // or just fmt.Println(timeZone)
Rob Pike's avatar
Rob Pike committed
1681 1682 1683 1684 1685 1686 1687 1688 1689 1690 1691 1692 1693 1694 1695
</pre>
<p>
which gives output
</p>
<pre>
map[CST:-21600 PST:-28800 EST:-18000 UTC:0 MST:-25200]
</pre>
<p>
For maps the keys may be output in any order, of course.
When printing a struct, the modified format <code>%+v</code> annotates the
fields of the structure with their names, and for any value the alternate
format <code>%#v</code> prints the value in full Go syntax.
</p>
<pre>
type T struct {
1696
    a int
1697
    b float64
1698
    c string
Rob Pike's avatar
Rob Pike committed
1699
}
1700 1701 1702 1703 1704
t := &amp;T{ 7, -2.35, "abc\tdef" }
fmt.Printf("%v\n", t)
fmt.Printf("%+v\n", t)
fmt.Printf("%#v\n", t)
fmt.Printf("%#v\n", timeZone)
Rob Pike's avatar
Rob Pike committed
1705 1706 1707 1708 1709 1710 1711 1712 1713 1714 1715 1716 1717
</pre>
<p>
prints
</p>
<pre>
&amp;{7 -2.35 abc   def}
&amp;{a:7 b:-2.35 c:abc     def}
&amp;main.T{a:7, b:-2.35, c:"abc\tdef"}
map[string] int{"CST":-21600, "PST":-28800, "EST":-18000, "UTC":0, "MST":-25200}
</pre>
<p>
(Note the ampersands.)
That quoted string format is also available through <code>%q</code> when
1718 1719 1720 1721
applied to a value of type <code>string</code> or <code>[]byte</code>.
The alternate format <code>%#q</code> will use backquotes instead if possible.
(The <code>%q</code> format also applies to integers and runes, producing a
single-quoted rune constant.)
1722 1723
Also, <code>%x</code> works on strings, byte arrays and byte slices as well as
on integers, generating a long hexadecimal string, and with
Rob Pike's avatar
Rob Pike committed
1724 1725 1726 1727
a space in the format (<code>%&nbsp;x</code>) it puts spaces between the bytes.
</p>
<p>
Another handy format is <code>%T</code>, which prints the <em>type</em> of a value.
1728
</p>
Rob Pike's avatar
Rob Pike committed
1729
<pre>
1730
fmt.Printf(&quot;%T\n&quot;, timeZone)
Rob Pike's avatar
Rob Pike committed
1731 1732 1733 1734 1735 1736 1737 1738 1739
</pre>
<p>
prints
</p>
<pre>
map[string] int
</pre>
<p>
If you want to control the default format for a custom type, all that's required is to define
1740
a method with the signature <code>String() string</code> on the type.
1741
For our simple type <code>T</code>, that might look like this.
Rob Pike's avatar
Rob Pike committed
1742 1743 1744
</p>
<pre>
func (t *T) String() string {
1745
    return fmt.Sprintf("%d/%g/%q", t.a, t.b, t.c)
Rob Pike's avatar
Rob Pike committed
1746
}
1747
fmt.Printf("%v\n", t)
Rob Pike's avatar
Rob Pike committed
1748 1749 1750 1751 1752 1753 1754 1755
</pre>
<p>
to print in the format
</p>
<pre>
7/-2.35/"abc\tdef"
</pre>
<p>
1756 1757 1758 1759 1760
(If you need to print <em>values</em> of type <code>T</code> as well as pointers to <code>T</code>,
the receiver for <code>String</code> must be of value type; this example used a pointer because
that's more efficient and idiomatic for struct types.
See the section below on <a href="#pointers_vs_values">pointers vs. value receivers</a> for more information.)
</p>
1761

1762
<p>
1763
Our <code>String</code> method is able to call <code>Sprintf</code> because the
1764 1765 1766 1767 1768 1769 1770 1771 1772 1773 1774 1775 1776 1777 1778 1779 1780 1781 1782 1783 1784 1785 1786 1787 1788 1789 1790 1791 1792 1793 1794 1795 1796 1797 1798 1799
print routines are fully reentrant and can be wrapped this way.
There is one important detail to understand about this approach,
however: don't construct a <code>String</code> method by calling
<code>Sprintf</code> in a way that will recur into your <code>String</code>
method indefinitely.  This can happen if the <code>Sprintf</code>
call attempts to print the receiver directly as a string, which in
turn will invoke the method again.  It's a common and easy mistake
to make, as this example shows.
</p>

<pre>
type MyString string

func (m MyString) String() string {
    return fmt.Sprintf("MyString=%s", m) // Error: will recur forever.
}
</pre>

<p>
It's also easy to fix: convert the argument to the basic string type, which does not have the
method.
</p>

<pre>
type MyString string
func (m MyString) String() string {
    return fmt.Sprintf("MyString=%s", string(m)) // OK: note conversion.
}
</pre>

<p>
In the <a href="#initialization">initialization section</a> we'll see another technique that avoids this recursion.
</p>

<p>
Another printing technique is to pass a print routine's arguments directly to another such routine.
1800 1801 1802
The signature of <code>Printf</code> uses the type <code>...interface{}</code>
for its final argument to specify that an arbitrary number of parameters (of arbitrary type)
can appear after the format.
Rob Pike's avatar
Rob Pike committed
1803 1804
</p>
<pre>
1805
func Printf(format string, v ...interface{}) (n int, err error) {
Rob Pike's avatar
Rob Pike committed
1806 1807
</pre>
<p>
1808 1809 1810 1811
Within the function <code>Printf</code>, <code>v</code> acts like a variable of type
<code>[]interface{}</code> but if it is passed to another variadic function, it acts like
a regular list of arguments.
Here is the implementation of the
1812
function <code>log.Println</code> we used above. It passes its arguments directly to
Rob Pike's avatar
Rob Pike committed
1813 1814 1815
<code>fmt.Sprintln</code> for the actual formatting.
</p>
<pre>
1816 1817 1818
// Println prints to the standard logger in the manner of fmt.Println.
func Println(v ...interface{}) {
    std.Output(2, fmt.Sprintln(v...))  // Output takes parameters (int, string)
Rob Pike's avatar
Rob Pike committed
1819 1820 1821
}
</pre>
<p>
1822
We write <code>...</code> after <code>v</code> in the nested call to <code>Sprintln</code> to tell the
1823 1824
compiler to treat <code>v</code> as a list of arguments; otherwise it would just pass
<code>v</code> as a single slice argument.
1825
</p>
1826
<p>
Rob Pike's avatar
Rob Pike committed
1827 1828 1829
There's even more to printing than we've covered here.  See the <code>godoc</code> documentation
for package <code>fmt</code> for the details.
</p>
1830 1831 1832 1833 1834 1835 1836 1837
<p>
By the way, a <code>...</code> parameter can be of a specific type, for instance <code>...int</code>
for a min function that chooses the least of a list of integers:
</p>
<pre>
func Min(a ...int) int {
    min := int(^uint(0) >> 1)  // largest int
    for _, i := range a {
1838
        if i &lt; min {
1839 1840
            min = i
        }
1841 1842 1843 1844
    }
    return min
}
</pre>
Rob Pike's avatar
Rob Pike committed
1845

1846 1847 1848 1849 1850 1851
<h3 id="append">Append</h3>
<p>
Now we have the missing piece we needed to explain the design of
the <code>append</code> built-in function.  The signature of <code>append</code>
is different from our custom <code>Append</code> function above.
Schematically, it's like this:
1852
</p>
1853
<pre>
1854
func append(slice []<i>T</i>, elements ...<i>T</i>) []<i>T</i>
1855
</pre>
1856
<p>
1857 1858 1859 1860 1861
where <i>T</i> is a placeholder for any given type.  You can't
actually write a function in Go where the type <code>T</code>
is determined by the caller.
That's why <code>append</code> is built in: it needs support from the
compiler.
1862
</p>
1863 1864 1865 1866 1867
<p>
What <code>append</code> does is append the elements to the end of
the slice and return the result.  The result needs to be returned
because, as with our hand-written <code>Append</code>, the underlying
array may change.  This simple example
1868
</p>
1869 1870 1871 1872 1873
<pre>
x := []int{1,2,3}
x = append(x, 4, 5, 6)
fmt.Println(x)
</pre>
1874
<p>
1875 1876 1877
prints <code>[1 2 3 4 5 6]</code>.  So <code>append</code> works a
little like <code>Printf</code>, collecting an arbitrary number of
arguments.
1878
</p>
1879 1880 1881 1882 1883
<p>
But what if we wanted to do what our <code>Append</code> does and
append a slice to a slice?  Easy: use <code>...</code> at the call
site, just as we did in the call to <code>Output</code> above.  This
snippet produces identical output to the one above.
1884
</p>
1885 1886 1887 1888 1889 1890
<pre>
x := []int{1,2,3}
y := []int{4,5,6}
x = append(x, y...)
fmt.Println(x)
</pre>
1891
<p>
1892 1893
Without that <code>...</code>, it wouldn't compile because the types
would be wrong; <code>y</code> is not of type <code>int</code>.
1894
</p>
1895

Rob Pike's avatar
Rob Pike committed
1896 1897 1898 1899 1900 1901
<h2 id="initialization">Initialization</h2>

<p>
Although it doesn't look superficially very different from
initialization in C or C++, initialization in Go is more powerful.
Complex structures can be built during initialization and the ordering
1902
issues among initialized objects, even among different packages, are handled
Rob Pike's avatar
Rob Pike committed
1903 1904 1905 1906 1907 1908 1909 1910 1911
correctly.
</p>

<h3 id="constants">Constants</h3>

<p>
Constants in Go are just that&mdash;constant.
They are created at compile time, even when defined as
locals in functions,
1912
and can only be numbers, characters (runes), strings or booleans.
Rob Pike's avatar
Rob Pike committed
1913 1914 1915 1916 1917 1918 1919 1920 1921 1922 1923 1924 1925 1926
Because of the compile-time restriction, the expressions
that define them must be constant expressions,
evaluatable by the compiler.  For instance,
<code>1&lt;&lt;3</code> is a constant expression, while
<code>math.Sin(math.Pi/4)</code> is not because
the function call to <code>math.Sin</code> needs
to happen at run time.
</p>

<p>
In Go, enumerated constants are created using the <code>iota</code>
enumerator.  Since <code>iota</code> can be part of an expression and
expressions can be implicitly repeated, it is easy to build intricate
sets of values.
Rob Pike's avatar
Rob Pike committed
1927
</p>
1928
{{code "/doc/progs/eff_bytesize.go" `/^type ByteSize/` `/^\)/`}}
Rob Pike's avatar
Rob Pike committed
1929
<p>
1930 1931 1932 1933 1934
The ability to attach a method such as <code>String</code> to any
user-defined type makes it possible for arbitrary values to format themselves
automatically for printing.
Although you'll see it most often applied to structs, this technique is also useful for
scalar types such as floating-point types like <code>ByteSize</code>.
Rob Pike's avatar
Rob Pike committed
1935
</p>
1936
{{code "/doc/progs/eff_bytesize.go" `/^func.*ByteSize.*String/` `/^}/`}}
Rob Pike's avatar
Rob Pike committed
1937 1938
<p>
The expression <code>YB</code> prints as <code>1.00YB</code>,
1939
while <code>ByteSize(1e13)</code> prints as <code>9.09TB</code>.
Rob Pike's avatar
Rob Pike committed
1940 1941
</p>

Rob Pike's avatar
Rob Pike committed
1942
<p>
1943 1944 1945 1946 1947 1948 1949
The use here of <code>Sprintf</code>
to implement <code>ByteSize</code>'s <code>String</code> method is safe
(avoids recurring indefinitely) not because of a conversion but
because it calls <code>Sprintf</code> with <code>%f</code>,
which is not a string format: <code>Sprintf</code> will only call
the <code>String</code> method when it wants a string, and <code>%f</code>
wants a floating-point value.
Rob Pike's avatar
Rob Pike committed
1950 1951
</p>

Rob Pike's avatar
Rob Pike committed
1952 1953 1954 1955 1956 1957 1958 1959
<h3 id="variables">Variables</h3>

<p>
Variables can be initialized just like constants but the
initializer can be a general expression computed at run time.
</p>
<pre>
var (
1960 1961
    home   = os.Getenv("HOME")
    user   = os.Getenv("USER")
1962
    gopath = os.Getenv("GOPATH")
Rob Pike's avatar
Rob Pike committed
1963 1964 1965 1966 1967 1968
)
</pre>

<h3 id="init">The init function</h3>

<p>
1969 1970
Finally, each source file can define its own niladic <code>init</code> function to
set up whatever state is required.  (Actually each file can have multiple
1971
<code>init</code> functions.)
1972
And finally means finally: <code>init</code> is called after all the
Rob Pike's avatar
Rob Pike committed
1973 1974 1975 1976 1977 1978
variable declarations in the package have evaluated their initializers,
and those are evaluated only after all the imported packages have been
initialized.
</p>
<p>
Besides initializations that cannot be expressed as declarations,
1979
a common use of <code>init</code> functions is to verify or repair
Rob Pike's avatar
Rob Pike committed
1980 1981 1982 1983 1984
correctness of the program state before real execution begins.
</p>

<pre>
func init() {
1985
    if user == "" {
Rob Pike's avatar
Rob Pike committed
1986
        log.Fatal("$USER not set")
1987
    }
1988 1989
    if home == "" {
        home = "/home/" + user
1990
    }
1991 1992
    if gopath == "" {
        gopath = home + "/go"
1993
    }
1994 1995
    // gopath may be overridden by --gopath flag on command line.
    flag.StringVar(&amp;gopath, "gopath", gopath, "override default GOPATH")
Rob Pike's avatar
Rob Pike committed
1996 1997 1998 1999
}
</pre>

<h2 id="methods">Methods</h2>
Rob Pike's avatar
slices  
Rob Pike committed
2000

2001
<h3 id="pointers_vs_values">Pointers vs. Values</h3>
Rob Pike's avatar
slices  
Rob Pike committed
2002
<p>
2003 2004
As we saw with <code>ByteSize</code>,
methods can be defined for any named type (except a pointer or an interface);
Rob Pike's avatar
slices  
Rob Pike committed
2005
the receiver does not have to be a struct.
2006
</p>
Rob Pike's avatar
slices  
Rob Pike committed
2007 2008 2009 2010 2011 2012 2013 2014
<p>
In the discussion of slices above, we wrote an <code>Append</code>
function.  We can define it as a method on slices instead.  To do
this, we first declare a named type to which we can bind the method, and
then make the receiver for the method a value of that type.
</p>
<pre>
type ByteSlice []byte
2015

Rob Pike's avatar
Rob Pike committed
2016
func (slice ByteSlice) Append(data []byte) []byte {
2017
    // Body exactly the same as above
Rob Pike's avatar
slices  
Rob Pike committed
2018 2019 2020 2021 2022 2023 2024 2025 2026 2027
}
</pre>
<p>
This still requires the method to return the updated slice.  We can
eliminate that clumsiness by redefining the method to take a
<i>pointer</i> to a <code>ByteSlice</code> as its receiver, so the
method can overwrite the caller's slice.
</p>
<pre>
func (p *ByteSlice) Append(data []byte) {
2028 2029 2030
    slice := *p
    // Body as above, without the return.
    *p = slice
Rob Pike's avatar
slices  
Rob Pike committed
2031 2032 2033 2034 2035 2036 2037
}
</pre>
<p>
In fact, we can do even better.  If we modify our function so it looks
like a standard <code>Write</code> method, like this,
</p>
<pre>
2038
func (p *ByteSlice) Write(data []byte) (n int, err error) {
2039 2040 2041 2042
    slice := *p
    // Again as above.
    *p = slice
    return len(data), nil
Rob Pike's avatar
slices  
Rob Pike committed
2043 2044 2045 2046 2047
}
</pre>
<p>
then the type <code>*ByteSlice</code> satisfies the standard interface
<code>io.Writer</code>, which is handy.  For instance, we can
2048
print into one.
Rob Pike's avatar
slices  
Rob Pike committed
2049 2050
</p>
<pre>
2051 2052
    var b ByteSlice
    fmt.Fprintf(&amp;b, "This hour has %d days\n", 7)
Rob Pike's avatar
slices  
Rob Pike committed
2053 2054
</pre>
<p>
2055
We pass the address of a <code>ByteSlice</code>
Rob Pike's avatar
slices  
Rob Pike committed
2056 2057 2058 2059 2060 2061 2062 2063 2064
because only <code>*ByteSlice</code> satisfies <code>io.Writer</code>.
The rule about pointers vs. values for receivers is that value methods
can be invoked on pointers and values, but pointer methods can only be
invoked on pointers.  This is because pointer methods can modify the
receiver; invoking them on a copy of the value would cause those
modifications to be discarded.
</p>
<p>
By the way, the idea of using <code>Write</code> on a slice of bytes
2065
is central to the implementation of <code>bytes.Buffer</code>.
Rob Pike's avatar
slices  
Rob Pike committed
2066
</p>
2067

Rob Pike's avatar
Rob Pike committed
2068
<h2 id="interfaces_and_types">Interfaces and other types</h2>
2069

2070 2071 2072 2073 2074 2075 2076 2077
<h3 id="interfaces">Interfaces</h3>
<p>
Interfaces in Go provide a way to specify the behavior of an
object: if something can do <em>this</em>, then it can be used
<em>here</em>.  We've seen a couple of simple examples already;
custom printers can be implemented by a <code>String</code> method
while <code>Fprintf</code> can generate output to anything
with a <code>Write</code> method.
Rob Pike's avatar
Rob Pike committed
2078
Interfaces with only one or two methods are common in Go code, and are
2079 2080 2081 2082 2083 2084 2085 2086
usually given a name derived from the method, such as <code>io.Writer</code>
for something that implements <code>Write</code>.
</p>
<p>
A type can implement multiple interfaces.
For instance, a collection can be sorted
by the routines in package <code>sort</code> if it implements
<code>sort.Interface</code>, which contains <code>Len()</code>,
Russ Cox's avatar
Russ Cox committed
2087
<code>Less(i, j int) bool</code>, and <code>Swap(i, j int)</code>,
2088 2089 2090
and it could also have a custom formatter.
In this contrived example <code>Sequence</code> satisfies both.
</p>
2091
{{code "/doc/progs/eff_sequence.go" `/^type/` "$"}}
2092

2093
<h3 id="conversions">Conversions</h3>
2094

Rob Pike's avatar
Rob Pike committed
2095
<p>
2096 2097 2098 2099 2100 2101 2102
The <code>String</code> method of <code>Sequence</code> is recreating the
work that <code>Sprint</code> already does for slices.  We can share the
effort if we convert the <code>Sequence</code> to a plain
<code>[]int</code> before calling <code>Sprint</code>.
</p>
<pre>
func (s Sequence) String() string {
2103 2104
    sort.Sort(s)
    return fmt.Sprint([]int(s))
2105 2106 2107
}
</pre>
<p>
2108 2109
This method is another example of the conversion technique for calling
<code>Sprintf</code> safely from a <code>String</code> method.
2110 2111 2112 2113
Because the two types (<code>Sequence</code> and <code>[]int</code>)
are the same if we ignore the type name, it's legal to convert between them.
The conversion doesn't create a new value, it just temporarily acts
as though the existing value has a new type.
2114
(There are other legal conversions, such as from integer to floating point, that
2115 2116 2117
do create a new value.)
</p>
<p>
Rob Pike's avatar
Rob Pike committed
2118
It's an idiom in Go programs to convert the
2119 2120
type of an expression to access a different
set of methods. As an example, we could use the existing
Rob Pike's avatar
Rob Pike committed
2121
type <code>sort.IntSlice</code> to reduce the entire example
2122
to this:
Rob Pike's avatar
Rob Pike committed
2123
</p>
2124 2125
<pre>
type Sequence []int
Rob Pike's avatar
Rob Pike committed
2126

2127 2128
// Method for printing - sorts the elements before printing
func (s Sequence) String() string {
Rob Pike's avatar
Rob Pike committed
2129
    sort.IntSlice(s).Sort()
2130
    return fmt.Sprint([]int(s))
2131 2132
}
</pre>
Rob Pike's avatar
Rob Pike committed
2133
<p>
2134 2135
Now, instead of having <code>Sequence</code> implement multiple
interfaces (sorting and printing), we're using the ability of a data item to be
Rob Pike's avatar
Rob Pike committed
2136
converted to multiple types (<code>Sequence</code>, <code>sort.IntSlice</code>
2137 2138 2139 2140
and <code>[]int</code>), each of which does some part of the job.
That's more unusual in practice but can be effective.
</p>

2141 2142 2143 2144 2145 2146 2147 2148 2149 2150 2151 2152 2153 2154 2155 2156 2157 2158 2159 2160 2161 2162 2163 2164 2165 2166 2167 2168 2169 2170 2171 2172 2173 2174 2175 2176 2177
<h3 id="interface_conversions">Interface conversions and type assertions</h3>

<p>
<a href="#type_switch">Type switches</a> are a form of conversion: they take an interface and, for
each case in the switch, in a sense convert it to the type of that case.
Here's a simplified version of how the code under <code>fmt.Printf</code> turns a value into
a string using a type switch.
If it's already a string, we want the actual string value held by the interface, while if it has a
<code>String</code> method we want the result of calling the method.
</p>

<pre>
type Stringer interface {
    String() string
}

var value interface{} // Value provided by caller.
switch str := value.(type) {
case string:
    return str
case Stringer:
    return str.String()
}
</pre>

<p>
The first case finds a concrete value; the second converts the interface into another interface.
It's perfectly fine to mix types this way.
</p>

<p>
What if there's only one type we care about? If we know the value holds a <code>string</code>
and we just want to extract it?
A one-case type switch would do, but so would a <em>type assertion</em>.
A type assertion takes an interface value and extracts from it a value of the specified explicit type.
The syntax borrows from the clause opening a type switch, but with an explicit
type rather than the <code>type</code> keyword:
2178
</p>
2179 2180 2181 2182 2183 2184 2185 2186 2187 2188 2189 2190 2191 2192 2193 2194 2195 2196 2197 2198 2199 2200 2201 2202 2203 2204 2205 2206 2207 2208 2209 2210 2211 2212 2213 2214 2215 2216 2217 2218 2219 2220 2221 2222 2223 2224 2225 2226

<pre>
value.(typeName)
</pre>

<p>
and the result is a new value with the static type <code>typeName</code>.
That type must either be the concrete type held by the interface, or a second interface
type that the value can be converted to.
To extract the string we know is in the value, we could write:
</p>

<pre>
str := value.(string)
</pre>

<p>
But if it turns out that the value does not contain a string, the program will crash with a run-time error.
To guard against that, use the "comma, ok" idiom to test, safely, whether the value is a string:
</p>

<pre>
str, ok := value.(string)
if ok {
    fmt.Printf("string value is: %q\n", str)
} else {
    fmt.Printf("value is not a string\n")
}
</pre>

<p>
If the type assertion fails, <code>str</code> will still exist and be of type string, but it will have
the zero value, an empty string.
</p>

<p>
As an illustration of the capability, here's an <code>if</code>-<code>else</code>
statement that's equivalent to the type switch that opened this section.
</p>

<pre>
if str, ok := value.(string); ok {
    return str
} else if str, ok := value.(Stringer); ok {
    return str.String()
}
</pre>

2227 2228 2229 2230
<h3 id="generality">Generality</h3>
<p>
If a type exists only to implement an interface
and has no exported methods beyond that interface,
Rob Pike's avatar
Rob Pike committed
2231 2232 2233 2234 2235
there is no need to export the type itself.
Exporting just the interface makes it clear that
it's the behavior that matters, not the implementation,
and that other implementations with different properties
can mirror the behavior of the original type.
2236 2237 2238 2239 2240 2241 2242
It also avoids the need to repeat the documentation
on every instance of a common method.
</p>
<p>
In such cases, the constructor should return an interface value
rather than the implementing type.
As an example, in the hash libraries
2243
both <code>crc32.NewIEEE</code> and <code>adler32.New</code>
2244
return the interface type <code>hash.Hash32</code>.
Rob Pike's avatar
Rob Pike committed
2245
Substituting the CRC-32 algorithm for Adler-32 in a Go program
2246
requires only changing the constructor call;
Rob Pike's avatar
Rob Pike committed
2247 2248
the rest of the code is unaffected by the change of algorithm.
</p>
2249 2250
<p>
A similar approach allows the streaming cipher algorithms
2251
in the various <code>crypto</code> packages to be
2252
separated from the block ciphers they chain together.
2253
The <code>Block</code> interface
2254
in the <code>crypto/cipher</code> package specifies the
2255 2256 2257 2258 2259 2260 2261
behavior of a block cipher, which provides encryption
of a single block of data.
Then, by analogy with the <code>bufio</code> package,
cipher packages that implement this interface
can be used to construct streaming ciphers, represented
by the <code>Stream</code> interface, without
knowing the details of the block encryption.
2262 2263
</p>
<p>
2264
The  <code>crypto/cipher</code> interfaces look like this:
2265 2266
</p>
<pre>
2267
type Block interface {
2268 2269 2270
    BlockSize() int
    Encrypt(src, dst []byte)
    Decrypt(src, dst []byte)
2271
}
Rob Pike's avatar
Rob Pike committed
2272

2273 2274 2275 2276 2277 2278 2279 2280 2281 2282
type Stream interface {
    XORKeyStream(dst, src []byte)
}
</pre>

<p>
Here's the definition of the counter mode (CTR) stream,
which turns a block cipher into a streaming cipher; notice
that the block cipher's details are abstracted away:
</p>
Rob Pike's avatar
Rob Pike committed
2283

2284 2285 2286 2287
<pre>
// NewCTR returns a Stream that encrypts/decrypts using the given Block in
// counter mode. The length of iv must be the same as the Block's block size.
func NewCTR(block Block, iv []byte) Stream
2288 2289
</pre>
<p>
2290
<code>NewCTR</code> applies not
2291
just to one specific encryption algorithm and data source but to any
2292 2293 2294 2295
implementation of the <code>Block</code> interface and any
<code>Stream</code>.  Because they return
interface values, replacing CTR
encryption with other encryption modes is a localized change.  The constructor
Russ Cox's avatar
Russ Cox committed
2296
calls must be edited, but because the surrounding code must treat the result only
2297
as a <code>Stream</code>, it won't notice the difference.
2298
</p>
Russ Cox's avatar
Russ Cox committed
2299

Rob Pike's avatar
Rob Pike committed
2300 2301 2302 2303 2304 2305 2306 2307 2308
<h3 id="interface_methods">Interfaces and methods</h3>
<p>
Since almost anything can have methods attached, almost anything can
satisfy an interface.  One illustrative example is in the <code>http</code>
package, which defines the <code>Handler</code> interface.  Any object
that implements <code>Handler</code> can serve HTTP requests.
</p>
<pre>
type Handler interface {
2309
    ServeHTTP(ResponseWriter, *Request)
Rob Pike's avatar
Rob Pike committed
2310 2311 2312
}
</pre>
<p>
2313 2314 2315 2316 2317 2318 2319
<code>ResponseWriter</code> is itself an interface that provides access
to the methods needed to return the response to the client.
Those methods include the standard <code>Write</code> method, so an
<code>http.ResponseWriter</code> can be used wherever an <code>io.Writer</code>
can be used.
<code>Request</code> is a struct containing a parsed representation
of the request from the client.
2320
</p>
2321
<p>
Rob Pike's avatar
Rob Pike committed
2322 2323
For brevity, let's ignore POSTs and assume HTTP requests are always
GETs; that simplification does not affect the way the handlers are
Rob Pike's avatar
Rob Pike committed
2324
set up.  Here's a trivial but complete implementation of a handler to
Rob Pike's avatar
Rob Pike committed
2325 2326 2327 2328 2329 2330
count the number of times the
page is visited.
</p>
<pre>
// Simple counter server.
type Counter struct {
2331
    n int
Rob Pike's avatar
Rob Pike committed
2332 2333
}

2334
func (ctr *Counter) ServeHTTP(w http.ResponseWriter, req *http.Request) {
2335
    ctr.n++
2336
    fmt.Fprintf(w, "counter = %d\n", ctr.n)
Rob Pike's avatar
Rob Pike committed
2337 2338 2339
}
</pre>
<p>
2340 2341
(Keeping with our theme, note how <code>Fprintf</code> can print to an
<code>http.ResponseWriter</code>.)
Rob Pike's avatar
Rob Pike committed
2342
For reference, here's how to attach such a server to a node on the URL tree.
2343
</p>
Rob Pike's avatar
Rob Pike committed
2344
<pre>
2345
import "net/http"
Rob Pike's avatar
Rob Pike committed
2346
...
2347 2348
ctr := new(Counter)
http.Handle("/counter", ctr)
Rob Pike's avatar
Rob Pike committed
2349 2350 2351 2352 2353 2354 2355 2356 2357
</pre>
<p>
But why make <code>Counter</code> a struct?  An integer is all that's needed.
(The receiver needs to be a pointer so the increment is visible to the caller.)
</p>
<pre>
// Simpler counter server.
type Counter int

2358
func (ctr *Counter) ServeHTTP(w http.ResponseWriter, req *http.Request) {
2359
    *ctr++
2360
    fmt.Fprintf(w, "counter = %d\n", *ctr)
Rob Pike's avatar
Rob Pike committed
2361 2362 2363 2364 2365 2366 2367 2368 2369
}
</pre>
<p>
What if your program has some internal state that needs to be notified that a page
has been visited?  Tie a channel to the web page.
</p>
<pre>
// A channel that sends a notification on each visit.
// (Probably want the channel to be buffered.)
Rob Pike's avatar
Rob Pike committed
2370
type Chan chan *http.Request
Rob Pike's avatar
Rob Pike committed
2371

2372
func (ch Chan) ServeHTTP(w http.ResponseWriter, req *http.Request) {
2373
    ch &lt;- req
2374
    fmt.Fprint(w, "notification sent")
Rob Pike's avatar
Rob Pike committed
2375 2376 2377 2378 2379
}
</pre>
<p>
Finally, let's say we wanted to present on <code>/args</code> the arguments
used when invoking the server binary.
Rob Pike's avatar
Rob Pike committed
2380
It's easy to write a function to print the arguments.
Rob Pike's avatar
Rob Pike committed
2381 2382 2383
</p>
<pre>
func ArgServer() {
2384
    fmt.Println(os.Args)
Rob Pike's avatar
Rob Pike committed
2385 2386 2387 2388 2389
}
</pre>
<p>
How do we turn that into an HTTP server?  We could make <code>ArgServer</code>
a method of some type whose value we ignore, but there's a cleaner way.
Rob Pike's avatar
Rob Pike committed
2390 2391
Since we can define a method for any type except pointers and interfaces,
we can write a method for a function.
Rob Pike's avatar
Rob Pike committed
2392 2393 2394 2395 2396 2397 2398
The <code>http</code> package contains this code:
</p>
<pre>
// The HandlerFunc type is an adapter to allow the use of
// ordinary functions as HTTP handlers.  If f is a function
// with the appropriate signature, HandlerFunc(f) is a
// Handler object that calls f.
2399
type HandlerFunc func(ResponseWriter, *Request)
Rob Pike's avatar
Rob Pike committed
2400 2401

// ServeHTTP calls f(c, req).
2402 2403
func (f HandlerFunc) ServeHTTP(w ResponseWriter, req *Request) {
    f(w, req)
Rob Pike's avatar
Rob Pike committed
2404 2405 2406 2407 2408 2409
}
</pre>
<p>
<code>HandlerFunc</code> is a type with a method, <code>ServeHTTP</code>,
so values of that type can serve HTTP requests.  Look at the implementation
of the method: the receiver is a function, <code>f</code>, and the method
Rob Pike's avatar
Rob Pike committed
2410
calls <code>f</code>.  That may seem odd but it's not that different from, say,
Rob Pike's avatar
Rob Pike committed
2411 2412 2413
the receiver being a channel and the method sending on the channel.
</p>
<p>
Rob Pike's avatar
Rob Pike committed
2414 2415
To make <code>ArgServer</code> into an HTTP server, we first modify it
to have the right signature.
Rob Pike's avatar
Rob Pike committed
2416 2417 2418
</p>
<pre>
// Argument server.
2419
func ArgServer(w http.ResponseWriter, req *http.Request) {
2420
    fmt.Fprintln(w, os.Args)
Rob Pike's avatar
Rob Pike committed
2421 2422 2423
}
</pre>
<p>
Rob Pike's avatar
Rob Pike committed
2424 2425
<code>ArgServer</code> now has same signature as <code>HandlerFunc</code>,
so it can be converted to that type to access its methods,
Rob Pike's avatar
Rob Pike committed
2426 2427
just as we converted <code>Sequence</code> to <code>IntSlice</code>
to access <code>IntSlice.Sort</code>.
Rob Pike's avatar
Rob Pike committed
2428
The code to set it up is concise:
Rob Pike's avatar
Rob Pike committed
2429 2430
</p>
<pre>
2431
http.Handle("/args", http.HandlerFunc(ArgServer))
Rob Pike's avatar
Rob Pike committed
2432 2433 2434
</pre>
<p>
When someone visits the page <code>/args</code>,
Rob Pike's avatar
Rob Pike committed
2435 2436
the handler installed at that page has value <code>ArgServer</code>
and type <code>HandlerFunc</code>.
Rob Pike's avatar
Rob Pike committed
2437
The HTTP server will invoke the method <code>ServeHTTP</code>
Rob Pike's avatar
Rob Pike committed
2438
of that type, with <code>ArgServer</code> as the receiver, which will in turn call
Rob Pike's avatar
Rob Pike committed
2439
<code>ArgServer</code> (via the invocation <code>f(c, req)</code>
Rob Pike's avatar
Rob Pike committed
2440 2441
inside <code>HandlerFunc.ServeHTTP</code>).
The arguments will then be displayed.
Rob Pike's avatar
Rob Pike committed
2442 2443
</p>
<p>
Rob Pike's avatar
Rob Pike committed
2444
In this section we have made an HTTP server from a struct, an integer,
Rob Pike's avatar
Rob Pike committed
2445 2446 2447 2448
a channel, and a function, all because interfaces are just sets of
methods, which can be defined for (almost) any type.
</p>

2449 2450 2451 2452 2453 2454 2455 2456 2457 2458 2459 2460 2461 2462 2463 2464 2465 2466 2467 2468
<h2 id="blank">The blank identifier</h2>

<p>
We've mentioned the blank identifier a couple of times now, in the context of
<a href="#for"><code>for</code> <code>range</code> loops</a>
and <a href="#maps">maps</a>.
The blank identifier can be assigned or declared with any value of any type, with the
value discarded harmlessly.
It's a bit like writing to the Unix <code>/dev/null</code> file:
it represents a write-only value
to be used as a place-holder
where a variable is needed but the actual value is irrelevant.
It has uses beyond those we've seen already.
</p>

<h3 id="blank_assign">The blank identifier in multiple assignment</h3>

<p>
The use of a blank identifier in a <code>for</code> <code>range</code> loop is a
special case of a general situation: multiple assignment.
2469 2470
</p>

2471 2472 2473
<p>
If an assignment requires multiple values on the left side,
but one of the values will not be used by the program,
2474
a blank identifier on the left-hand-side of
2475 2476 2477 2478 2479 2480 2481 2482 2483 2484 2485 2486 2487 2488 2489 2490 2491 2492 2493 2494 2495 2496 2497 2498 2499 2500 2501 2502 2503 2504 2505 2506 2507 2508 2509 2510 2511 2512 2513 2514 2515 2516 2517
the assignment avoids the need
to create a dummy variable and makes it clear that the
value is to be discarded.
For instance, when calling a function that returns
a value and an error, but only the error is important,
use the blank identifier to discard the irrelevant value.
</p>

<pre>
if _, err := os.Stat(path); os.IsNotExist(err) {
	fmt.Printf("%s does not exist\n", path)
}
</pre>

<p>
Occasionally you'll see code that discards the error value in order
to ignore the error; this is terrible practice. Always check error returns;
they're provided for a reason.
</p>

<pre>
// Bad! This code will crash if path does not exist.
fi, _ := os.Stat(path)
if fi.IsDir() {
    fmt.Printf("%s is a directory\n", path)
}
</pre>

<h3 id="blank_unused">Unused imports and variables</h3>

<p>
It is an error to import a package or to declare a variable without using it.
Unused imports bloat the program and slow compilation,
while a variable that is initialized but not used is at least
a wasted computation and perhaps indicative of a
larger bug.
When a program is under active development, however,
unused imports and variables often arise and it can
be annoying to delete them just to have the compilation proceed,
only to have them be needed again later.
The blank identifier provides a workaround.
</p>
<p>
2518
This half-written program has two unused imports
2519 2520 2521 2522 2523 2524 2525 2526 2527 2528 2529 2530 2531 2532 2533 2534 2535 2536 2537 2538 2539 2540 2541 2542 2543 2544 2545 2546 2547 2548 2549 2550 2551 2552 2553 2554 2555 2556 2557 2558 2559 2560 2561 2562 2563 2564 2565 2566 2567 2568 2569 2570 2571 2572 2573 2574 2575 2576 2577 2578 2579 2580 2581 2582 2583 2584 2585
(<code>fmt</code> and <code>io</code>)
and an unused variable (<code>fd</code>),
so it will not compile, but it would be nice to see if the
code so far is correct.
</p>
{{code "/doc/progs/eff_unused1.go" `/package/` `$`}}
<p>
To silence complaints about the unused imports, use a
blank identifier to refer to a symbol from the imported package.
Similarly, assigning the unused variable <code>fd</code>
to the blank identifier will silence the unused variable error.
This version of the program does compile.
</p>
{{code "/doc/progs/eff_unused2.go" `/package/` `$`}}

<p>
By convention, the global declarations to silence import errors
should come right after the imports and be commented,
both to make them easy to find and as a reminder to clean things up later.
</p>

<h3 id="blank_import">Import for side effect</h3>

<p>
An unused import like <code>fmt</code> or <code>io</code> in the
previous example should eventually be used or removed:
blank assignments identify code as a work in progress.
But sometimes it is useful to import a package only for its
side effects, without any explicit use.
For example, during its <code>init</code> function,
the <code><a href="/pkg/net/http/pprof/">net/http/pprof</a></code>
package registers HTTP handlers that provide
debugging information. It has an exported API, but
most clients need only the handler registration and
access the data through a web page.
To import the package only for its side effects, rename the package
to the blank identifier:
</p>
<pre>
import _ "net/http/pprof"
</pre>
<p>
This form of import makes clear that the package is being
imported for its side effects, because there is no other possible
use of the package: in this file, it doesn't have a name.
(If it did, and we didn't use that name, the compiler would reject the program.)
</p>

<h3 id="blank_implements">Interface checks</h3>

<p>
As we saw in the discussion of <a href="#interfaces_and_types">interfaces</a> above,
a type need not declare explicitly that it implements an interface.
Instead, a type implements the interface just by implementing the interface's methods.
In practice, most interface conversions are static and therefore checked at compile time.
For example, passing an <code>*os.File</code> to a function
expecting an <code>io.Reader</code> will not compile unless
<code>*os.File</code> implements the <code>io.Reader</code> interface.
</p>

<p>
Some interface checks do happen at run-time, though.
One instance is in the <code><a href="/pkg/encoding/json/">encoding/json</a></code>
package, which defines a <code><a href="/pkg/encoding/json/#Marshaler">Marshaler</a></code>
interface. When the JSON encoder receives a value that implements that interface,
the encoder invokes the value's marshaling method to convert it to JSON
instead of doing the standard conversion.
2586
The encoder checks this property at run time with a <a href="#interface_conversions">type assertion</a> like:
2587 2588 2589 2590 2591 2592 2593 2594 2595 2596 2597 2598 2599 2600 2601 2602 2603 2604 2605 2606
</p>

<pre>
m, ok := val.(json.Marshaler)
</pre>

<p>
If it's necessary only to ask whether a type implements an interface, without
actually using the interface itself, perhaps as part of an error check, use the blank
identifier to ignore the type-asserted value:
</p>

<pre>
if _, ok := val.(json.Marshaler); ok {
    fmt.Printf("value %v of type %T implements json.Marshaler\n", val, val)
}
</pre>

<p>
One place this situation arises is when it is necessary to guarantee within the package implementing the type that
Shenghou Ma's avatar
Shenghou Ma committed
2607
it actually satisfies the interface.
2608 2609
If a type—for example,
<code><a href="/pkg/encoding/json/#RawMessage">json.RawMessage</a></code>—needs
2610
a custom JSON representation, it should implement
2611 2612 2613 2614 2615 2616 2617 2618 2619 2620 2621 2622 2623 2624 2625 2626 2627 2628 2629 2630 2631 2632 2633 2634 2635 2636 2637 2638 2639 2640
<code>json.Marshaler</code>, but there are no static conversions that would
cause the compiler to verify this automatically.
If the type inadvertently fails to satisfy the interface, the JSON encoder will still work,
but will not use the custom implementation.
To guarantee that the implementation is correct,
a global declaration using the blank identifier can be used in the package:
</p>
<pre>
var _ json.Marshaler = (*RawMessage)(nil)
</pre>
<p>
In this declaration, the assignment involving a conversion of a
<code>*RawMessage</code> to a <code>Marshaler</code>
requires that <code>*RawMessage</code> implements <code>Marshaler</code>,
and that property will be checked at compile time.
Should the <code>json.Marshaler</code> interface change, this package
will no longer compile and we will be on notice that it needs to be updated.
</p>

<p>
The appearance of the blank identifier in this construct indicates that
the declaration exists only for the type checking,
not to create a variable.
Don't do this for every type that satisfies an interface, though.
By convention, such declarations are only used
when there are no static conversions already present in the code,
which is a rare event.
</p>


Rob Pike's avatar
Rob Pike committed
2641 2642 2643 2644 2645 2646 2647 2648 2649 2650 2651 2652 2653 2654 2655
<h2 id="embedding">Embedding</h2>

<p>
Go does not provide the typical, type-driven notion of subclassing,
but it does have the ability to &ldquo;borrow&rdquo; pieces of an
implementation by <em>embedding</em> types within a struct or
interface.
</p>
<p>
Interface embedding is very simple.
We've mentioned the <code>io.Reader</code> and <code>io.Writer</code> interfaces before;
here are their definitions.
</p>
<pre>
type Reader interface {
2656
    Read(p []byte) (n int, err error)
Rob Pike's avatar
Rob Pike committed
2657 2658 2659
}

type Writer interface {
2660
    Write(p []byte) (n int, err error)
Rob Pike's avatar
Rob Pike committed
2661 2662 2663 2664 2665 2666 2667 2668 2669 2670 2671 2672
}
</pre>
<p>
The <code>io</code> package also exports several other interfaces
that specify objects that can implement several such methods.
For instance, there is <code>io.ReadWriter</code>, an interface
containing both <code>Read</code> and <code>Write</code>.
We could specify <code>io.ReadWriter</code> by listing the
two methods explicitly, but it's easier and more evocative
to embed the two interfaces to form the new one, like this:
</p>
<pre>
2673
// ReadWriter is the interface that combines the Reader and Writer interfaces.
Rob Pike's avatar
Rob Pike committed
2674
type ReadWriter interface {
2675 2676
    Reader
    Writer
Rob Pike's avatar
Rob Pike committed
2677 2678 2679 2680 2681 2682 2683 2684
}
</pre>
<p>
This says just what it looks like: A <code>ReadWriter</code> can do
what a <code>Reader</code> does <em>and</em> what a <code>Writer</code>
does; it is a union of the embedded interfaces (which must be disjoint
sets of methods).
Only interfaces can be embedded within interfaces.
2685
</p>
Rob Pike's avatar
Rob Pike committed
2686 2687 2688 2689 2690 2691 2692 2693 2694 2695 2696 2697 2698 2699 2700
<p>
The same basic idea applies to structs, but with more far-reaching
implications.  The <code>bufio</code> package has two struct types,
<code>bufio.Reader</code> and <code>bufio.Writer</code>, each of
which of course implements the analogous interfaces from package
<code>io</code>.
And <code>bufio</code> also implements a buffered reader/writer,
which it does by combining a reader and a writer into one struct
using embedding: it lists the types within the struct
but does not give them field names.
</p>
<pre>
// ReadWriter stores pointers to a Reader and a Writer.
// It implements io.ReadWriter.
type ReadWriter struct {
2701 2702
    *Reader  // *bufio.Reader
    *Writer  // *bufio.Writer
Rob Pike's avatar
Rob Pike committed
2703 2704 2705
}
</pre>
<p>
2706 2707 2708 2709
The embedded elements are pointers to structs and of course
must be initialized to point to valid structs before they
can be used.
The <code>ReadWriter</code> struct could be written as
Rob Pike's avatar
Rob Pike committed
2710 2711 2712
</p>
<pre>
type ReadWriter struct {
2713 2714
    reader *Reader
    writer *Writer
Rob Pike's avatar
Rob Pike committed
2715 2716 2717 2718 2719 2720 2721 2722
}
</pre>
<p>
but then to promote the methods of the fields and to
satisfy the <code>io</code> interfaces, we would also need
to provide forwarding methods, like this:
</p>
<pre>
2723
func (rw *ReadWriter) Read(p []byte) (n int, err error) {
2724
    return rw.reader.Read(p)
Rob Pike's avatar
Rob Pike committed
2725 2726 2727 2728 2729 2730 2731 2732 2733 2734 2735 2736
}
</pre>
<p>
By embedding the structs directly, we avoid this bookkeeping.
The methods of embedded types come along for free, which means that <code>bufio.ReadWriter</code>
not only has the methods of <code>bufio.Reader</code> and <code>bufio.Writer</code>,
it also satisfies all three interfaces:
<code>io.Reader</code>,
<code>io.Writer</code>, and
<code>io.ReadWriter</code>.
</p>
<p>
Rob Pike's avatar
Rob Pike committed
2737
There's an important way in which embedding differs from subclassing.  When we embed a type,
2738 2739
the methods of that type become methods of the outer type,
but when they are invoked the receiver of the method is the inner type, not the outer one.
Rob Pike's avatar
Rob Pike committed
2740
In our example, when the <code>Read</code> method of a <code>bufio.ReadWriter</code> is
Rob Pike's avatar
Rob Pike committed
2741
invoked, it has exactly the same effect as the forwarding method written out above;
Rob Pike's avatar
Rob Pike committed
2742 2743 2744
the receiver is the <code>reader</code> field of the <code>ReadWriter</code>, not the
<code>ReadWriter</code> itself.
</p>
Rob Pike's avatar
Rob Pike committed
2745 2746 2747 2748 2749 2750
<p>
Embedding can also be a simple convenience.
This example shows an embedded field alongside a regular, named field.
</p>
<pre>
type Job struct {
2751 2752
    Command string
    *log.Logger
Rob Pike's avatar
Rob Pike committed
2753 2754 2755 2756 2757
}
</pre>
<p>
The <code>Job</code> type now has the <code>Log</code>, <code>Logf</code>
and other
2758
methods of <code>*log.Logger</code>.  We could have given the <code>Logger</code>
2759 2760 2761
a field name, of course, but it's not necessary to do so.  And now, once
initialized, we can
log to the <code>Job</code>:
Rob Pike's avatar
Rob Pike committed
2762 2763
</p>
<pre>
2764
job.Log("starting now...")
Rob Pike's avatar
Rob Pike committed
2765 2766
</pre>
<p>
2767 2768
The <code>Logger</code> is a regular field of the <code>Job</code> struct,
so we can initialize it in the usual way inside the constructor for <code>Job</code>, like this,
2769 2770 2771
</p>
<pre>
func NewJob(command string, logger *log.Logger) *Job {
2772
    return &amp;Job{command, logger}
2773 2774 2775
}
</pre>
<p>
2776 2777 2778
or with a composite literal,
</p>
<pre>
2779
job := &amp;Job{command, log.New(os.Stderr, "Job: ", log.Ldate)}
2780 2781
</pre>
<p>
Rob Pike's avatar
Rob Pike committed
2782
If we need to refer to an embedded field directly, the type name of the field,
2783 2784 2785
ignoring the package qualifier, serves as a field name, as it did
in the <code>Read</code> method of our <code>ReaderWriter</code> struct.
Here, if we needed to access the
Rob Pike's avatar
Rob Pike committed
2786
<code>*log.Logger</code> of a <code>Job</code> variable <code>job</code>,
2787 2788
we would write <code>job.Logger</code>,
which would be useful if we wanted to refine the methods of <code>Logger</code>.
Rob Pike's avatar
Rob Pike committed
2789 2790
</p>
<pre>
2791
func (job *Job) Logf(format string, args ...interface{}) {
2792
    job.Logger.Logf("%q: %s", job.Command, fmt.Sprintf(format, args...))
Rob Pike's avatar
Rob Pike committed
2793 2794 2795 2796 2797 2798 2799 2800 2801 2802 2803 2804
}
</pre>
<p>
Embedding types introduces the problem of name conflicts but the rules to resolve
them are simple.
First, a field or method <code>X</code> hides any other item <code>X</code> in a more deeply
nested part of the type.
If <code>log.Logger</code> contained a field or method called <code>Command</code>, the <code>Command</code> field
of <code>Job</code> would dominate it.
</p>
<p>
Second, if the same name appears at the same nesting level, it is usually an error;
2805
it would be erroneous to embed <code>log.Logger</code> if the <code>Job</code> struct
Rob Pike's avatar
Rob Pike committed
2806 2807 2808
contained another field or method called <code>Logger</code>.
However, if the duplicate name is never mentioned in the program outside the type definition, it is OK.
This qualification provides some protection against changes made to types embedded from outside; there
Rob Pike's avatar
Rob Pike committed
2809 2810
is no problem if a field is added that conflicts with another field in another subtype if neither field
is ever used.
Rob Pike's avatar
Rob Pike committed
2811
</p>
Rob Pike's avatar
Rob Pike committed
2812 2813


2814 2815 2816 2817
<h2 id="concurrency">Concurrency</h2>

<h3 id="sharing">Share by communicating</h3>

Rob Pike's avatar
Rob Pike committed
2818 2819 2820 2821
<p>
Concurrent programming is a large topic and there is space only for some
Go-specific highlights here.
</p>
2822 2823 2824 2825 2826 2827 2828 2829 2830 2831 2832 2833 2834 2835 2836 2837 2838 2839 2840 2841
<p>
Concurrent programming in many environments is made difficult by the
subtleties required to implement correct access to shared variables.  Go encourages
a different approach in which shared values are passed around on channels
and, in fact, never actively shared by separate threads of execution.
Only one goroutine has access to the value at any given time.
Data races cannot occur, by design.
To encourage this way of thinking we have reduced it to a slogan:
</p>
<blockquote>
Do not communicate by sharing memory;
instead, share memory by communicating.
</blockquote>
<p>
This approach can be taken too far.  Reference counts may be best done
by putting a mutex around an integer variable, for instance.  But as a
high-level approach, using channels to control access makes it easier
to write clear, correct programs.
</p>
<p>
Rob Pike's avatar
Rob Pike committed
2842
One way to think about this model is to consider a typical single-threaded
2843 2844 2845
program running on one CPU. It has no need for synchronization primitives.
Now run another such instance; it too needs no synchronization.  Now let those
two communicate; if the communication is the synchronizer, there's still no need
Rob Pike's avatar
Rob Pike committed
2846
for other synchronization.  Unix pipelines, for example, fit this model
Rob Pike's avatar
Rob Pike committed
2847
perfectly.  Although Go's approach to concurrency originates in Hoare's
2848 2849 2850 2851 2852 2853
Communicating Sequential Processes (CSP),
it can also be seen as a type-safe generalization of Unix pipes.
</p>

<h3 id="goroutines">Goroutines</h3>

Rob Pike's avatar
Rob Pike committed
2854 2855 2856 2857
<p>
They're called <em>goroutines</em> because the existing
terms&mdash;threads, coroutines, processes, and so on&mdash;convey
inaccurate connotations.  A goroutine has a simple model: it is a
2858
function executing concurrently with other goroutines in the same
Rob Pike's avatar
Rob Pike committed
2859 2860 2861 2862 2863 2864 2865 2866 2867 2868 2869 2870 2871 2872 2873 2874 2875 2876 2877 2878
address space.  It is lightweight, costing little more than the
allocation of stack space.
And the stacks start small, so they are cheap, and grow
by allocating (and freeing) heap storage as required.
</p>
<p>
Goroutines are multiplexed onto multiple OS threads so if one should
block, such as while waiting for I/O, others continue to run.  Their
design hides many of the complexities of thread creation and
management.
</p>
<p>
Prefix a function or method call with the <code>go</code>
keyword to run the call in a new goroutine.
When the call completes, the goroutine
exits, silently.  (The effect is similar to the Unix shell's
<code>&amp;</code> notation for running a command in the
background.)
</p>
<pre>
2879
go list.Sort()  // run list.Sort concurrently; don't wait for it.
Rob Pike's avatar
Rob Pike committed
2880 2881 2882
</pre>
<p>
A function literal can be handy in a goroutine invocation.
2883
</p>
Rob Pike's avatar
Rob Pike committed
2884
<pre>
2885
func Announce(message string, delay time.Duration) {
2886 2887 2888 2889
    go func() {
        time.Sleep(delay)
        fmt.Println(message)
    }()  // Note the parentheses - must call the function.
Rob Pike's avatar
Rob Pike committed
2890 2891 2892
}
</pre>
<p>
Rob Pike's avatar
Rob Pike committed
2893
In Go, function literals are closures: the implementation makes
Rob Pike's avatar
Rob Pike committed
2894
sure the variables referred to by the function survive as long as they are active.
2895
</p>
Rob Pike's avatar
Rob Pike committed
2896 2897 2898 2899 2900
<p>
These examples aren't too practical because the functions have no way of signaling
completion.  For that, we need channels.
</p>

2901 2902
<h3 id="channels">Channels</h3>

Rob Pike's avatar
Rob Pike committed
2903
<p>
2904 2905
Like maps, channels are allocated with <code>make</code>, and
the resulting value acts as a reference to an underlying data structure.
Rob Pike's avatar
Rob Pike committed
2906 2907 2908 2909
If an optional integer parameter is provided, it sets the buffer size for the channel.
The default is zero, for an unbuffered or synchronous channel.
</p>
<pre>
2910 2911 2912
ci := make(chan int)            // unbuffered channel of integers
cj := make(chan int, 0)         // unbuffered channel of integers
cs := make(chan *os.File, 100)  // buffered channel of pointers to Files
Rob Pike's avatar
Rob Pike committed
2913 2914
</pre>
<p>
2915
Unbuffered channels combine communication&mdash;the exchange of a value&mdash;with
Rob Pike's avatar
Rob Pike committed
2916 2917 2918 2919 2920 2921 2922 2923 2924
synchronization&mdash;guaranteeing that two calculations (goroutines) are in
a known state.
</p>
<p>
There are lots of nice idioms using channels.  Here's one to get us started.
In the previous section we launched a sort in the background. A channel
can allow the launching goroutine to wait for the sort to complete.
</p>
<pre>
2925
c := make(chan int)  // Allocate a channel.
Rob Pike's avatar
Rob Pike committed
2926 2927
// Start the sort in a goroutine; when it completes, signal on the channel.
go func() {
2928
    list.Sort()
2929
    c &lt;- 1  // Send a signal; value does not matter.
2930 2931 2932
}()
doSomethingForAWhile()
&lt;-c   // Wait for sort to finish; discard sent value.
Rob Pike's avatar
Rob Pike committed
2933 2934 2935 2936 2937 2938
</pre>
<p>
Receivers always block until there is data to receive.
If the channel is unbuffered, the sender blocks until the receiver has
received the value.
If the channel has a buffer, the sender blocks only until the
Ian Lance Taylor's avatar
Ian Lance Taylor committed
2939 2940
value has been copied to the buffer; if the buffer is full, this
means waiting until some receiver has retrieved a value.
Rob Pike's avatar
Rob Pike committed
2941 2942 2943 2944
</p>
<p>
A buffered channel can be used like a semaphore, for instance to
limit throughput.  In this example, incoming requests are passed
2945 2946 2947
to <code>handle</code>, which sends a value into the channel, processes
the request, and then receives a value from the channel
to ready the &ldquo;semaphore&rdquo; for the next consumer.
Rob Pike's avatar
Rob Pike committed
2948
The capacity of the channel buffer limits the number of
2949
simultaneous calls to <code>process</code>.
Rob Pike's avatar
Rob Pike committed
2950 2951 2952 2953 2954
</p>
<pre>
var sem = make(chan int, MaxOutstanding)

func handle(r *Request) {
2955 2956 2957
    sem &lt;- 1    // Wait for active queue to drain.
    process(r)  // May take a long time.
    &lt;-sem       // Done; enable next request to run.
Rob Pike's avatar
Rob Pike committed
2958 2959 2960 2961
}

func Serve(queue chan *Request) {
    for {
2962
        req := &lt;-queue
2963
        go handle(req)  // Don't wait for handle to finish.
Rob Pike's avatar
Rob Pike committed
2964 2965 2966
    }
}
</pre>
2967

Rob Pike's avatar
Rob Pike committed
2968
<p>
2969 2970 2971
Once <code>MaxOutstanding</code> handlers are executing <code>process</code>,
any more will block trying to send into the filled channel buffer,
until one of the existing handlers finishes and receives from the buffer.
2972 2973 2974 2975 2976 2977 2978 2979 2980 2981
</p>

<p>
This design has a problem, though: <code>Serve</code>
creates a new goroutine for
every incoming request, even though only <code>MaxOutstanding</code>
of them can run at any moment.
As a result, the program can consume unlimited resources if the requests come in too fast.
We can address that deficiency by changing <code>Serve</code> to
gate the creation of the goroutines.
2982
Here's an obvious solution, but beware it has a bug we'll fix subsequently:
2983 2984 2985 2986 2987
</p>

<pre>
func Serve(queue chan *Request) {
    for req := range queue {
2988
        sem &lt;- 1
2989 2990
        go func() {
            process(req) // Buggy; see explanation below.
2991
            &lt;-sem
2992 2993 2994 2995 2996 2997 2998 2999 3000 3001 3002 3003 3004 3005 3006 3007 3008
        }()
    }
}</pre>

<p>
The bug is that in a Go <code>for</code> loop, the loop variable
is reused for each iteration, so the <code>req</code>
variable is shared across all goroutines.
That's not what we want.
We need to make sure that <code>req</code> is unique for each goroutine.
Here's one way to do that, passing the value of <code>req</code> as an argument
to the closure in the goroutine:
</p>

<pre>
func Serve(queue chan *Request) {
    for req := range queue {
3009
        sem &lt;- 1
3010 3011
        go func(req *Request) {
            process(req)
3012
            &lt;-sem
3013 3014 3015 3016 3017 3018 3019 3020 3021 3022 3023 3024 3025 3026 3027
        }(req)
    }
}</pre>

<p>
Compare this version with the previous to see the difference in how
the closure is declared and run.
Another solution is just to create a new variable with the same
name, as in this example:
</p>

<pre>
func Serve(queue chan *Request) {
    for req := range queue {
        req := req // Create new instance of req for the goroutine.
3028
        sem &lt;- 1
3029 3030
        go func() {
            process(req)
3031
            &lt;-sem
3032
        }()
3033 3034 3035 3036
    }
}</pre>

<p>
3037 3038 3039 3040 3041 3042 3043 3044 3045 3046 3047 3048 3049 3050 3051 3052
It may seem odd to write
</p>

<pre>
req := req
</pre>

<p>
but it's a legal and idiomatic in Go to do this.
You get a fresh version of the variable with the same name, deliberately
shadowing the loop variable locally but unique to each goroutine.
</p>

<p>
Going back to the general problem of writing the server,
another approach that manages resources well is to start a fixed
Rob Pike's avatar
Rob Pike committed
3053 3054 3055 3056 3057 3058 3059 3060
number of <code>handle</code> goroutines all reading from the request
channel.
The number of goroutines limits the number of simultaneous
calls to <code>process</code>.
This <code>Serve</code> function also accepts a channel on which
it will be told to exit; after launching the goroutines it blocks
receiving from that channel.
</p>
3061

Rob Pike's avatar
Rob Pike committed
3062 3063
<pre>
func handle(queue chan *Request) {
3064 3065 3066
    for r := range queue {
        process(r)
    }
Rob Pike's avatar
Rob Pike committed
3067 3068
}

3069
func Serve(clientRequests chan *Request, quit chan bool) {
3070
    // Start handlers
3071
    for i := 0; i &lt; MaxOutstanding; i++ {
3072 3073
        go handle(clientRequests)
    }
3074
    &lt;-quit  // Wait to be told to exit.
Rob Pike's avatar
Rob Pike committed
3075 3076 3077 3078 3079 3080 3081 3082 3083
}
</pre>

<h3 id="chan_of_chan">Channels of channels</h3>
<p>
One of the most important properties of Go is that
a channel is a first-class value that can be allocated and passed
around like any other.  A common use of this property is
to implement safe, parallel demultiplexing.
3084
</p>
Rob Pike's avatar
Rob Pike committed
3085 3086 3087 3088 3089 3090 3091 3092 3093
<p>
In the example in the previous section, <code>handle</code> was
an idealized handler for a request but we didn't define the
type it was handling.  If that type includes a channel on which
to reply, each client can provide its own path for the answer.
Here's a schematic definition of type <code>Request</code>.
</p>
<pre>
type Request struct {
3094 3095 3096
    args        []int
    f           func([]int) int
    resultChan  chan int
Rob Pike's avatar
Rob Pike committed
3097 3098 3099 3100 3101 3102 3103 3104
}
</pre>
<p>
The client provides a function and its arguments, as well as
a channel inside the request object on which to receive the answer.
</p>
<pre>
func sum(a []int) (s int) {
3105 3106 3107 3108
    for _, v := range a {
        s += v
    }
    return
Rob Pike's avatar
Rob Pike committed
3109 3110 3111 3112
}

request := &amp;Request{[]int{3, 4, 5}, sum, make(chan int)}
// Send request
3113
clientRequests &lt;- request
Rob Pike's avatar
Rob Pike committed
3114
// Wait for response.
3115
fmt.Printf("answer: %d\n", &lt;-request.resultChan)
Rob Pike's avatar
Rob Pike committed
3116 3117 3118 3119 3120 3121
</pre>
<p>
On the server side, the handler function is the only thing that changes.
</p>
<pre>
func handle(queue chan *Request) {
3122
    for req := range queue {
3123
        req.resultChan &lt;- req.f(req.args)
3124
    }
Rob Pike's avatar
Rob Pike committed
3125 3126 3127 3128 3129 3130 3131 3132 3133 3134 3135 3136
}
</pre>
<p>
There's clearly a lot more to do to make it realistic, but this
code is a framework for a rate-limited, parallel, non-blocking RPC
system, and there's not a mutex in sight.
</p>

<h3 id="parallel">Parallelization</h3>
<p>
Another application of these ideas is to parallelize a calculation
across multiple CPU cores.  If the calculation can be broken into
Rob Pike's avatar
Rob Pike committed
3137 3138
separate pieces that can execute independently, it can be parallelized,
with a channel to signal when each piece completes.
Rob Pike's avatar
Rob Pike committed
3139 3140
</p>
<p>
Rob Pike's avatar
Rob Pike committed
3141
Let's say we have an expensive operation to perform on a vector of items,
Rob Pike's avatar
Rob Pike committed
3142 3143 3144 3145
and that the value of the operation on each item is independent,
as in this idealized example.
</p>
<pre>
Rob Pike's avatar
Rob Pike committed
3146
type Vector []float64
Rob Pike's avatar
Rob Pike committed
3147

Rob Pike's avatar
Rob Pike committed
3148
// Apply the operation to v[i], v[i+1] ... up to v[n-1].
Rob Pike's avatar
Rob Pike committed
3149
func (v Vector) DoSome(i, n int, u Vector, c chan int) {
3150
    for ; i &lt; n; i++ {
Rob Pike's avatar
Rob Pike committed
3151 3152
        v[i] += u.Op(v[i])
    }
3153
    c &lt;- 1    // signal that this piece is done
Rob Pike's avatar
Rob Pike committed
3154 3155 3156 3157 3158 3159 3160 3161 3162
}
</pre>
<p>
We launch the pieces independently in a loop, one per CPU.
They can complete in any order but it doesn't matter; we just
count the completion signals by draining the channel after
launching all the goroutines.
</p>
<pre>
3163
const NCPU = 4  // number of CPU cores
Rob Pike's avatar
Rob Pike committed
3164

Rob Pike's avatar
Rob Pike committed
3165
func (v Vector) DoAll(u Vector) {
3166
    c := make(chan int, NCPU)  // Buffering optional but sensible.
3167
    for i := 0; i &lt; NCPU; i++ {
3168
        go v.DoSome(i*len(v)/NCPU, (i+1)*len(v)/NCPU, u, c)
Rob Pike's avatar
Rob Pike committed
3169 3170
    }
    // Drain the channel.
3171 3172
    for i := 0; i &lt; NCPU; i++ {
        &lt;-c    // wait for one task to complete
Rob Pike's avatar
Rob Pike committed
3173 3174 3175 3176 3177 3178
    }
    // All done.
}

</pre>

Rob Pike's avatar
Rob Pike committed
3179
<p>
Rob Pike's avatar
Rob Pike committed
3180
The current implementation of the Go runtime
Rob Pike's avatar
Rob Pike committed
3181 3182 3183 3184 3185 3186 3187 3188 3189
will not parallelize this code by default.
It dedicates only a single core to user-level processing.  An
arbitrary number of goroutines can be blocked in system calls, but
by default only one can be executing user-level code at any time.
It should be smarter and one day it will be smarter, but until it
is if you want CPU parallelism you must tell the run-time
how many goroutines you want executing code simultaneously.  There
are two related ways to do this.  Either run your job with environment
variable <code>GOMAXPROCS</code> set to the number of cores to use
3190
or import the <code>runtime</code> package and call
Rob Pike's avatar
Rob Pike committed
3191
<code>runtime.GOMAXPROCS(NCPU)</code>.
3192 3193
A helpful value might be <code>runtime.NumCPU()</code>, which reports the number
of logical CPUs on the local machine.
Rob Pike's avatar
Rob Pike committed
3194 3195 3196
Again, this requirement is expected to be retired as the scheduling and run-time improve.
</p>

3197 3198 3199 3200 3201 3202 3203 3204 3205 3206 3207
<p>
Be sure not to confuse the ideas of concurrency—structuring a program
as independently executing components—and parallelism—executing
calculations in parallel for efficiency on multiple CPUs.
Although the concurrency features of Go can make some problems easy
to structure as parallel computations, Go is a concurrent language,
not a parallel one, and not all parallelization problems fit Go's model.
For a discussion of the distinction, see the talk cited in
<a href="http://blog.golang.org/2013/01/concurrency-is-not-parallelism.html">this
blog post</a>.

3208 3209 3210
<h3 id="leaky_buffer">A leaky buffer</h3>

<p>
Rob Pike's avatar
Rob Pike committed
3211
The tools of concurrent programming can even make non-concurrent
3212 3213 3214 3215 3216 3217 3218 3219 3220
ideas easier to express.  Here's an example abstracted from an RPC
package.  The client goroutine loops receiving data from some source,
perhaps a network.  To avoid allocating and freeing buffers, it keeps
a free list, and uses a buffered channel to represent it.  If the
channel is empty, a new buffer gets allocated.
Once the message buffer is ready, it's sent to the server on
<code>serverChan</code>.
</p>
<pre>
Russ Cox's avatar
Russ Cox committed
3221 3222
var freeList = make(chan *Buffer, 100)
var serverChan = make(chan *Buffer)
3223 3224

func client() {
3225
    for {
3226 3227 3228 3229 3230 3231 3232
        var b *Buffer
        // Grab a buffer if available; allocate if not.
        select {
        case b = &lt;-freeList:
            // Got one; nothing more to do.
        default:
            // None free, so allocate a new one.
3233 3234
            b = new(Buffer)
        }
3235 3236
        load(b)              // Read next message from the net.
        serverChan &lt;- b      // Send to server.
3237
    }
3238 3239 3240
}
</pre>
<p>
3241
The server loop receives each message from the client, processes it,
3242 3243 3244 3245
and returns the buffer to the free list.
</p>
<pre>
func server() {
3246
    for {
3247
        b := &lt;-serverChan    // Wait for work.
3248
        process(b)
3249 3250 3251 3252 3253 3254 3255
        // Reuse buffer if there's room.
        select {
        case freeList &lt;- b:
            // Buffer on free list; nothing more to do.
        default:
            // Free list full, just carry on.
        }
3256
    }
3257 3258 3259
}
</pre>
<p>
3260 3261 3262
The client attempts to retrieve a buffer from <code>freeList</code>;
if none is available, it allocates a fresh one.
The server's send to <code>freeList</code> puts <code>b</code> back
3263 3264 3265
on the free list unless the list is full, in which case the
buffer is dropped on the floor to be reclaimed by
the garbage collector.
3266 3267 3268
(The <code>default</code> clauses in the <code>select</code>
statements execute when no other case is ready,
meaning that the <code>selects</code> never block.)
3269 3270 3271 3272 3273
This implementation builds a leaky bucket free list
in just a few lines, relying on the buffered channel and
the garbage collector for bookkeeping.
</p>

Rob Pike's avatar
Rob Pike committed
3274
<h2 id="errors">Errors</h2>
Russ Cox's avatar
Russ Cox committed
3275

Rob Pike's avatar
Rob Pike committed
3276 3277 3278 3279
<p>
Library routines must often return some sort of error indication to
the caller.  As mentioned earlier, Go's multivalue return makes it
easy to return a detailed error description alongside the normal
3280 3281
return value.  By convention, errors have type <code>error</code>,
a simple built-in interface.
Rob Pike's avatar
Rob Pike committed
3282 3283
</p>
<pre>
3284 3285
type error interface {
    Error() string
Rob Pike's avatar
Rob Pike committed
3286 3287 3288 3289 3290 3291 3292 3293
}
</pre>
<p>
A library writer is free to implement this interface with a
richer model under the covers, making it possible not only
to see the error but also to provide some context.
For example, <code>os.Open</code> returns an <code>os.PathError</code>.
</p>
Russ Cox's avatar
Russ Cox committed
3294
<pre>
Rob Pike's avatar
Rob Pike committed
3295 3296 3297
// PathError records an error and the operation and
// file path that caused it.
type PathError struct {
3298 3299
    Op string    // "open", "unlink", etc.
    Path string  // The associated file.
3300
    Err error    // Returned by the system call.
Rob Pike's avatar
Rob Pike committed
3301 3302
}

3303 3304
func (e *PathError) Error() string {
    return e.Op + " " + e.Path + ": " + e.Err.Error()
Rob Pike's avatar
Rob Pike committed
3305
}
Russ Cox's avatar
Russ Cox committed
3306
</pre>
Rob Pike's avatar
Rob Pike committed
3307
<p>
3308
<code>PathError</code>'s <code>Error</code> generates
Rob Pike's avatar
Rob Pike committed
3309
a string like this:
Rob Pike's avatar
Rob Pike committed
3310
</p>
Russ Cox's avatar
Russ Cox committed
3311
<pre>
Rob Pike's avatar
Rob Pike committed
3312
open /etc/passwx: no such file or directory
Russ Cox's avatar
Russ Cox committed
3313
</pre>
Rob Pike's avatar
Rob Pike committed
3314
<p>
Rob Pike's avatar
Rob Pike committed
3315 3316 3317 3318 3319
Such an error, which includes the problematic file name, the
operation, and the operating system error it triggered, is useful even
if printed far from the call that caused it;
it is much more informative than the plain
"no such file or directory".
Rob Pike's avatar
Rob Pike committed
3320 3321
</p>

3322 3323
<p>
When feasible, error strings should identify their origin, such as by having
3324
a prefix naming the operation or package that generated the error.  For example, in package
3325 3326
<code>image</code>, the string representation for a decoding error due to an
unknown format is "image: unknown format".
3327 3328
</p>

Rob Pike's avatar
Rob Pike committed
3329 3330
<p>
Callers that care about the precise error details can
Rob Pike's avatar
Rob Pike committed
3331
use a type switch or a type assertion to look for specific
3332
errors and extract details.  For <code>PathErrors</code>
3333
this might include examining the internal <code>Err</code>
Rob Pike's avatar
Rob Pike committed
3334
field for recoverable failures.
Rob Pike's avatar
Rob Pike committed
3335
</p>
Russ Cox's avatar
Russ Cox committed
3336

Rob Pike's avatar
Rob Pike committed
3337
<pre>
3338
for try := 0; try &lt; 2; try++ {
3339
    file, err = os.Create(filename)
3340 3341 3342
    if err == nil {
        return
    }
3343
    if e, ok := err.(*os.PathError); ok &amp;&amp; e.Err == syscall.ENOSPC {
3344 3345 3346 3347
        deleteTempFiles()  // Recover some space.
        continue
    }
    return
Rob Pike's avatar
Rob Pike committed
3348 3349 3350
}
</pre>

3351
<p>
3352
The second <code>if</code> statement here is another <a href="#interface_conversions">type assertion</a>.
3353
If it fails, <code>ok</code> will be false, and <code>e</code>
3354 3355 3356 3357 3358 3359
will be <code>nil</code>.
If it succeeds,  <code>ok</code> will be true, which means the
error was of type <code>*os.PathError</code>, and then so is <code>e</code>,
which we can examine for more information about the error.
</p>

Rob Pike's avatar
Rob Pike committed
3360
<h3 id="panic">Panic</h3>
3361 3362

<p>
Rob Pike's avatar
Rob Pike committed
3363
The usual way to report an error to a caller is to return an
3364
<code>error</code> as an extra return value.  The canonical
Rob Pike's avatar
Rob Pike committed
3365
<code>Read</code> method is a well-known instance; it returns a byte
3366
count and an <code>error</code>.  But what if the error is
Rob Pike's avatar
Rob Pike committed
3367 3368 3369 3370 3371 3372 3373 3374 3375
unrecoverable?  Sometimes the program simply cannot continue.
</p>

<p>
For this purpose, there is a built-in function <code>panic</code>
that in effect creates a run-time error that will stop the program
(but see the next section).  The function takes a single argument
of arbitrary type&mdash;often a string&mdash;to be printed as the
program dies.  It's also a way to indicate that something impossible has
3376
happened, such as exiting an infinite loop.
Rob Pike's avatar
Rob Pike committed
3377 3378 3379 3380 3381 3382
</p>


<pre>
// A toy implementation of cube root using Newton's method.
func CubeRoot(x float64) float64 {
Andrew Gerrand's avatar
Andrew Gerrand committed
3383
    z := x/3   // Arbitrary initial value
3384
    for i := 0; i &lt; 1e6; i++ {
Rob Pike's avatar
Rob Pike committed
3385 3386 3387 3388 3389 3390 3391
        prevz := z
        z -= (z*z*z-x) / (3*z*z)
        if veryClose(z, prevz) {
            return z
        }
    }
    // A million iterations has not converged; something is wrong.
3392
    panic(fmt.Sprintf("CubeRoot(%g) did not converge", x))
Rob Pike's avatar
Rob Pike committed
3393 3394 3395 3396 3397 3398 3399 3400 3401 3402 3403 3404 3405 3406 3407 3408 3409 3410 3411 3412 3413 3414 3415 3416 3417 3418
}
</pre>

<p>
This is only an example but real library functions should
avoid <code>panic</code>.  If the problem can be masked or worked
around, it's always better to let things continue to run rather
than taking down the whole program.  One possible counterexample
is during initialization: if the library truly cannot set itself up,
it might be reasonable to panic, so to speak.
</p>

<pre>
var user = os.Getenv("USER")

func init() {
    if user == "" {
        panic("no value for $USER")
    }
}
</pre>

<h3 id="recover">Recover</h3>

<p>
When <code>panic</code> is called, including implicitly for run-time
3419
errors such as indexing a slice out of bounds or failing a type
Rob Pike's avatar
Rob Pike committed
3420 3421 3422 3423 3424 3425 3426 3427 3428 3429 3430 3431 3432 3433 3434 3435 3436 3437 3438 3439 3440
assertion, it immediately stops execution of the current function
and begins unwinding the stack of the goroutine, running any deferred
functions along the way.  If that unwinding reaches the top of the
goroutine's stack, the program dies.  However, it is possible to
use the built-in function <code>recover</code> to regain control
of the goroutine and resume normal execution.
</p>

<p>
A call to <code>recover</code> stops the unwinding and returns the
argument passed to <code>panic</code>.  Because the only code that
runs while unwinding is inside deferred functions, <code>recover</code>
is only useful inside deferred functions.
</p>

<p>
One application of <code>recover</code> is to shut down a failing goroutine
inside a server without killing the other executing goroutines.
</p>

<pre>
3441
func server(workChan &lt;-chan *Work) {
Rob Pike's avatar
Rob Pike committed
3442
    for work := range workChan {
3443
        go safelyDo(work)
Rob Pike's avatar
Rob Pike committed
3444 3445 3446 3447 3448 3449
    }
}

func safelyDo(work *Work) {
    defer func() {
        if err := recover(); err != nil {
3450
            log.Println("work failed:", err)
Rob Pike's avatar
Rob Pike committed
3451 3452 3453 3454 3455 3456 3457 3458 3459 3460 3461 3462 3463 3464
        }
    }()
    do(work)
}
</pre>

<p>
In this example, if <code>do(work)</code> panics, the result will be
logged and the goroutine will exit cleanly without disturbing the
others.  There's no need to do anything else in the deferred closure;
calling <code>recover</code> handles the condition completely.
</p>

<p>
3465 3466 3467 3468 3469 3470 3471 3472 3473 3474
Because <code>recover</code> always returns <code>nil</code> unless called directly
from a deferred function, deferred code can call library routines that themselves
use <code>panic</code> and <code>recover</code> without failing.  As an example,
the deferred function in <code>safelyDo</code> might call a logging function before
calling <code>recover</code>, and that logging code would run unaffected
by the panicking state.
</p>

<p>
With our recovery pattern in place, the <code>do</code>
Rob Pike's avatar
Rob Pike committed
3475 3476 3477
function (and anything it calls) can get out of any bad situation
cleanly by calling <code>panic</code>.  We can use that idea to
simplify error handling in complex software.  Let's look at an
3478
idealized version of a <code>regexp</code> package, which reports
Rob Pike's avatar
Rob Pike committed
3479
parsing errors by calling <code>panic</code> with a local
3480
error type.  Here's the definition of <code>Error</code>,
Rob Pike's avatar
Rob Pike committed
3481 3482 3483 3484
an <code>error</code> method, and the <code>Compile</code> function.
</p>

<pre>
3485
// Error is the type of a parse error; it satisfies the error interface.
Rob Pike's avatar
Rob Pike committed
3486
type Error string
3487
func (e Error) Error() string {
Rob Pike's avatar
Rob Pike committed
3488 3489 3490 3491 3492 3493 3494 3495 3496 3497
    return string(e)
}

// error is a method of *Regexp that reports parsing errors by
// panicking with an Error.
func (regexp *Regexp) error(err string) {
    panic(Error(err))
}

// Compile returns a parsed representation of the regular expression.
3498
func Compile(str string) (regexp *Regexp, err error) {
Rob Pike's avatar
Rob Pike committed
3499 3500 3501 3502 3503 3504 3505 3506 3507 3508 3509 3510 3511 3512 3513
    regexp = new(Regexp)
    // doParse will panic if there is a parse error.
    defer func() {
        if e := recover(); e != nil {
            regexp = nil    // Clear return value.
            err = e.(Error) // Will re-panic if not a parse error.
        }
    }()
    return regexp.doParse(str), nil
}
</pre>

<p>
If <code>doParse</code> panics, the recovery block will set the
return value to <code>nil</code>&mdash;deferred functions can modify
Andrew Gerrand's avatar
Andrew Gerrand committed
3514
named return values.  It will then check, in the assignment
Rob Pike's avatar
Rob Pike committed
3515
to <code>err</code>, that the problem was a parse error by asserting
3516
that it has the local type <code>Error</code>.
Rob Pike's avatar
Rob Pike committed
3517 3518
If it does not, the type assertion will fail, causing a run-time error
that continues the stack unwinding as though nothing had interrupted
3519 3520
it.
This check means that if something unexpected happens, such
3521
as an index out of bounds, the code will fail even though we
Rob Pike's avatar
Rob Pike committed
3522
are using <code>panic</code> and <code>recover</code> to handle
3523
parse errors.
Rob Pike's avatar
Rob Pike committed
3524 3525 3526
</p>

<p>
3527 3528 3529
With error handling in place, the <code>error</code> method (because it's a
method bound to a type, it's fine, even natural, for it to have the same name
as the builtin <code>error</code> type)
Rob Pike's avatar
Rob Pike committed
3530
makes it easy to report parse errors without worrying about unwinding
3531
the parse stack by hand:
Rob Pike's avatar
Rob Pike committed
3532 3533
</p>

3534
<pre>
3535
if pos == 0 {
3536 3537 3538 3539
    re.error("'*' illegal at start of expression")
}
</pre>

Rob Pike's avatar
Rob Pike committed
3540 3541 3542
<p>
Useful though this pattern is, it should be used only within a package.
<code>Parse</code> turns its internal <code>panic</code> calls into
3543
<code>error</code> values; it does not expose <code>panics</code>
Rob Pike's avatar
Rob Pike committed
3544
to its client.  That is a good rule to follow.
3545 3546
</p>

3547 3548 3549 3550 3551 3552 3553 3554 3555 3556 3557
<p>
By the way, this re-panic idiom changes the panic value if an actual
error occurs.  However, both the original and new failures will be
presented in the crash report, so the root cause of the problem will
still be visible.  Thus this simple re-panic approach is usually
sufficient&mdash;it's a crash after all&mdash;but if you want to
display only the original value, you can write a little more code to
filter unexpected problems and re-panic with the original error.
That's left as an exercise for the reader.
</p>

3558

3559 3560 3561 3562 3563 3564 3565 3566 3567 3568 3569 3570 3571 3572 3573 3574 3575 3576 3577 3578
<h2 id="web_server">A web server</h2>

<p>
Let's finish with a complete Go program, a web server.
This one is actually a kind of web re-server.
Google provides a service at
<a href="http://chart.apis.google.com">http://chart.apis.google.com</a>
that does automatic formatting of data into charts and graphs.
It's hard to use interactively, though,
because you need to put the data into the URL as a query.
The program here provides a nicer interface to one form of data: given a short piece of text,
it calls on the chart server to produce a QR code, a matrix of boxes that encode the
text.
That image can be grabbed with your cell phone's camera and interpreted as,
for instance, a URL, saving you typing the URL into the phone's tiny keyboard.
</p>
<p>
Here's the complete program.
An explanation follows.
</p>
3579
{{code "/doc/progs/eff_qr.go" `/package/` `$`}}
3580 3581 3582 3583 3584 3585 3586 3587 3588 3589 3590 3591 3592 3593 3594
<p>
The pieces up to <code>main</code> should be easy to follow.
The one flag sets a default HTTP port for our server.  The template
variable <code>templ</code> is where the fun happens. It builds an HTML template
that will be executed by the server to display the page; more about
that in a moment.
</p>
<p>
The <code>main</code> function parses the flags and, using the mechanism
we talked about above, binds the function <code>QR</code> to the root path
for the server.  Then <code>http.ListenAndServe</code> is called to start the
server; it blocks while the server runs.
</p>
<p>
<code>QR</code> just receives the request, which contains form data, and
Russ Cox's avatar
Russ Cox committed
3595
executes the template on the data in the form value named <code>s</code>.
3596 3597
</p>
<p>
3598
The template package <code>html/template</code> is powerful;
3599
this program just touches on its capabilities.
3600
In essence, it rewrites a piece of HTML text on the fly by substituting elements derived
3601
from data items passed to <code>templ.Execute</code>, in this case the
3602
form value.
3603
Within the template text (<code>templateStr</code>),
3604
double-brace-delimited pieces denote template actions.
3605 3606
The piece from <code>{{html "{{if .}}"}}</code>
to <code>{{html "{{end}}"}}</code> executes only if the value of the current data item, called <code>.</code> (dot),
3607 3608
is non-empty.
That is, when the string is empty, this piece of the template is suppressed.
3609 3610
</p>
<p>
3611 3612 3613 3614
The two snippets <code>{{html "{{.}}"}}</code> say to show the data presented to
the template—the query string—on the web page.
The HTML template package automatically provides appropriate escaping so the
text is safe to display.
3615 3616 3617
</p>
<p>
The rest of the template string is just the HTML to show when the page loads.
3618
If this is too quick an explanation, see the <a href="/pkg/html/template/">documentation</a>
3619 3620 3621
for the template package for a more thorough discussion.
</p>
<p>
Rob Pike's avatar
Rob Pike committed
3622
And there you have it: a useful web server in a few lines of code plus some
3623 3624 3625 3626
data-driven HTML text.
Go is powerful enough to make a lot happen in a few lines.
</p>

Rob Pike's avatar
Rob Pike committed
3627
<!--
Rob Pike's avatar
slices  
Rob Pike committed
3628
TODO
3629 3630 3631
<pre>
verifying implementation
type Color uint32
3632

3633 3634 3635 3636
// Check that Color implements image.Color and image.Image
var _ image.Color = Black
var _ image.Image = Black
</pre>
Rob Pike's avatar
Rob Pike committed
3637
-->
Rob Pike's avatar
Rob Pike committed
3638