Golang : Read large file with bufio.Scanner cause token too long error
There are couple of ways to read files with Golang and one of the common way to read in file is with bufio.NewScanner() function. For example :
package main
import (
"bufio"
"fmt"
"os"
)
func main() {
file, err := os.Open("somefile.dat")
if err != nil {
panic(err.Error())
}
defer file.Close()
reader := bufio.NewReader(file)
scanner := bufio.NewScanner(reader)
scanner.Split(bufio.ScanBytes)
for scanner.Scan() {
fmt.Println(scanner.Bytes())
}
if err := scanner.Err(); err != nil {
fmt.Println("error while reading :", err)
}
}
Most of the time, things should be fine and dandy. However, there will be cases when the program attempts to read in a very large file and it will throw out this error message :
"bufio.Scanner: token too long"
This error message was caused by the input token size that exceeded the bufio.MaxScanTokenSize
buffer size. The allocated size is MaxScanTokenSize = 64 * 1024
To read in large files, it is recommended to use bufio.Reader.ReadLine instead.
References :
See also : Golang : Scanf function weird error in Windows
By Adam Ng
IF you gain some knowledge or the information here solved your programming problem. Please consider donating to the less fortunate or some charities that you like. Apart from donation, planting trees, volunteering or reducing your carbon footprint will be great too.
Advertisement
Tutorials
+14.3k Golang : Read a file line by line
+8.5k Golang : Test a slice of integers for odd and even numbers
+7.4k Golang : Intercept and compare HTTP response code example
+18.3k Golang : How to force compile or remove object files first before rebuild?
+3.7k Golang : Print instead of building pyramids
+43.3k Golang : Marshal and unmarshal json.RawMessage struct example
+11.8k Swift : Convert (cast) Int to String ?
+27.6k Golang : Get and Set User-Agent examples
+13.2k Golang : Search folders for file recursively with wildcard support
+4.8k Golang : Experimenting with the Rejang script
+6k Golang : Example of how to detect which type of script a word belongs to
+17.8k Swift : Convert (cast) Int to int32 or Uint32