Golang : Read large file with bufio.Scanner cause token too long error
There are couple of ways to read files with Golang and one of the common way to read in file is with bufio.NewScanner() function. For example :
package main
import (
"bufio"
"fmt"
"os"
)
func main() {
file, err := os.Open("somefile.dat")
if err != nil {
panic(err.Error())
}
defer file.Close()
reader := bufio.NewReader(file)
scanner := bufio.NewScanner(reader)
scanner.Split(bufio.ScanBytes)
for scanner.Scan() {
fmt.Println(scanner.Bytes())
}
if err := scanner.Err(); err != nil {
fmt.Println("error while reading :", err)
}
}
Most of the time, things should be fine and dandy. However, there will be cases when the program attempts to read in a very large file and it will throw out this error message :
"bufio.Scanner: token too long"
This error message was caused by the input token size that exceeded the bufio.MaxScanTokenSize
buffer size. The allocated size is MaxScanTokenSize = 64 * 1024
To read in large files, it is recommended to use bufio.Reader.ReadLine instead.
References :
See also : Golang : Scanf function weird error in Windows
By Adam Ng
IF you gain some knowledge or the information here solved your programming problem. Please consider donating to the less fortunate or some charities that you like. Apart from donation, planting trees, volunteering or reducing your carbon footprint will be great too.
Advertisement
Tutorials
+13k Golang : Linear algebra and matrix calculation example
+9.5k Golang : Turn string or text file into slice example
+6k Golang : Process non-XML/JSON formatted ASCII text file example
+4.6k Javascript : How to get width and height of a div?
+8.8k Golang : Handle sub domain with Gin
+8.8k Golang : Intercept and compare HTTP response code example
+11.8k Linux : How to install driver for 600Mbps Dual Band Wifi USB Adapter
+10.3k Golang : Allow Cross-Origin Resource Sharing request
+11.9k Golang : convert(cast) string to integer value
+15.4k Golang : Get current time from the Internet time server(ntp) example
+40.7k Golang : How to count duplicate items in slice/array?