Wednesday, June 11, 2025

Benchmarking TensorFlow and TensorFlow Lite on Raspberry Pi 5



All the way back in 2019 I spent a lot of time looking at machine learning on the edge. Over the course of about six months I published more than a dozen articles on benchmarking the then-new generation of machine learning accelerator hardware that was only just starting to appear on the market, and gave a series of talks around the findings.

A lot has changed in the intervening years, but after getting a recent nudge I returned to my benchmark code and, after fixing some of the inevitable bit rot, ran it on the new Raspberry Pi 5.

Headline results from benchmarking

Running the benchmarks on the new Raspberry Pi 5 we see significant improvements in inferencing speed, with full TensorFlow models running almost ×5 faster than they did on the Raspberry Pi 4. We see a similar increase in inferencing speed when using TensorFlow Lite, with models again running almost ×5 faster than on the Raspberry Pi 4.

However, perhaps the more impressive result is that, while inferencing on Coral accelerator hardware is still faster than using full TensorFlow models on the Raspberry Pi 5, the new Raspberry Pi 5 offers comparable performance to the Coral TPU when using TensorFlow Lite, showing essentially the same inferencing speeds.

ℹ️ Information As with our earlier results for the Raspberry Pi 4, we used active cooling with the Raspberry Pi 5 to keep the CPU temperature stable and prevent thermal throttling of the CPU during inferencing.

The conclusion is that custom accelerator hardware may no longer be needed for some inferencing tasks at the edge, as inferencing directly on the Raspberry Pi 5 CPU, with no GPU acceleration, is now on a par with the performance of the Coral TPU.

ℹ️ Information The Coral hardware uses quantization in the same way TensorFlow Lite does to reduce the size of models. However, to use a TensorFlow Lite model with Edge TPU hardware there are a few extra steps involved. First you need to convert your TensorFlow model to the optimized FlatBuffer format used by TensorFlow Lite to represent graphs. Then you additionally need to compile your TensorFlow Lite model for compatibility with the Edge TPU using Google's compiler.
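
To give a feel for those extra steps, here is a rough sketch of the two-stage pipeline. The SavedModel path and the calibration data below are illustrative placeholders, not the exact models or commands used for these benchmarks:

# Sketch: convert a TensorFlow SavedModel to a fully int8-quantized
# TensorFlow Lite FlatBuffer, ready for the Edge TPU compiler.
# The model path and calibration data are placeholders.
import tensorflow as tf

def representative_dataset():
    # Yield a handful of typical inputs so the converter can calibrate
    # its quantization ranges; real calibration data would come from
    # the training or test set.
    for _ in range(100):
        yield [tf.random.uniform((1, 300, 300, 3))]

converter = tf.lite.TFLiteConverter.from_saved_model("my_saved_model")
converter.optimizations = [tf.lite.Optimize.DEFAULT]
converter.representative_dataset = representative_dataset
converter.target_spec.supported_ops = [tf.lite.OpsSet.TFLITE_BUILTINS_INT8]
converter.inference_input_type = tf.uint8
converter.inference_output_type = tf.uint8

with open("model.tflite", "wb") as f:
    f.write(converter.convert())

# The Edge TPU step then happens outside Python, using Google's compiler:
#   edgetpu_compiler model.tflite   ->   produces model_edgetpu.tflite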

Conclusion

Inferencing speeds with TensorFlow and TensorFlow Lite on the Raspberry Pi 5 are significantly improved over the Raspberry Pi 4. Moreover, the Raspberry Pi 5 now offers comparable performance to the Coral TPU.

Part I: Benchmarking

A more in-depth analysis of the results

In our original benchmarks we saw that the two dedicated boards, the Coral Dev Board from Google and the Jetson Nano Developer Kit from NVIDIA, were the best performing of our surveyed platforms. Of these two boards the Coral Dev Board ran significantly faster, with inferencing times around ×4 shorter than the Jetson Nano for the same machine learning model.

However, at the time the benchmarking results made me wonder whether we had gone ahead and started to optimize things in hardware just a little too soon.

The significantly faster inferencing times we saw then from models that made use of quantization, and the dominance of the Coral platform, which also relied on quantization to increase its performance, suggested that we should still be exploring software approaches before continuing to optimize accelerator hardware any further.

These results from benchmarking on the Raspberry Pi 5 seem to bear my original doubts out. It has taken four years for general-purpose CPUs to catch up with what was then best-in-class accelerator silicon. While newer NPU hardware is now available, and yes, I will be looking at that when I can, the Raspberry Pi 5 is now performant enough to keep up with inferencing on real-time video, and performs on a par with the Coral TPU.

While a new generation of accelerator hardware is now starting to become available, which may prove more performant, the Coral TPU is still seen as "best in class" and is currently in widespread use despite a lack of support from Google for their accelerator platform. These results suggest that for many use cases Coral hardware could be replaced, for a significant cost saving, by a Raspberry Pi 5 without any performance degradation.

Summary

Given the lack of support from Google for the pycoral library, where updates seem to have stopped in 2021 and the library no longer works with modern Python distributions, together with the difficulty of getting Coral TPU hardware to work with modern operating systems, the significant reduction in inferencing times we see on the new Raspberry Pi 5 is very welcome.

Part II: Methodology

About the benchmarking code

Benchmarking was done using TensorFlow or, for the hardware-accelerated platforms that do not support TensorFlow, their native framework, using the same models used on the other platforms converted to the appropriate native framework.

For the Coral Edge TPU-based hardware we used TensorFlow Lite, and for Intel's Movidius-based hardware we used their OpenVINO toolkit. Benchmarks were carried out twice on the NVIDIA Jetson Nano, first using vanilla TensorFlow models, and a second time using those models after optimization with NVIDIA's TensorFlow with TensorRT library.

Inferencing was carried out with the MobileNet v2 SSD and MobileNet v1 0.75 depth SSD models, both trained on the Common Objects in Context (COCO) dataset. A 3888×2916 pixel test image was used containing two recognizable objects in the frame, a banana 🍌 and an apple 🍎. The image was resized down to 300×300 pixels before being presented to the model, and each model was run 10,000 times before an average inferencing time was taken.

ℹ️ Information The first inferencing run, which can take up to ten times longer due to loading overheads, is discarded from the calculation of the average inferencing time.
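
To make the procedure concrete, the measurement loop looks roughly like the sketch below. It is simplified, and run_inference is a hypothetical stand-in for the framework-specific model call in the actual benchmark scripts:

# Minimal sketch of the measurement loop used by the benchmarks.
import time
import numpy as np
import cv2

# Load the test image and resize it to the model's 300x300 input size.
image = cv2.imread("fruit.jpg")
image = cv2.resize(image, (300, 300))
input_tensor = np.expand_dims(image, axis=0)  # add a batch dimension

timings = []
for i in range(10000):
    start = time.monotonic()
    run_inference(input_tensor)  # hypothetical stand-in for the model call
    elapsed = time.monotonic() - start
    if i == 0:
        continue  # the first (warm-up) run is discarded
    timings.append(elapsed)

print(f"average inference time: {np.mean(timings) * 1000:.2f} ms")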

While in the intervening years other benchmark frameworks have emerged that are arguably more rigorous, the benchmarks presented here are intended to reflect real-world performance. Many of the newer benchmarks measure the time to complete only the inferencing stage. While that is a much cleaner (and shorter) operation than the timings measured here, which include set-up time, most people aren't really interested in just the time it takes between passing a tensor to the model and getting a result. Instead, they want end-to-end timings.

One of the things these benchmarks do not do is optimization. They take an image, pass it to a model, and measure the result. The code is simple, and what it measures is close to the performance an average developer doing the same task might get, rather than an expert machine learning researcher who understands the complexities and limitations of the models, and how to adapt them to individual platforms and situations.

Setting up your Raspberry Pi

Go ahead and download the latest release of Raspberry Pi OS and set up your Raspberry Pi. Unless you're using wired networking, or have a display and keyboard attached to the Raspberry Pi, at a minimum you'll need to put the Raspberry Pi onto your wireless network and enable SSH.

Once you've set up your Raspberry Pi go ahead and power it on, then open up a Terminal window on your laptop and SSH into the Raspberry Pi.

ssh **@*********pi.local

Once you've logged in you can install TensorFlow and TensorFlow Lite.

⚠️ Warning Starting with Raspberry Pi OS Bookworm, packages installed via pip must be installed into a Python virtual environment. A virtual environment is a container where you can safely install third-party modules so they won't interfere with your system Python.

Installing TensorFlow on Raspberry Pi 5

Installing TensorFlow on the Raspberry Pi is much more complicated than it used to be, as there is no longer an official package available. Fortunately there is still an unofficial distribution, which at least means we don't have to resort to building and installing from source.

sudo apt install -y libhdf5-dev unzip pkg-config python3-pip cmake make git python-is-python3 wget patchelf
python -m venv --system-site-packages ~/.python-tf
source ~/.python-tf/bin/activate
pip install numpy==1.26.2
pip install keras_applications==1.0.8 --no-deps
pip install keras_preprocessing==1.1.2 --no-deps
pip install h5py==3.10.0
pip install pybind11==2.9.2
pip install packaging
pip install protobuf==3.20.3
pip install six wheel mock gdown
pip install opencv-python
TFVER=2.15.0.post1
PYVER=311
ARCH=`python -c 'import platform; print(platform.machine())'`
pip install --no-cache-dir https://github.com/PINTO0309/Tensorflow-bin/releases/download/v${TFVER}/tensorflow-${TFVER}-cp${PYVER}-none-linux_${ARCH}.whl
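
With the virtual environment still active, a quick sanity check (a suggested extra, not part of the original instructions) will confirm the wheel installed correctly:

# Run inside the ~/.python-tf virtual environment to verify the install.
import tensorflow as tf
print(tf.__version__)  # should report the 2.15 release installed above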

Installing TensorFlow Lite on Raspberry Pi 5

There is still an official TensorFlow Lite runtime package available for Raspberry Pi, so installation is much more straightforward than for full TensorFlow, where that option is no longer available.

python -m venv --system-site-packages ~/.python-tflite
source ~/.python-tflite/bin/activate
pip install opencv-python
pip install tflite-runtime
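
As before, a quick import check (again, a suggested extra) will confirm the runtime is available:

# Run inside the ~/.python-tflite virtual environment to verify the install.
from tflite_runtime.interpreter import Interpreter
print("tflite-runtime is available")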

Running the benchmarks

The benchmark_tf.py script is used to run TensorFlow benchmarks on Linux (including Raspberry Pi) and macOS. This script can also be used, with a TensorFlow installation that includes GPU support, on NVIDIA Jetson hardware.

source ~/.python-tf/bin/activate
./benchmark_tf.py --model PATH_TO_MODEL_FILE --label PATH_TO_LABEL_FILE --input INPUT_IMAGE --output LABELLED_OUTPUT_IMAGE --runs 10000

For example, on a Raspberry Pi, benchmarking with the MobileNet v2 model for 10,000 inference runs, the invocation would be:

./benchmark_tf.py --model ssd_mobilenet_v2/tf_for_linux_and_macos/frozen_inference_graph.pb --label ssd_mobilenet_v2/tf_for_linux_and_macos/coco_labels.txt --input fruit.jpg --output output.jpg --runs 10000

This will output an output.jpg image with the two objects (the banana and the apple) labelled.

The benchmark_tf_lite.py script is used to run TensorFlow Lite benchmarks on Linux (including Raspberry Pi) and macOS.

source ~/.python-tflite/bin/activate
./benchmark_tf_lite.py --model PATH_TO_MODEL_FILE --label PATH_TO_LABEL_FILE --input INPUT_IMAGE --output LABELLED_OUTPUT_IMAGE --runs 10000

⚠️ Warning Models passed to TensorFlow Lite must be quantized. To do so, the model must be converted to the TensorFlow Lite format.
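
For reference, a basic post-training quantization conversion might look something like the sketch below. Conversion needs the full TensorFlow package, so it would normally be done on a development machine rather than on the Pi, and the model path here is a placeholder; the GitHub repository already includes converted models:

# Sketch: dynamic-range post-training quantization of a SavedModel
# into the TensorFlow Lite FlatBuffer format. The path is a placeholder.
import tensorflow as tf

converter = tf.lite.TFLiteConverter.from_saved_model("my_saved_model")
converter.optimizations = [tf.lite.Optimize.DEFAULT]

with open("model.tflite", "wb") as f:
    f.write(converter.convert())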

Getting the benchmark code

The benchmark code is now available on GitHub. The repository includes all the resources needed to reproduce the benchmarking results, including models, code for all the tested platforms, and the test imagery used. There is also an ongoing discussion about how to improve the benchmark so that it can more easily be run on new hardware.

