All articles tagged with TensorFlow
Cloud latency kills real-time AI. In the wake of the Schrems II ruling, moving inference to the edge isn't just about performance—it's about compliance. Here is the 2020 architecture for deploying quantized TensorFlow models on Norwegian infrastructure.
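As a taste of the approach covered in the article, here is a minimal post-training quantization sketch, assuming TensorFlow 2.x; the SavedModel path and output filename are placeholders for your own model artifacts.

```python
import tensorflow as tf

# Convert a SavedModel into a weight-quantized TFLite flatbuffer for edge
# inference. "./saved_model" and the output filename are placeholders.
converter = tf.lite.TFLiteConverter.from_saved_model("./saved_model")
converter.optimizations = [tf.lite.Optimize.DEFAULT]  # post-training quantization
tflite_model = converter.convert()

with open("model_quant.tflite", "wb") as f:
    f.write(tflite_model)
```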
Stop wrapping your Keras models in Flask. Learn how to deploy TensorFlow Serving via Docker on high-performance NVMe infrastructure for sub-100ms inference times while keeping your data compliant with Norwegian standards.
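For a flavour of the setup, here is a minimal client-side sketch against TensorFlow Serving's REST endpoint, assuming the stock `tensorflow/serving` container is already running on port 8501; the model name and input shape are placeholders.

```python
import time
import requests

# TF Serving REST endpoint, e.g. from:
#   docker run -p 8501:8501 -e MODEL_NAME=my_model tensorflow/serving
# "my_model" and the input vector are placeholders.
URL = "http://localhost:8501/v1/models/my_model:predict"
payload = {"instances": [[0.1, 0.2, 0.3, 0.4]]}

start = time.perf_counter()
resp = requests.post(URL, json=payload, timeout=1.0)
elapsed_ms = (time.perf_counter() - start) * 1000

print(resp.json()["predictions"], f"{elapsed_ms:.1f} ms")
```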
Stop burning budget on V100s for simple inference. We benchmark the new NVIDIA T4 against the Pascal generation and show you how to deploy mixed-precision models on Ubuntu 18.04 using nvidia-docker2.
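A rough sketch of the mixed-precision side, assuming TensorFlow 1.14+ running inside an nvidia-docker2 container; the optimizer and learning rate are placeholders for whatever your training graph uses.

```python
import tensorflow as tf

# Automatic mixed-precision graph rewrite (TF 1.14+): wrapping the optimizer
# casts eligible ops to FP16 for Tensor Cores and adds loss scaling.
# Model and hyperparameters are placeholders.
optimizer = tf.train.AdamOptimizer(learning_rate=1e-3)
optimizer = tf.train.experimental.enable_mixed_precision_graph_rewrite(optimizer)

# Alternatively, set TF_ENABLE_AUTO_MIXED_PRECISION=1 in the container
# environment to apply the same rewrite without code changes.
```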
Latency kills AI projects. We dissect CPU threading, TensorFlow 1.x configurations, and why NVMe storage is non-negotiable for production models in 2019.
It is 2017, and TensorFlow 1.0 has changed the game. But throwing a Titan X at your model is useless if your I/O is choking the pipeline. Here is how to architect a training stack that actually saturates the bus, while keeping everything on Norwegian infrastructure for data compliance.
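To illustrate the pipeline side, here is a minimal input-pipeline sketch using the tf.data API (which landed in core TensorFlow later in 2017, in 1.4); the TFRecord file pattern, feature schema, and batch size are placeholders.

```python
import tensorflow as tf

# Keep the GPU fed from fast local storage: parallel decode, shuffle,
# batch, and prefetch. Paths and feature names are placeholders.
def parse_example(serialized):
    features = tf.parse_single_example(
        serialized,
        {"image": tf.FixedLenFeature([], tf.string),
         "label": tf.FixedLenFeature([], tf.int64)})
    image = tf.decode_raw(features["image"], tf.uint8)
    return image, features["label"]

dataset = (tf.data.TFRecordDataset(tf.gfile.Glob("/data/train-*.tfrecord"))
           .map(parse_example, num_parallel_calls=8)  # decode on CPU in parallel
           .shuffle(10000)
           .batch(128)
           .prefetch(2))  # overlap disk/CPU work with GPU compute

images, labels = dataset.make_one_shot_iterator().get_next()
```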
Stop serving models with Flask. Learn how to deploy TensorFlow 1.0 candidate models using gRPC and Docker for sub-millisecond inference latency on Norwegian infrastructure.
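For a taste of the gRPC path, here is a minimal client sketch against a TensorFlow Serving container listening on port 8500, assuming the `tensorflow-serving-api` package; the model name, signature, and input tensor name are placeholders for whatever your exported SavedModel defines.

```python
import grpc
import tensorflow as tf
from tensorflow_serving.apis import predict_pb2, prediction_service_pb2_grpc

# gRPC client against TF Serving, e.g. started with:
#   docker run -p 8500:8500 -e MODEL_NAME=my_model tensorflow/serving
# "my_model", "serving_default", and "input" are placeholders.
channel = grpc.insecure_channel("localhost:8500")
stub = prediction_service_pb2_grpc.PredictionServiceStub(channel)

request = predict_pb2.PredictRequest()
request.model_spec.name = "my_model"
request.model_spec.signature_name = "serving_default"
request.inputs["input"].CopyFrom(
    tf.make_tensor_proto([[0.1, 0.2, 0.3, 0.4]], dtype=tf.float32))

response = stub.Predict(request, 1.0)  # 1-second deadline
print(response.outputs)
```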