Nvidia:HardwareSelect
딥러닝 (Deep learning)을 위한 NVIDIA하드웨어 선택방법에 대하여 설명한다.
Hardware select
- Best GPU overall: GTX Titan X
- Cost efficient but expensive: GTX Titan X, GTX 980, GTX 980 Ti
- Cost efficient but troubled: GTX 580 3GB (lacks software support) or GTX 970 (has memory problem)
- Cheapest card with no troubles: GTX 960 4GB or GTX 680
- I work with data sets > 250GB: GTX Titan, GTX 980 Ti or GTX Titan X
- I have little money: GTX 680 3GB eBay
- I have almost no money: AWS GPU spot instance
- I do Kaggle: GTX 980 or GTX 960 4GB
- I am a researcher: 1-4x GTX Titan X
- I want to build a GPU cluster: This is really complicated, you can get some ideas here
- I started deep learning and I am serious about it: Start with one GTX 680, GTX 980, or GTX 970 and buy more of those as you feel the need for them; save money for Pascal GPUs in 2016 Q2/Q3 (they will be much faster than current GPUs)
결론은 아래와 같다.
- 돈이 충분하다면 GTX Titan X or GTX 980
- 돈이 부족하다면 GTX 960 or GTX 680
- 문제가 있지만 컨트롤할 수 있다면 GTX 970
GTX TitanX vs Tesla K40/K80
Specs:
- GeForce Titan X Features: 3072 Cores, 12GB Ram, 336GB/s, 384-bit bus, Boost Clock: 1075MHz, Core Clock 1000MHz, 7GHz DDR5, 6.2 TFLOPS SP
- Tesla K40 Features: 2880 Cores, 12GB Ram, 288 GB/s, 384-bit bus, Boost Clock: 810-875MHz, Core Clock: 745MHz, 6GHz GDDR5, 4.29 Tflops SP
Pros of Geforce Titan X:
- Cheap price
- Because there is graphics support you can visualize your processed data easily
- It has latest Maxwell architecture, whereas Tesla K40 has kepler architecture (Not sure but Nvidia might not release Maxwell architecture Tesla GPUs)
- More GPU cores than Tesla K40
Pros of Tesla K40:
- It is targeted for workstations and servers
- Tesla accelerators deliver the best cluster performance while jobs complete with 100-percent reliability and manageability
- More number of double precision units and hence improves double precision performance
- Some Tesla-exclusive features include:
- NVIDIA GPUDirect RDMA for InfiniBand performance
- Hyper-Q for MPI (Hyper-Q for CUDA Streams is supported on GeForce GTX TITAN)
- ECC protection for all internal and external registers and memories
- Supported tools for GPU and cluster management, such as Bright Computing, Ganglia.
Final word:
- You can decide what you want reliability or cost effectiveness
- Based on the specs for single precision operations Geforce Titan X will give more performance
- ECC support which is not available with GeForce Titan X will not hurt deep learning applications (that is my guess, you can verify more on this and let us all know)
See also
Favorite site
TL;DR advice