Apple interview question

mplement a simple PyTorch neural network that takes a multimodal input