Highly immersive content in the form of extended reality (XR) is positioned as an alternative to conventional video services such as YouTube and Facebook. The virtual tour application is one of popular virtual reality (VR) services, allowing users to walk inside virtual spaces, beyond merely looking around. Many galleries and museums already offer online VR tours, and users are free to choose the spot they want to move to. Although the ease of implementation, this key-spot hopping is insufficient to provide the experience of walking in a space or on a street. Meanwhile, in recent volumetric or light-field-based studies, view rendering that supports free and continuous viewpoint movements has been attempted. With online services in mind, however, the high data volume and computational complexity are a concern here. One of compromise is path-walking VR, where the viewer can walk freely following the path taken by the camera. The interactive path-walking VR service is entry-level immersive video, but streaming over the network is still challenging. This paper proposes schemes to reduce streaming delays and thus enhance user experience satisfaction. The first is a caching strategy that reflects geometrical locality in virtual space. This not only reduces the communication delay from the server, but also decreases the burden of server-side request handling. The second scheme uses the relationship between the viewer's speed and the field of vision. The view quality is adjusted according to the viewer's speed and head direction. Experimental results show that the proposed schemes achieve stable viewer's experience by considering walking characteristics in virtual space. It is expected that the results of this paper will provide insight to those who design a B5G/6G systems for immersive media applications.