Taking away the hand and bottle would leave a 'gap' in the image.
To fill that gap you'll need to replace the 'information' behind the hand and bottle.
This missing information needs to be replaced piece by piece with information available on the image.
For example:
Select a part of the bench, copy and past it
Every 'added' piece needs to be refined to match the picture.
With a picture where there's only a small part of information to replace the task is easy.
In this case it is still possible but with a huge amount of work and raffinement.
Sometimes it is easier to 'recreate' the scene without the disturbing element, if possible.